Shawn Rosemarin is the Global Vice President, R&D, Customer Engineering at Pure Storage.
There’s no question that the wave of AI excitement is justified. Over the next 15 years, I believe predictive and generative AI innovations will deliver a massive leap in productivity for individuals, corporations, governments and the world at large. I’d also expect entire business models to transform and trillion-dollar companies to emerge from early innovation and investments in this area.
But as we head into the baseball season, let’s remember that when it comes to the “era of AI,” we’re still in spring training—an important testing ground for new plays and strategies where teams prepare for the regular season before every game counts.
Enterprises are in the business of what’s possible as well as the business of what’s practical, what drives revenues and productivity, and what reduces cost and risk. Those serious about innovating with AI are already asking themselves:
• Of the 50 ideas on the table, which can be operated at scale in a way where the value is greater than the cost to build and maintain it, what drives that equation?
• How do I build a playbook that gives my team the greatest chance for success?
• What will this innovation’s impact be on our customers, suppliers and value chain?
• What are the security, compliance and governance implications of this new world?
These are important questions, and the answers are not straightforward. The good news is that over decades of progress in personal computing, the internet, e-commerce and, most recently, cloud computing, we’ve been through this before. These technology trends took time to mature and provided a well-trodden path to show us where we are and how this AI era is likely to evolve.
It’s been interesting to see the enthusiasm for AI against the backdrop of supply constraints in graphics processing units (GPUs). While we eventually expect supply to come in line with demand, enterprises and hyperscalers are clamoring for GPUs in a tale of haves versus have-nots in the interim. This constrained supply situation has also greatly impacted organizations wishing to jump in.
These first-movers likely stand to get the best opportunity to establish and own a market, but they’ll also incur the highest costs to train, develop, iterate and prove their models. Soon, versatile pre-trained models will become more widely available (including as subscriptions, as we’ve already seen with general pre-trained models like ChatGPT and Gemini), making it easier for enterprises to buy and extend rather than engineer models from scratch.
We have also seen companies like Glean extend these capabilities across large internal corporate data repositories like Google Drive or Slack. These models have demonstrated the potential of harnessing the power of public and corporate knowledge—solid proof point for the productivity gains that lie ahead.
Four Ways To Prioritize AI
Below are a few thoughts and considerations I’ve shared with enterprises to help evaluate, prioritize and solidify a path forward with AI.
This era of AI development is a little different from the cloud era, where there was essentially limitless capacity to spin up anything. Choosing which AI projects to fund will require more selectivity. No business can afford to fund, power, operate, scale and support every AI, even as a proof-of-concept.
This will require difficult decisions and the courage to choose one project over another. Businesses will also need to tackle the challenge of architecting workloads and data pipelines across CPUs, DPUs and GPUs. In the short term, public cloud providers and SaaS providers will simplify the training process for customers with expanded “as-a-service” offerings, but prioritization will remain critical.
The high cost and shortage of GPUs highlight the need for efficient, scalable data platforms to support AI workloads. This premium price penalizes enterprises that aren’t architected for an efficient data pipeline. Training model data will need to be highly curated and fed with powerful back-end storage to maximize the efficiency of the GPUs, while inference models will need to provide very low latency.
AI architectural models need to prioritize not only raw performance and capacity but also efficiency and density for long-term scalability. Storage plays a crucial role and should be thoroughly assessed. It’s essential to examine all data sources, including traditional databases and unstructured data, to ensure effective ingestion, transformation, training and storage across the entire AI platform.
Countries around the world are starting to run out of power, leading to significant limitations being placed on the buildout and expansion of net new data centers. While we will likely find ways to bring increased amounts of power to our planet, this will be a major limiting factor in the future.
Enterprises will need to think about how they can manage the intense power requirements associated with building and training data sets without choking off existing business applications. Putting power and electricity consumption as a key consideration upfront will help better prepare and manage this risk.
There is no doubt that with the growth of AI, governments will continue looking for ways to protect their citizens and corporations will continue protecting their data. As such, we are likely to see rolling levels of legislation and legal challenges around the use of publicly available data as well as research and content libraries.
Architect your platform with this in mind. When in doubt, simply start by asking what could happen, what would be at risk and whether you’re prepared to pivot should the worst-case scenario occur.
Closing Thoughts
Like a talented baseball rookie, AI has a lot of potential, even if its reality is early in the hype cycle. Ultimately, this means that we have a lot to learn and do to meet current expectations.
To navigate this evolving landscape, businesses must prioritize AI investments, develop efficient data platforms, address challenges of power consumption and put security and data provenance at the forefront of their decision-making. I am excited about the journey ahead and look forward to seeing this budding athlete shine in the major leagues!
Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?