David Silva is CTO at Algemetric with the mission to deliver secure, collaborative and purpose-driven business intelligence for everyone.
A powerful collection of artificial intelligence (AI) technologies capable of generating virtually any kind of text, drawings, videos, code…you name it. This is generative AI (GAI), and its current capabilities are truly impressive. But with great generative power comes plenty of risks. Many users of GAI tools seem to ignore (or even worse, don’t know) that these tools make mistakes.
Despite warnings like “ChatGPT can make mistakes. Consider checking important information” and “Gemini may display inaccurate info, including about people, so double-check its responses,” some people use these tools for creating legal contracts, corporate policies, course curricula, protocols for treating health conditions, financial reports, commercial proposals, and software code, among many other examples, without having the competence, experience, and discernment to assess the accuracy and applicability of the generated material to each case. Needless to say, such practices can lead to catastrophic consequences.
Not surprisingly, some of these GAI tools are often used as a replacement for search engines. I once asked ChatGPT for a research paper I could use as a reference for the known risks of solely using anonymization and pseudonymization techniques for protecting sensitive information. ChatGPT provided the paper’s title, the authors’ names and affiliations, the conference the paper was presented at, and the paper’s abstract. I investigated the information I received and discovered that the paper didn’t exist. When I prompted ChatGPT that I was looking for a paper that existed in the real world, I got the response that since I was interested in something real, it would change the answer to accommodate the specifics of my requirements.
Providing false sources is just one common error from GAI tools. Other problems include failing trivial logic and mathematics rules, violating word limits, overlooking requirements from a long list, and not observing the context before providing an answer, among others. Now imagine what kind of problems one can face by treating as final any content produced by GAI tools.
The research paper episode shows us that the quality of the content produced by GAI tools heavily depends on the quality of the prompts they receive. The more complex the task, the longer and more elaborate the required prompt for a high-quality AI-generated response will be. In GAI lingo, a prompt is a specific input that requests the tool to generate a particular output type. Such information turned out to be so critical in the GAI world that we now have a profession named prompt engineering, which involves designing and refining prompts to improve the performance and accuracy of language models for specific tasks. This can include fine-tuning pre-trained models or developing new models from scratch.
Although GAI strives to emulate humans, it necessitates detailed instructions to generate an output of high quality and relevance. According to AWS, “In prompt engineering, you choose the most appropriate formats, phrases, words and symbols that guide the AI to interact with your users more meaningfully. Prompt engineers use creativity plus trial and error to create a collection of input texts, so an application’s generative AI works as expected.” It should be evident that good outcomes won’t be produced without a reasonable level of expertise in applicable areas.
Regardless of the GAI tool, I like to refer to them as “the intern.” And this is precisely how I treat them: as non-authoritative, non-experienced and non-discerning assistants. In other words, nothing GAI tools produce should ever be considered final. Instead, AI-generated content should always be scrutinized by people who combine these virtues: competence, experience and discernment. These are three interconnected yet distinct aspects that a seasoned professional in any area must display.
Experience brings the notion of “I have been there” and “I have done that,” which is often crucial to providing the best answer to any given question, considering details from applicable contexts. Competence is related to having the skills and knowledge required to execute a specific task properly. Discernment is critical for detecting wrong answers, avoiding common pitfalls, reducing unnecessary trial and error, and optimizing each effort.
Sam Altman, CEO of OpenAI (the creators of ChatGPT), remarks that “All repetitive human work that does not require a deep emotional connection between two people will all be done in the next couple decades better, cheaper and faster by AI.” Notice two things in Altman’s statement: First, we are not there yet. Second, AI will probably never fully replace the human factor. Time will tell.
GAI can be very useful for speeding up routine tasks, helping identify errors and quickly executing repetitive, well-defined standard procedures. The generated content must make sense in a broader context than a single task. AI can easily miss that.
I do not recommend that anyone learn anything from GAI tools solely, as they can be great assistants but terrible teachers. As per the current status of the most advanced AI tools, they are unsuitable for replacing the subject matter expert, just like an intern cannot replace an experienced professional. It should be literally the other way around: GAI tools must learn from you.
Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?