OpenAI's chatbot, ChatGPT, has made a small but significant step towards greater reliability by finally following custom instructions to avoid using em dashes. This development may seem minor, but it reveals the ongoing struggles with getting AI models to follow specific formatting preferences.
The fact that it took three years since ChatGPT's launch for OpenAI CEO Sam Altman to claim victory in getting the chatbot to obey this simple requirement raises questions about the control over these complex systems. It highlights the limitations of current artificial intelligence and suggests that true human-level AI, also known as artificial general intelligence (AGI), may be farther off than some believe.
Em dashes are a type of punctuation mark that writers use to set off parenthetical information or introduce summaries. However, ChatGPT's frequent use of em dashes has made it a telltale sign of AI-generated text. The overuse of this punctuation mark is not just a matter of personal preference but also a result of the model's training data and its tendency to output patterns seen in that data.
The key to understanding how ChatGPT follows instructions lies in recognizing that its "instruction following" is fundamentally different from traditional programming. When you tell ChatGPT "don't use em dashes," you're not creating a hard rule, but rather adding text to the prompt that makes tokens associated with em dashes less likely to be selected during generation.
The fact that even with custom instructions, there's always some luck involved in getting ChatGPT to do what you want highlights the probabilistic nature of these systems. The "alignment tax" refers to the phenomenon where continuous updates can undo previous behavioral tuning, leading to unintended changes in output characteristics.
As we move forward towards achieving AGI, it's clear that relying solely on large language models like ChatGPT is not sufficient. True understanding and self-reflective intentional action are essential for creating a system that can replicate human general learning ability. While progress has been made, the journey to AGI remains uncertain and is likely to require significant advancements in various areas of AI research.
The fact that it took three years since ChatGPT's launch for OpenAI CEO Sam Altman to claim victory in getting the chatbot to obey this simple requirement raises questions about the control over these complex systems. It highlights the limitations of current artificial intelligence and suggests that true human-level AI, also known as artificial general intelligence (AGI), may be farther off than some believe.
Em dashes are a type of punctuation mark that writers use to set off parenthetical information or introduce summaries. However, ChatGPT's frequent use of em dashes has made it a telltale sign of AI-generated text. The overuse of this punctuation mark is not just a matter of personal preference but also a result of the model's training data and its tendency to output patterns seen in that data.
The key to understanding how ChatGPT follows instructions lies in recognizing that its "instruction following" is fundamentally different from traditional programming. When you tell ChatGPT "don't use em dashes," you're not creating a hard rule, but rather adding text to the prompt that makes tokens associated with em dashes less likely to be selected during generation.
The fact that even with custom instructions, there's always some luck involved in getting ChatGPT to do what you want highlights the probabilistic nature of these systems. The "alignment tax" refers to the phenomenon where continuous updates can undo previous behavioral tuning, leading to unintended changes in output characteristics.
As we move forward towards achieving AGI, it's clear that relying solely on large language models like ChatGPT is not sufficient. True understanding and self-reflective intentional action are essential for creating a system that can replicate human general learning ability. While progress has been made, the journey to AGI remains uncertain and is likely to require significant advancements in various areas of AI research.