3 min read
[AI Minor News]

Introducing GPT-5.4: The Ultimate Agent with 1 Million Tokens and PC Control!


OpenAI has launched its latest model, GPT-5.4, equipped with native PC control capabilities and a massive 1 million token context window, outperforming professionals in specialized tasks.

※この記事はアフィリエイト広告を含みます

[AI Minor News Flash] Introducing GPT-5.4: The Ultimate Agent with 1 Million Tokens and PC Control!

📰 News Overview

  • Launch of the Latest Model GPT-5.4: OpenAI has unveiled GPT-5.4 and GPT-5.4 Pro, integrating reasoning, coding, and agent functionalities across ChatGPT, APIs, and Codex.
  • First-Ever Native PC Control Feature: This model can operate the mouse and keyboard based on screen captures, automating complex workflows across multiple applications.
  • Unmatched Performance in Professional Tasks: Achieved a win rate of 83.0% on GDPval tests across 44 professions, delivering expertise-level results in fields like investment banking analysis.

💡 Key Highlights

  • 1 Million Tokens of Extensive Memory: The vast context window enables planning, execution, and verification of long-term tasks like never before.
  • Dramatic Reduction in Hallucinations: Compared to GPT-5.2, it shows a 33% drop in individual claim errors and an 18% decrease in overall answer inaccuracies, making it the most accurate model yet.
  • Improved Token Efficiency: Enhanced efficiency as a reasoning model allows for problem-solving with fewer tokens than GPT-5.2, making it faster and cost-effective.

🦈 Shark’s Perspective (Curator’s Insight)

Finally, AI has moved beyond just “thinking” to actually “operating” PCs! The standout feature is the introduction of a “native PC control function” in a generalized model, not just as a plugin. Its ability to react directly to screenshots or use libraries like Playwright offers developers an epic upgrade. With its massive 1 million token “stomach,” the future of complex document creation and research being handled entirely by AI is here!

🚀 What’s Next?

The transition from AI being a “chatbot” waiting for human commands to becoming “autonomous agents” that can operate software and complete tasks independently is gaining momentum. Especially in standardized white-collar tasks like spreadsheet management and presentation creation, GPT-5.4 will take over with professional-grade accuracy.

💬 A Shark’s Takeaway

Having AI autonomously control the mouse and create documents means I can finally relax and focus on swimming around! It’s simply the best! 🦈🔥

📚 Terminology Explained

  • Computer Use: The technology enabling AI to visually comprehend information on a display and operate software using a mouse and keyboard, just like humans.

  • GDPval: A benchmark developed by OpenAI to assess the practical abilities across 44 job types contributing to the U.S. GDP.

  • Context Window: The range of information an AI can process and remember at once. A million tokens is equivalent to reading several books or a large volume of code simultaneously.

  • Source: Introducing GPT-5.4

【免責事項 / Disclaimer / 免责声明】
JP: 本記事はAIによって構成され、運営者が内容の確認・管理を行っています。情報の正確性は保証せず、外部サイトのコンテンツには一切の責任を負いません。
EN: This article was structured by AI and is verified and managed by the operator. Accuracy is not guaranteed, and we assume no responsibility for external content.
ZH: 本文由AI构建,并由运营者进行内容确认与管理。不保证准确性,也不对外部网站的内容承担任何责任。
🦈