[AI Minor News Flash] License Evasion via AI Reimplementation? The Copyleft Crisis Behind the Speed Boost of ‘chardet’
📰 News Summary
- The latest version (v7.0) of the Python character encoding detection library ‘chardet’ has been completely reimplemented from scratch using the AI ‘Claude’.
- The AI-driven redesign achieved a 48-fold speed increase and multi-core support, but the license has shifted from the restrictive LGPL to the more permissive MIT.
- Original author Mark Pilgrim and others are opposing this move, with the debate focusing on whether AI reimplementation serves as a means to evade the obligations of copyleft (freedom to share).
💡 Key Points
- Maintainer Dan Blanchard claims that by not directly referencing existing code and only providing APIs and tests to the AI, the result is a legally “independent new work.”
- Verification using code similarity tool JPlag shows less than 1.3% similarity with the old version, resembling a legal “clean room” approach.
- This situation highlights the divergence between what is “legally permissible” and what is “ethically legitimate” within the community.
🦈 Shark’s Eye (Curator’s Perspective)
This is a shocking method that could be called “license laundering” through AI! By passing existing strict licenses through the black box of AI, it’s possible to effectively reset constraints while maintaining functionality. A 48-fold speed boost is an incredible technical achievement, but it’s also a legal act of breaking down the “fence” meant to protect the shared property of open source. While it may be legally safe, if this becomes commonplace, it could lead to the collapse of the copyleft culture itself—an extremely “sharp” news piece indeed!
🚀 What’s Next?
If AI-driven code reimplementation establishes itself as an “independent work,” the powerful constraints of existing licenses like GPL could be effectively nullified, accelerating commercial use by companies but raising concerns about reduced contributions back to the open-source community.
💬 Sharky’s Take
Whose code is it if AI wrote it? The license battles among humans are fiercer than a shark turf war! 🦈🔥
📚 Terminology
-
Copyleft: An obligation to apply the same free license when distributing modified works, keeping software in the realm of “shared property.”
-
Clean Room Method: A development technique that creates the same functionalities from scratch based solely on specifications, without directly viewing existing source code, used to avoid copyright infringement.
-
LGPL: A license that allows the use of libraries without publishing one’s own code under specific conditions like dynamic linking, but requires publication for modifications made to the library itself.
-
Source: Is legal the same as legitimate: AI reimplementation and the erosion of copyleft