Open source AI is already finding its way into production
GitHubOpen source AI models are in widespread use, enabling developers around the world to build custom AI solutions and host them where they choose.
Open source AI models are in widespread use, enabling developers around the world to build custom AI solutions and host them where they choose.
While debates about the definition of Open Source AI continue, with the Open Source Initiative (OSI) recently publishing its first draft, this ambiguity hasn’t slowed the adoption of modern AI models.
There are two broad kinds of licenses that meet the formal open source definition as laid out by the Open Source Initiative (OSI).
In October 2024, the Open Source Initiative established a definition for open-source AI. According to the initiative, open-source AI has four key elements: the freedom to use, study, modify, and share the system for any purpose.
The Open Source Initiative (OSI) defines what open source is. For a model to truly be open source, its license should allow, among other things, the free redistribution and creation of derivative works, for example, by fine-tuning the model. If this permission is not granted, it’s not open source.
While the details are still being worked out, there can be no question whatsoever that AI and open source will continue to work together.
Mozilla is actively working for more open source standards and community. We explored the progress the industry has made in our Trustworthy AI Report and partnered with others to call for standards ahead of the Open Source Initiative (OSI)’s creation of the first Open Source AI definition.
Technically, Meta’s Llama large language model does not meet the definition of open source as defined by the Open Source Initiative in a definition updated only a month ago to grapple with a wave of AI models.
While there’s no shortage of “open” language models to choose from (see: Meta’s Llama), OLMo 2 meets the Open Source Initiative’s definition of open source AI, meaning the tools and data used to develop it are publicly available. The Open Source Initiative, the long-running institution aiming to define and “steward” all things open source, finalized its open source AI definition in October. But the first OLMo models, released in February, met the criterion as well.
The company points out that Tülu 3 and Ai2’s other models are fully open source, noting that big model trainers like Anthropic and Meta, who claim to be open source, have “none of their training data nor training recipes are transparent to users.” The Open Source Initiative recently published the first version of its open-source AI definition, but some organizations and model providers don’t fully follow the definition in their licenses.
Mer Joyce: I became involved in open source AI by being asked to lead co-design for OSI’s Open Source AI Definition. Like the Open Source Definition before it, the Open Source AI Definition will have a global impact. It needs to be based on global input. And my co-design firm, Do Big Good, does that work.
Unfortunately for them, though, guidelines published last week by the Open Source Initiative (OSI), an American non-profit, have suggested that the modern use of the term by tech giants has become stretched into meaninglessness. Burdened with restrictions and developed in secrecy, these free products are never going to power a true wave of innovation unless something changes, the OSI says. It is the latest salvo in a lively debate: what does open source really mean in the age of AI?