Ai2 Releases New Language Models to Rival Meta’s Llama

A groundbreaking addition to AI technology has emerged with the release of a new language model family, one of the few designed to be fully reproducible from scratch.

On Tuesday, Ai2, the nonprofit AI research organization established by the late Microsoft co-founder Paul Allen, unveiled OLMo 2, the second generation in its OLMo (Open Language Model) series. While numerous “open” models like Meta’s Llama exist, OLMo 2 meets the stringent standards set by the Open Source Initiative (OSI) for open-source AI, ensuring that all tools and datasets used in its development are publicly accessible.

The Definition of Open Source AI

The Open Source Initiative, a longstanding authority on all things open source, finalized its definition for open-source AI in October. However, Ai2’s first OLMo models, launched in February, already adhered to these principles.

In a blog post, Ai2 emphasized the transparency of OLMo 2’s development, stating:
“OLMo 2 was created using open and accessible training data, open-source training codes, reproducible recipes, transparent evaluations, and intermediate checkpoints. By openly sharing our data, processes, and insights, we aim to empower the open-source community to explore innovative new approaches.”

Two Models: OLMo 7B and OLMo 13B

The OLMo 2 series introduces two models:

OLMo 7B with 7 billion parameters
OLMo 13B with 13 billion parameters

In AI, parameters are akin to neurons in a brain, determining a model’s problem-solving capabilities. Typically, models with more parameters exhibit superior performance.

Like other advanced language models, OLMo 2 7B and 13B can handle diverse text-based tasks, including answering questions, summarizing documents, and writing code.

The models were trained on an expansive dataset of 5 trillion tokens—the equivalent of approximately 3.75 trillion words. The dataset included curated high-quality web content, academic papers, Q&A forums, and both human-crafted and synthetic math workbooks.

Performance on Par with Meta’s Llama 3.1

Ai2 asserts that OLMo 2 matches or surpasses the performance of competitive open models like Meta’s Llama 3.1. The organization reported:
“Not only do we see dramatic improvements across all tasks compared to the earlier OLMo model, but notably, OLMo 2 7B outperforms Llama 3.1 8B.”

The OLMo 2 models are available for download on Ai2’s website under the Apache 2.0 license, permitting commercial use.

Safety Concerns in Open Models

The release of open-source models like OLMo has reignited debates about their potential misuse. For instance, Meta’s Llama models have reportedly been utilized by Chinese researchers for developing defense tools.

When questioned about such risks earlier this year, Ai2 engineer Dirk Groeneveld acknowledged the possibility but highlighted the overarching benefits:
“Yes, open models may be misused or exploited for unintended purposes. However, this approach fosters technical advancements that lead to more ethical models, ensures verification and reproducibility—possible only with full access—and helps decentralize power, enabling more equitable access.”

A Step Toward Open AI Progress

OLMo 2 demonstrates Ai2’s commitment to both innovation and transparency in AI. By setting a new standard for open-source development, the organization paves the way for researchers and developers to create AI models that are not only powerful but also ethical and accessible.