Skip to main content

Meta Releases LLaMA 3.1 405B: Smarter than GPT-4o?

Meta's newest model!
Created on July 23|Last edited on July 23
Meta has unveiled Llama 3.1, its most advanced large language model to date, continuing its commitment to making AI more accessible through open-source initiatives. Llama 3.1 features expanded capabilities, including a significantly longer context length of 128K, support for eight languages, and the introduction of the Llama 3.1 405B model, a groundbreaking frontier-level AI model.

Llama 3 model upgrades and capabilities

Llama 3.1 includes upgraded 8B and 70B models, now with enhanced multilingual capabilities and longer context lengths. This enables advanced applications such as long-form text summarization, multilingual conversational agents, and coding assistants. Meta's updated licensing allows developers to use outputs from Llama models to enhance other models, reinforcing its commitment to open-source principles.

Llama 3.1 performance and evaluation

The evaluation of Llama 3.1 involved over 150 benchmark datasets and extensive human evaluations, revealing that the flagship model competes well with leading models like GPT-4 and Claude 3.5 Sonnet across a variety of tasks. The new model architecture, a standard decoder-only transformer, was trained on over 15 trillion tokens using 16,000 H100 GPUs.
Llama 3 8B is Llama 3.1

Post-training and fine-tuning

Post-training, Llama 3.1 employs several 'rounds' of alignment and fine-tuning, using synthetic data generation to enhance its capabilities. This iterative approach ensures the model remains highly responsive to user instructions and maintains high safety standards. The model also supports large-scale production inference, with quantization techniques reducing compute requirements.

The Llama system

The Llama system integrates various components, such as Llama Guard 3 for multilingual safety and Prompt Guard for prompt injection filtering, all designed to work seamlessly with the core model. Meta is also introducing the Llama Stack API, a standardized interface intended to facilitate easier integration and interoperability within the AI ecosystem. This initiative is supported by a wide range of industry partners, ensuring robust ecosystem support from day one.

Meta's open-source approach and developer empowerment

Meta's open-source approach allows developers to customize and deploy Llama models across various environments, promoting a more equitable distribution of AI technology. The Llama 3.1 release is expected to inspire a new wave of AI applications and research, driving advancements in model distillation and large-scale inference.

The Llama 3 ecosystem Support and Deployment

By offering comprehensive support through partners like AWS, NVIDIA, and Databricks, Meta ensures that developers can leverage the full potential of Llama 3.1. The collaboration with community projects like vLLM, TensorRT, and PyTorch further enhances the model's deployment readiness.
The Announcement: https://llama.meta.com/
Tags: ML News
Iterate on AI agents and models faster. Try Weights & Biases today.