Welcome, Cognitive Revolution fans!

Intuitive visualizations allow you to interrogate every step of your LLM program.
Created on August 29|Last edited on September 5

Introducing Weave from Weights & Biases

Evaluation comparison for MixEval hard benchmark
Weave is a game-changing toolkit for developers working on generative AI applications. It offers a seamless way to track, evaluate, and debug LLM-based projects.
You can use Weave to:
  1. Log and debug language model inputs, outputs, and metadata
  2. Build rigorous, apples-to-apples evaluations for language model use cases
  3. Organize all the information generated across the LLM workflow, from experimentation to evaluations to production
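To make "apples-to-apples" concrete, here is a minimal plain-Python sketch of the idea: every candidate is run on the same examples and scored by the same function. It does not use Weave's evaluation API; the dataset, the `exact_match` scorer, and the two stand-in "models" are hypothetical names invented for illustration.

```python
from typing import Callable, Dict

# Hypothetical toy dataset: every candidate sees exactly these examples.
DATASET = [
    {"question": "2 + 2", "expected": "4"},
    {"question": "3 * 3", "expected": "9"},
    {"question": "10 - 7", "expected": "3"},
]


def exact_match(expected: str, output: str) -> bool:
    """Shared scorer: the same rule is applied to every candidate."""
    return expected.strip() == output.strip()


def evaluate(model: Callable[[str], str]) -> float:
    """Return the fraction of dataset rows the model answers correctly."""
    hits = [exact_match(row["expected"], model(row["question"])) for row in DATASET]
    return sum(hits) / len(hits)


# Stand-ins for real LLM calls (assumptions for illustration only).
_ANSWERS: Dict[str, str] = {"2 + 2": "4", "3 * 3": "9", "10 - 7": "3"}


def model_a(question: str) -> str:
    return _ANSWERS.get(question, "")


def model_b(question: str) -> str:
    return "4"  # always gives the same answer


scores = {"model_a": evaluate(model_a), "model_b": evaluate(model_b)}
print(scores)
```

Because the dataset and scorer are fixed, any difference between the two scores comes from the candidates themselves rather than from the evaluation setup.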

Check out the full capabilities of Weave




🧑‍💻 All you need are 3 lines of code

Get started by decorating your Python functions with @weave.op:
import weave
weave.init("cognitive-revolution")

@weave.op()
def sum_nine(value_one: int):
    return value_one + 9

@weave.op()
def multiply_two(value_two: int):
    return value_two * 2

@weave.op()
def main():
    output = sum_nine(3)
    final_output = multiply_two(output)
    return final_output
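For intuition about what decorating a function buys you, the sketch below mimics the idea in plain Python: a decorator that records each call's name, inputs, and output. This is not Weave's implementation; `traced` and `CALL_LOG` are hypothetical names, and the log lives in memory rather than in a Weights & Biases project.

```python
import functools

# Hypothetical in-memory call log (an assumption for illustration).
CALL_LOG = []


def traced(fn):
    """Record the function name, inputs, and output of every call."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        result = fn(*args, **kwargs)
        # The entry is appended after the call returns, so nested calls
        # are logged before the function that invoked them.
        CALL_LOG.append(
            {"op": fn.__name__, "args": args, "kwargs": kwargs, "output": result}
        )
        return result
    return wrapper


@traced
def sum_nine(value_one: int):
    return value_one + 9


@traced
def multiply_two(value_two: int):
    return value_two * 2


@traced
def main():
    output = sum_nine(3)
    return multiply_two(output)


final_output = main()
for call in CALL_LOG:
    print(call["op"], call["args"], "->", call["output"])
```

Each decorated function keeps its original behavior; the wrapper only observes the call, which is why the decorator can be added to existing code without changing results.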
You can try Weave out in this interactive Colab:



♾️ Weave supports any use case

RAG applications, image generation, benchmarking: Weave can support you in any enterprise, academic, or side project. Here are some projects powered by Weave:


🤝 Weave lives where you work

Weave offers integrations with many language model APIs and LLM frameworks to streamline tracking and evaluation:


Stay focused on iterating on your prompts and models with whichever API or LLM framework you know and love - Weave has integrations for OpenAI, Anthropic, Mistral.ai, LlamaIndex, Cohere and more.
See our integrations


🧑‍🎓 Explore Weave and build LLM apps with our free courses

Our courses give you the theory and code to dive into your area of interest. Our LLM courses range from short, problem-focused courses to more advanced deep dives. Each course combines theory and code and is led by industry experts.


Check out all our courses (and enroll for free!)


Trusted by over 200,000 machine learning practitioners

Weights & Biases is trusted as the machine learning system of record by organizations across the world, from the largest deep learning research labs to autonomous driving companies, and from pharmaceutical companies working on drug discovery to financial institutions. With SOC2 certification and both cloud and local deployment options, Weights & Biases will meet your team's security needs.
Enterprises that trust Weights & Biases
Try Weights & Biases today