LLM Articles & Tutorials by Weights & Biases

Skip to main content

Contact Log In Sign Up

New

Popular

Prompt Engineering LLMs with LangChain and W&B

Mar 21

Articles, Intermediate, Large Models, NLP, Question Answering, GenAI, Classification, LLM

How Cohere Trains Business-Critical LLMs with the Help of W&B

Weights & Biases Case Studies

Feb 08

Articles, Intermediate, Large Models, NLP, Case Study, LLM

Clear Search

English

Evaluate your RAG pipeline using LLM as a Judge with custom dataset creation (Part 2)

Dec 03

Articles, GenAI, LLM, Evaluations, Financial

The Microsoft Agent Framework: Observability

Oct 01

Articles, Agents, LLM, GenAI

Tutorial: Run inference with Qwen3 235B A22B-2507 Instruct using W&B Inference

Sep 15

Articles, Inference, LLM, Weave

Tutorial: Running inference with OpenAI's GPT OSS 20B using W&B Inference

Sep 09

Articles, LLM, Inference

Tutorial: Running inference with Llama 3.1 8B using W&B Inference

Sep 09

Articles, Inference, LLM

Tutorial: Running inference with Qwen3 235B A22B Thinking-2507 using W&B Inference

Sep 09

Articles, LLM, Inference

Tutorial: Running inference with Zhipu AI's GLM-4.5 using W&B Inference

Sep 09

Articles, LLM, Inference

Tutorial: Running inference with Llama 3.3 70B using W&B Inference

Sep 04

Articles, LLM, Inference

How to evaluate the "true" context length of your LLM using RULER

Sep 03

Articles, LLM, GenAI, Evaluations

Tutorial: Running inference with Llama 4 Scout using W&B Inference

Aug 28

Articles, LLM, Inference

Weights & Biases supports BT Group with safe and effective AI deployment

Anthony Kolodynski

Aug 27

Articles, LLM

How to build research agents with W&B Weave and Tavily

Aug 25

Articles, LLM

Tutorials: GPT-5 evaluation across multiple tasks

Aug 13

Articles, GPT, OpenAI, LLM, Evaluations, Agents

A guide to LLM debugging, tracing, and monitoring

Aug 12

Articles, Community Posts, LLM, Weave

Tutorial: Running inference with Kimi K2 using W&B Inference

Aug 11

Articles, LLM, GenAI, Agents, Inference

Exploring LLM evaluations and benchmarking

Aug 11

Articles, Community Posts, LLM, Weave, GenAI, Evaluations

OpenAI GPT OSS models on W&B Inference

Chander Matrubhutam

Aug 06

Articles, Weave, LLM, OpenAI

Amazon Bedrock AgentCore observability guide

Jul 28

Articles, LLM, GenAI, Agents, Evaluations

What is RLHF? Reinforcement learning from human feedback for AI alignment

Jul 28

Articles, Reinforcement Learning, GenAI, LLM, Evaluations, Tutorial

Evaluating Google ADK Agents with W&B Weave for reliable insurance workflows

Brett Young, Christian Williams

Jul 17

Articles, GenAI, Evaluations, LLM, Framework / Integration

1 2 3...10

Popular Topics

Task

GenAI Agents Evaluations MLOps Fine-tuning All

Framework / Integration

Keras PyTorch HuggingFace GPT OpenAI All

Domain

Computer Vision Domain Agnostic NLP LLM Reinforcement Learning All

Iterate on AI agents and models faster. Try Weights & Biases today.

Sign up Try W&B now