Traces
All Ops
Filter
Past 1w
inputs
Trace
Feedback
Status
frequency_penalty
max_completion_tokens
max_tokens
messages
model
n
presence_penalty
N/A
N/A
N/A
[{"role":"system","content":"You are a judge of performance. Repond in JSON with the following fields:\n * score: (boolean) a boolean\n * reason: (string) a short explanation of the score\n "},{"role":"user","content":"Given a user input, expected output, and model output, determine how correct the model output is. Exact match is not required.\nUser Input: Hello, how are you?\n---\nExpected Output: I'm good, thank you!\n---\nModel Output: Hi! I'm doing well—thanks for asking. How are yo...
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a judge of performance. Repond in JSON with the following fields:\n * score: (boolean) a boolean\n * reason: (string) a short explanation of the score\n "},{"role":"user","content":"Given a user input, expected output, and model output, determine how correct the model output is. Exact match is not required.\nUser Input: What are you?\n---\nExpected Output: I am a helpful AI assistant.\n---\nModel Output: I'm Kimi, your intelligent assistant and good ...
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a judge of performance. Repond in JSON with the following fields:\n * score: (boolean) a boolean\n * reason: (string) a short explanation of the score\n "},{"role":"user","content":"Given a user input, expected output, and model output, determine how correct the model output is. Exact match is not required.\nUser Input: What is the capital of France?\n---\nExpected Output: Paris\n---\nModel Output: The capital of France is **Paris**.\n---\nIs the mod...
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a helpful assistant."},{"role":"user","content":"What are you?"}]
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a helpful assistant."},{"role":"user","content":"What is the capital of France?"}]
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a helpful assistant."},{"role":"user","content":"Hello, how are you?"}]
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
0
N/A
32,768
[{"role":"system","content":"You are an AI assistant designed to assist users by providing clear, concise, and helpful responses."},{"role":"user","content":"Which company name is the combination of modern airplane propulsion technology and the thinking organ of humans?"}]
gemini/gemini-2.5-pro-preview-03-25
1
0
0
64,000
N/A
[{"role":"system","content":"You are an AI assistant designed to assist users by providing clear, concise, and helpful responses."},{"role":"user","content":"Which company name is the combination of modern airplane propulsion technology and the thinking organ of humans?"}]
gpt-5-2025-08-07
1
0
0
N/A
32,000
[{"role":"system","content":"You are an AI assistant designed to assist users by providing clear, concise, and helpful responses."},{"role":"user","content":"Which company name is the combination of modern airplane propulsion technology and the thinking organ of humans?"}]
openai/openai/gpt-oss-120b
1
0
0
N/A
500
[{"role":"system","content":"You are an AI assistant designed to assist users by providing clear, concise, and helpful responses."},{"role":"user","content":"Which company name is the combination of modern airplane propulsion technology and the thinking organ of humans?"}]
openai/Qwen/Qwen3-Coder-480B-A35B-Instruct
1
0
N/A
N/A
N/A
[{"role":"system","content":"You are a judge of performance. Repond in JSON with the following fields:\n * score: (boolean) a boolean\n * reason: (string) a short explanation of the score\n "},{"role":"user","content":"Given a user input, expected output, and model output, determine how correct the model output is. Exact match is not required.\nUser Input: What are you?\n---\nExpected Output: I am a helpful AI assistant.\n---\nModel Output: I’m Kimi, a large language model trained by Mo...
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a judge of performance. Repond in JSON with the following fields:\n * score: (boolean) a boolean\n * reason: (string) a short explanation of the score\n "},{"role":"user","content":"Given a user input, expected output, and model output, determine how correct the model output is. Exact match is not required.\nUser Input: Hello, how are you?\n---\nExpected Output: I'm good, thank you!\n---\nModel Output: Hello! I’m doing well—thanks for asking. How abo...
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
N/A
N/A
N/A
[{"role":"system","content":"You are a judge of performance. Repond in JSON with the following fields:\n * score: (boolean) a boolean\n * reason: (string) a short explanation of the score\n "},{"role":"user","content":"Given a user input, expected output, and model output, determine how correct the model output is. Exact match is not required.\nUser Input: What is the capital of France?\n---\nExpected Output: Paris\n---\nModel Output: The capital of France is Paris.\n---\nIs the model o...
openai/moonshotai/Kimi-K2-Instruct
N/A
N/A
1-32 of 32
Per page:
50