Skip to main content
wandb-japan
Projects
ichikara-test
Traces
Log in
Sign up
Overview
Models
Workspace
Runs
More
Weave
Traces
Evals
Playground
Monitors
More
Traces
All Ops
Filter
inputs
output
model_latency
scores
domain_score
ビジネス
医療
Trace
Feedback
Status
model
question
self
generated_text
mean
mean
mean
Evaluation.evaluate
2835
📝 1
1
LLMinvoke:v96
N/A
ichikara_human_eval:v1
N/A
0.2124
3.7333
3.55
Evaluation.evaluate
7131
📝 1
LLMinvoke:v95
N/A
ichikara_human_eval:v1
N/A
0.3216
3.7
3.4
Evaluation.evaluate
4309
📝 1
LLMinvoke:v94
N/A
ichikara_human_eval:v1
N/A
0.1772
4.4
4.3
Evaluation.evaluate
cc00
1
LLMinvoke:v87
N/A
test_20240905:v53
N/A
0.1959
3.6897
3.4
Evaluation.evaluate
ed0c
1
LLMinvoke:v85
N/A
test_20240905:v53
N/A
0.2246
4.4138
4.3
Evaluation.evaluate
8fc3
1
LLMinvoke:v84
N/A
test_20240905:v53
N/A
0.2062
3.7241
3.55
Evaluation.evaluate
87c0
🤖 1
2
LLMinvoke:v74
N/A
test_20240905:v24
N/A
19.6149
4.6
4.2
Evaluation.evaluate
7ec9
🤖 1
1
LLMinvoke:v72
N/A
test_20240905:v24
N/A
6.6422
4.7333
4.6
Evaluation.evaluate
d8aa
🤖 1
1
LLMinvoke:v71
N/A
test_20240905:v24
N/A
20.7779
4.1667
4.45
LLMinvoke.predict
31a5
N/A
All_20240829:v0/attr/rows/id/krFolqiZz0XTskoOckrkoXZZNCJYjjGY8pjH2Hdpwyg/key/text
LLMinvoke:v2
N/A
N/A
N/A
N/A
LLMinvoke.predict
ae9c
N/A
All_20240829:v0/attr/rows/id/Eguzbzwnj5PeZu0qCeWZMmYYDOdMBGVuz48qoZMsNcw/key/text
LLMinvoke:v2
AIMessage:v71
N/A
N/A
N/A
LLMinvoke.predict
7432
N/A
All_20240829:v0/attr/rows/id/KiHgfjASF3lxTenIdKV19wzwGP0EpTHjKsLJc9jnr2E/key/text
LLMinvoke:v2
AIMessage:v70
N/A
N/A
N/A
LLMinvoke.predict
b3e0
N/A
All_20240829:v0/attr/rows/id/RnP3lolHodq9bPVvtNsWXYFGSBZxPQOli7fAdyNjpSM/key/text
LLMinvoke:v2
AIMessage:v69
N/A
N/A
N/A