Ayut's group workspace
auto-eval
Tags
dazzling-thunder-125
Notes
Author
State
Finished
Start time
January 18th, 2024 4:46:42 PM
Runtime
16s
Tracked hours
11s
Run path
wandbot/wandbot-eval/70d3hnau
OS
Linux-6.5.0-14-generic-x86_64-with-glibc2.35
Python version
3.10.13
Git repository
git clone git@github.com:wandb/wandbot.git
Git state
git checkout -b "dazzling-thunder-125" bede5132ed751fed68108e95f56f64085f924146
Command
<python with no main file>
System Hardware
CPU count | 10 |
Logical CPU count | 20 |
GPU count | 1 |
GPU type | NVIDIA GeForce RTX 3060 |
W&B CLI Version
0.16.0
Group
auto-eval
Job Type
eval-analysis
Config
Config parameters are your model's inputs.
No config parameters were saved for this run.
Check the configuration documentation for more information.
Summary
Summary metrics are your model's outputs.
33 keys, each of type "plotly-file":
Path | SHA-256 | Size |
media/plotly/Answer Correctness Barplot_24_e1bd0ee1af1f2494627b.plotly.json | e1bd0ee1af1f2494627b3d2d8a2ba59d836867c86f9a9225de8387ffcbef316d | 8,440 |
media/plotly/Answer Correctness Histogram_3_f8d1002f4618e187cd7f.plotly.json | f8d1002f4618e187cd7f3e9a280e29e4c4aa61a4ab2921dda8145ed40467b004 | 10,626 |
media/plotly/Answer Correctness Score Heatmap_0_4c4ae295de0d60e637de.plotly.json | 4c4ae295de0d60e637deb25ac7139a06d79498dcdce446bf0660090af91f083c | 8,554 |
media/plotly/Answer Correctness vs Latency_16_5e1d44b881f8b9d96e0e.plotly.json | 5e1d44b881f8b9d96e0ef28d947e0da334afc9d84a53f5391960f957adf80fed | 14,959 |
media/plotly/Answer Faithfulness Barplot_25_d27944e5fc85c9ac6931.plotly.json | d27944e5fc85c9ac693119b67799751e5c23de7336d2b184b1239cbbd6a292a0 | 8,445 |
media/plotly/Answer Faithfulness Histogram_4_2f9a5ed4a3198980e132.plotly.json | 2f9a5ed4a3198980e13249a196e3b01616c1382f8c67255036661e1b349e9c48 | 10,626 |
media/plotly/Answer Faithfulness Score Heatmap_1_2f85e4deaac91641a4fb.plotly.json | 2f85e4deaac91641a4fb86f8ccbb63911c6ce74080baf6822ce6057a966c2693 | 8,557 |
media/plotly/Answer Faithfulness vs Latency_17_b026e13648282c989f50.plotly.json | b026e13648282c989f502748b62fd0aadba846fe72504a0e265c03fa4e479095 | 14,964 |
media/plotly/Answer Relevancy Barplot_26_c26ccc1a9a811456b0dc.plotly.json | c26ccc1a9a811456b0dc06866223a8bae70652e5778e662585b4ff4d4095cafb | 8,431 |
media/plotly/Answer Relevancy Histogram_5_baaf5e0a26a52c7fe946.plotly.json | baaf5e0a26a52c7fe946ab52783e33db178f025259a9855cfa40bfd77db2624f | 10,626 |
media/plotly/Answer Relevancy Score Heatmap_2_22b3ad6e866b6c5abc77.plotly.json | 22b3ad6e866b6c5abc776f31ce095b6e7871b543097a79ca4d89a91721b8b711 | 8,544 |
media/plotly/Answer Relevancy vs Latency_18_3e4f3fed27c956db6055.plotly.json | 3e4f3fed27c956db6055fda6d8044a58e459ce0badaf2e964e4c571adfc6d516 | 14,949 |
media/plotly/Completion Tokens Violinplot_11_8fdb8bb0435241a3aca8.plotly.json | 8fdb8bb0435241a3aca84f0946d66407354a1126cfa9ce4cc11bacdd557ef3b4 | 20,287 |
media/plotly/Context Precision Barplot_31_cb51ffe5a4db24119fd0.plotly.json | cb51ffe5a4db24119fd0f40315b5d19af558737985f4241d04b36b16fa41f9b9 | 8,439 |
media/plotly/Context Precision Violinplot_13_be5b8e3a360ef3b1355c.plotly.json | be5b8e3a360ef3b1355cfc6dfa09808b4ba182e689c687fd07b73765200354b2 | 23,563 |
media/plotly/Context Recall Barplot_32_3089eb5f6672ff40196d.plotly.json | 3089eb5f6672ff40196dc537d3f0bf44a86fb64ef200db0c9304f845dec24930 | 8,428 |
media/plotly/Context Recall Violinplot_14_acd152a1ca7fede83f5f.plotly.json | acd152a1ca7fede83f5f65bf6494f529700891e02e3d783124d12e62137f5ca3 | 20,861 |
media/plotly/gpt-3.5-turbo-16k-0613 RadarPlot_20_c1d8ed43af68e6607393.plotly.json | c1d8ed43af68e660739378036f5108685e35e64b607fa26db7ef724bb75e2650 | 8,327 |
media/plotly/gpt-4-0613 RadarPlot_21_44127d677808ad92374f.plotly.json | 44127d677808ad92374f8dd436e3ff426f203bb6561effe96188c3453fe2ddaa | 8,316 |
media/plotly/gpt-4-1106-preview RadarPlot_22_eaa8c36b58a048ae2589.plotly.json | eaa8c36b58a048ae2589e5d1b5a1933257952fcb92ec6c11e4dda5fd786af464 | 8,323 |
media/plotly/gpt-4-1106-preview-v1.1 RadarPlot_23_605f51054bcaa9d91971.plotly.json | 605f51054bcaa9d919715184249148afec13d42601829943e92f7e734170e4c7 | 8,333 |
media/plotly/Model Comparison RadarPlot_19_124ceb47f7846c06228a.plotly.json | 124ceb47f7846c06228afeddac45e19b0033d4adacf172d1e42868971f9b5432 | 9,787 |
media/plotly/Prompt Tokens Violinplot_10_9ecdc440ad3343e50ff6.plotly.json | 9ecdc440ad3343e50ff64ef78ab43faf321b5cc1a31794fb06d2293ebfb72dfe | 20,595 |
media/plotly/Ragas Answer Correctness Score Barplot_27_cbe813ab899d39922644.plotly.json | cbe813ab899d3992264487f8174367260b5729542dccaf9e9fb92f8d75052c52 | 8,477 |
media/plotly/Ragas Answer Correctness Score ViolinPlot_6_47d6a37be3b12be8e679.plotly.json | 47d6a37be3b12be8e679841cd8df7b295a67f0794386d3767e5451c70e2df09a | 26,161 |
media/plotly/Ragas Answer Faithfulness Score Barplot_28_d3e1a364239c00256de4.plotly.json | d3e1a364239c00256de419f0a6c24e2e8f45e49d5600fb8281eef9698d465de5 | 8,479 |
media/plotly/Ragas Answer Faithfulness Score ViolinPlot_8_2b3b62db753efcacd597.plotly.json | 2b3b62db753efcacd597eafbb0f54a642ee40c95a1a7d3c33c074920d4c6a383 | 22,218 |
media/plotly/Ragas Answer Relevancy Score Barplot_29_1798d88e76781a133c02.plotly.json | 1798d88e76781a133c02eec065540e819526bf6e426c515505e80336452872ad | 8,465 |
media/plotly/Ragas Answer Relevancy Score ViolinPlot_7_6944efe73e5e2d9b1ba5.plotly.json | 6944efe73e5e2d9b1ba5467d27b27bba4abf352a283b1e5af781107c76f79c77 | 25,898 |
media/plotly/Ragas Answer Similarity Score Barplot_30_d27f5a05b8f7af50a2f2.plotly.json | d27f5a05b8f7af50a2f221e04ff4f2f34ff63593cd25d8665286c9e640145da8 | 8,474 |
media/plotly/Ragas Answer Similarity Score ViolinPlot_9_4ed530d315d9ab46f41c.plotly.json | 4ed530d315d9ab46f41c167fb733dc0bbc134af80021b3366ef6fe423646c908 | 25,967 |
media/plotly/Time Taken Violinplot_15_bc3ef5ad03346f843722.plotly.json | bc3ef5ad03346f843722621a9e0c0a93559010b77e07c400c47a89c2b415cc9c | 22,697 |
media/plotly/Total Tokens Violinplot_12_4c0036134f28fa80f735.plotly.json | 4c0036134f28fa80f735d2d502bec126f9a9b90065456f29e93bd906a8400ef2 | 20,682 |