You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: benchmarks/benchmarks_tests.csv
+2-1Lines changed: 2 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -20,4 +20,5 @@ T18,metadata_extraction,anthropic,claude-3-5-sonnet-20241022,Document,0.0,You ar
20
20
T19,metadata_extraction,genai,gemini-2.5-pro-exp-03-25,Document,0.0,You are a historian with keyword knowledge and an expert in the field of 20th century Swiss history,prompt.txt,false
21
21
T20,metadata_extraction,genai,gemini-2.0-flash-lite,Document,0.0,You are a historian with keyword knowledge and an expert in the field of 20th century Swiss history,prompt.txt,false
22
22
T21,metadata_extraction,genai,gemini-2.0-pro-exp-02-05,Document,0.0,You are a historian with keyword knowledge and an expert in the field of 20th century Swiss history,prompt.txt,false
23
-
T22,fraktur,genai,gemini-2.5-pro-exp-03-25,"",0.0,You are a historian with keyword knowledge and an expert in the field of 20th century Swiss history,prompt.txt,false
23
+
T22,fraktur,genai,gemini-2.5-pro-exp-03-25,"",0.0,You are a historian with keyword knowledge and an expert in the field of 20th century Swiss history,prompt.txt,false
24
+
T23,metadata_extraction,mistral,pixtral-large-latest,Document,0.0,You are a historian with keyword knowledge and an expert in the field of 20th century Swiss history. You only return valid JSON an no other text.,prompt.txt,false
0 commit comments