docetl/experiments/structured_outputs.txt

Results Table:
                                                       Experiment Results
╭────────────────────────────────────────────────┬───────┬────────────┬───────────┬────────┬───────┬─────────────┬──────────────╮
│ Model                                          │ Doc % │ Approach   │ Precision │ Recall │    F1 │ Avg Runtime │ Avg Cost ($) │
├────────────────────────────────────────────────┼───────┼────────────┼───────────┼────────┼───────┼─────────────┼──────────────┤
│ azure/gpt-4o-mini                              │   10% │ structured │     0.869 │  0.872 │ 0.853 │      1.100s │      $0.0004 │
│ azure/gpt-4o-mini                              │   10% │ tool       │     0.914 │  0.906 │ 0.891 │      0.722s │      $0.0004 │
├────────────────────────────────────────────────┼───────┼────────────┼───────────┼────────┼───────┼─────────────┼──────────────┤
│ deepseek/deepseek-chat                         │   10% │ structured │     0.878 │  0.889 │ 0.877 │      2.094s │      $0.0003 │
│ deepseek/deepseek-chat                         │   10% │ tool       │     0.867 │  0.856 │ 0.860 │      2.212s │      $0.0003 │
├────────────────────────────────────────────────┼───────┼────────────┼───────────┼────────┼───────┼─────────────┼──────────────┤
│ lm_studio/hugging-quants/llama-3.2-3b-instruct │   10% │ structured │     0.033 │  0.022 │ 0.027 │     33.635s │      $0.0000 │
│ lm_studio/hugging-quants/llama-3.2-3b-instruct │   10% │ tool       │     0.000 │  0.000 │ 0.000 │     70.858s │      $0.0000 │
╰────────────────────────────────────────────────┴───────┴────────────┴───────────┴────────┴───────┴─────────────┴──────────────╯