Query Configuration
Execute query against DrLPita model
Compare responses and generate evaluation scores
Evaluation
F1 Score
Token Usage (tokens)
Latency (mins)
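
The comparison step and the F1 Score metric above could be sketched as follows. The source does not say how F1 is computed, so this assumes a token-overlap F1 between the model response and a reference answer, as is common in question-answering evaluation. The function name `token_f1` and the whitespace tokenization are hypothetical choices for illustration, not the pipeline's actual method.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a model response and a reference answer.

    Assumption: lowercase, whitespace-split tokens. The real pipeline may
    normalize or tokenize differently.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        # Two empty strings count as a match; exactly one empty is a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection: a shared token counts only as often as it
    # appears in both the prediction and the reference.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(token_f1("the cat", "the cat sat"))
```

Averaging this score over all query and reference pairs would give one aggregate F1 per run, which could then be reported alongside token usage and latency.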