Run a full evaluation of a conversation using LLM-as-Judge.
curl -X POST https://app.horneross.com/api/conversations/conv_abc123/evaluate \
  -H "Authorization: Bearer sk_live_xxx" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "guidelines": "Evaluate the clarity, accuracy, and friendliness of the responses"
  }'
{
  "success": true,
  "evalRun": {
    "id": "eval_xyz789",
    "type": "llm_judge",
    "status": "completed",
    "overallScore": 8.5,
    "scores": {
      "clarity": 9,
      "accuracy": 8,
      "helpfulness": 9,
      "tone": 8
    },
    "createdAt": "2024-01-21T16:00:00Z",
    "completedAt": "2024-01-21T16:00:05Z"
  }
}
POST /api/conversations/{conversationId}/evaluate
application/json
gpt-4o
gpt-4o-mini
claude-3-5-sonnet
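The request can also be assembled programmatically. The following is a minimal Python sketch using only the standard library; the helper name `build_evaluate_request` and the `LISTED_MODELS` set are illustrative assumptions, not part of the API. It checks the model against the three values listed here (whether the API accepts others is not stated in this reference) and builds the request without sending it.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://app.horneross.com"

# Model values listed in this reference. Treating them as the only
# accepted values is an assumption made for this sketch.
LISTED_MODELS = {"gpt-4o", "gpt-4o-mini", "claude-3-5-sonnet"}


def build_evaluate_request(conversation_id, api_key, model, guidelines):
    """Build (but do not send) the POST request for the evaluate endpoint."""
    if model not in LISTED_MODELS:
        raise ValueError(f"model {model!r} is not one of {sorted(LISTED_MODELS)}")

    # Percent-encode the ID so it is safe to place in the URL path.
    path_id = urllib.parse.quote(conversation_id, safe="")
    url = f"{BASE_URL}/api/conversations/{path_id}/evaluate"

    body = json.dumps({"model": model, "guidelines": guidelines}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")
```

To send it, pass the returned object to `urllib.request.urlopen` and decode the JSON response body.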
evalRun properties
llm_judge
human
completed
failed
pending
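A client will usually branch on `status` before reading the scores. The sketch below is one hedged way to do that in Python; `summarize_eval_run` is a hypothetical helper, and the field names are taken from the sample response on this page. What other fields a `failed` or `pending` run carries is not documented here, so the sketch reads only `id` in those cases.

```python
def summarize_eval_run(payload):
    """Return a one-line summary of an evaluate response, based on its status."""
    run = payload["evalRun"]
    status = run["status"]

    if status == "completed":
        # Scores are read only once the run has completed.
        return f"{run['id']} ({run['type']}): overall {run['overallScore']}"
    if status == "pending":
        return f"{run['id']}: still pending"
    if status == "failed":
        raise RuntimeError(f"evaluation {run['id']} failed")
    # Any value outside the three listed statuses is treated as an error.
    raise ValueError(f"unexpected status {status!r}")
```

Raising on `failed` rather than returning a string keeps a failed run from being mistaken for a scored one.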