LLM eval & testing toolkit
ROUGE (Recall-Oriented Understudy for Gisting Evaluation) scoring functions with TypeScript support
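
For readers new to the metric, here is a minimal sketch of what a ROUGE-N score measures: the clipped n-gram overlap between a candidate text and a reference, reported as precision, recall, and F1. The names `tokenize`, `ngramCounts`, `rougeN`, and `RougeScore` are hypothetical and chosen for illustration; they are not this package's API. The tokenizer (lowercase, split on non-alphanumeric characters, no stemming) is a simplifying assumption.

```typescript
// Illustrative sketch only: hypothetical names, not the API of this package.
interface RougeScore {
  precision: number;
  recall: number;
  f1: number;
}

// Assumption: lowercase and split on runs of non-alphanumeric characters.
function tokenize(text: string): string[] {
  return text
    .toLowerCase()
    .split(/[^a-z0-9]+/)
    .filter((token) => token.length > 0);
}

// Count every contiguous n-gram in a token list.
function ngramCounts(tokens: string[], n: number): Map<string, number> {
  const counts = new Map<string, number>();
  for (let i = 0; i + n <= tokens.length; i++) {
    const gram = tokens.slice(i, i + n).join(" ");
    counts.set(gram, (counts.get(gram) ?? 0) + 1);
  }
  return counts;
}

// ROUGE-N: n-gram overlap, with each n-gram's match count clipped to
// the smaller of its counts in the candidate and the reference.
function rougeN(candidate: string, reference: string, n: number = 1): RougeScore {
  const cand = ngramCounts(tokenize(candidate), n);
  const ref = ngramCounts(tokenize(reference), n);

  let candTotal = 0;
  cand.forEach((count) => {
    candTotal += count;
  });

  let refTotal = 0;
  let overlap = 0;
  ref.forEach((count, gram) => {
    refTotal += count;
    overlap += Math.min(count, cand.get(gram) ?? 0);
  });

  // Guard against empty inputs so the score is 0 rather than NaN.
  const precision = candTotal === 0 ? 0 : overlap / candTotal;
  const recall = refTotal === 0 ? 0 : overlap / refTotal;
  const f1 =
    precision + recall === 0
      ? 0
      : (2 * precision * recall) / (precision + recall);

  return { precision, recall, f1 };
}
```

With the candidate "the cat sat on the mat" and the reference "the cat is on the mat", `rougeN(candidate, reference, 1)` matches 5 of 6 unigrams on each side, so precision, recall, and F1 are all 5/6. With `n = 2`, 3 of the 5 reference bigrams match, giving a recall of 0.6.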