Evaluating Answers with Large Language Models: How InferESG and RAGAS Helped
In our latest project, we set out to evaluate how different Large Language Models (LLMs) perform when responding to user prompts. Our goal was not to determine which model is superior, but to assess whether open-source models can serve as viable alternatives to OpenAI’s proprietary models. We did this by building on our existing platform, InferESG, which automatically generates reports from ESG disclosures.

Ana Fonseca















