New ask Hacker News story: Ask HN: What are you using for LLM response testing and benchmarking?

Ask HN: What are you using for LLM response testing and benchmarking?
2 by tin7in | 1 comments on Hacker News.
What are you using to test your LLM responses, benchmark them, maybe compare different versions? I've seen a few YC startups focusing on this but I haven't decided yet if we should build this internally or use an external tool.

Comments

Popular posts from this blog

New ask Hacker News story: Tell HN: Equifax free credit report dark patterns

New ask Hacker News story: Ask HN: Why can't the US government run their own social media?