TWiST 500 interviews with Cortical Labs, Turing, AND Mercor | E2159

This Week in Startups
1:23:28
238,436 views
2 months ago

Click to expand description...

Watch on YouTube

💬Comments

No comments available for this video

It's really important to um benchmark um LLMs to figure out where they are good

at, where are gaps in the models so that you can generate data that could help the LLM get better at those specific

tasks. One big challenge, Alex, that I see today in the overall evaluation and

benchmarking market is that a lot of the evaluations are somewhat academic and somewhat synthetic and don't connect to

real world applications or real world use. Ideally, you want uh AGI to

Unlock Full Transcript

You're viewing the first 5 lines

Register for free to access the complete transcript with timestamps

Free forever
No credit card
Showing first 5 lines • 806 more lines available