TWiST 500 interviews with Cortical Labs, Turing, AND Mercor | E2159

This Week in Startups•

1:23:28

•

238,436 views

•

2 months ago

Click to expand description...

Watch on YouTube

💬Comments

No comments available for this video

It's really important to um benchmark um LLMs to figure out where they are good

at, where are gaps in the models so that you can generate data that could help the LLM get better at those specific

tasks. One big challenge, Alex, that I see today in the overall evaluation and

benchmarking market is that a lot of the evaluations are somewhat academic and somewhat synthetic and don't connect to

real world applications or real world use. Ideally, you want uh AGI to

Unlock Full Transcript

You're viewing the first 5 lines

Register for free to access the complete transcript with timestamps

Register Free Sign In

Free forever

No credit card

Showing first 5 lines • 806 more lines available

More from This Week in Startups

Board Talk Empire with GRIN's Brandon Brown, Founder Hacks & SEC Memecoin Clarity? | E2091

Board Talk Empire with GRIN's Brandon Brown, Founder Hacks & SEC Memecoin Clarity? | E2091

US Crypto Reserve Bombshell, Ramp’s $13B Climb, & more | E2092

US Crypto Reserve Bombshell, Ramp’s $13B Climb, & more | E2092

AI Agents & the Future of Work with LangChain’s Harrison Chase | AI Basics with Google Cloud

AI Agents & the Future of Work with LangChain’s Harrison Chase | AI Basics with Google Cloud

Coreweave IPO, AVRide & Digg is Back! | E2093

Coreweave IPO, AVRide & Digg is Back! | E2093

AI’s Next Leap: Hyper-Realistic Agents & Unlimited Power | E2094

AI’s Next Leap: Hyper-Realistic Agents & Unlimited Power | E2094

Student Roy Lee Hacks Amazon, SBF's Jailhouse Chat + more! | E2095

Student Roy Lee Hacks Amazon, SBF's Jailhouse Chat + more! | E2095