Tokenization in NLP - Search News

14h

How test-time scaling unlocks hidden reasoning abilities in small language models (and allows them to outperform LLMs)

A 1B small language model can beat a 405B large language model in reasoning tasks if provided with the right test-time scaling strategy.

Cascaded Speech Translation Systems Outperform End-to-End Models, Research Finds

SpeechT mentorship connects researchers and practitioners to explore speech translation, finding cascaded architectures ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results