Anyscale’s Post

View organization page for Anyscale, graphic

25,434 followers

Introducing RouteLLM: a sophisticated routing framework developed with Berkeley-LMSys in collaboration with Anyscale. RouteLLM optimizes query handling by dynamically selecting between high-performance proprietary LLMs and cost-effective open-source models, cutting costs by over 2x without sacrificing quality. Using human preference data and LLM-as-a-judge for data augmentation, our routers evaluate query complexity to choose the appropriate model. Rigorous testing on benchmarks like MMLU and GSM8K confirms our cost-efficient, high-quality performance. Explore our open-source code, models, and preference data on GitHub: https://lnkd.in/gJyijZRx, and try our online demo: https://lnkd.in/gME-GC2a Explore More about RouteLLM and learn how it can revolutionize your LLM applications: Read our Blog here: https://lnkd.in/gfPD-u-y LLMsys Blog here: https://lnkd.in/ga_MgERE Full research paper here: https://lnkd.in/gqRy7Pjy

GitHub - anyscale/llm-router: Tutorial for building LLM router

GitHub - anyscale/llm-router: Tutorial for building LLM router

github.com

Royal Sequeira

Machine Learning Engineer @ Georgian | Founder & Convener, Sushiksha

2w

Wonder how the results compares to GPT-4o both with regards to cost and performance? Also, are there hidden costs such as hosting the causal LLM?

To view or add a comment, sign in

Explore topics