郝彦飞 MOA:实现本地混合智能体,击败GPT-4o Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models 混合代理 (MoA) – 在 AlpacaEval 和 OSS 模型上为 65.1% Mixture of Agents (MoA) is a novel approach that leverages the collective strengths of... 开源 推理
郝彦飞 Cake:Distributed LLM inference for mobile, desktop and server. Cake is a pure Rust implementation of the LLama3 distributed inference based on Candle . The goal of the project is being able to run big (70B+) models by repurposing consumer hardware into an heterog... 底层工具 开源 推理