时间:2024-02-27|浏览:226
用戶喜愛的交易所
已有账号登陆后会弹出下载
Mistral 是一家成立仅一年的人工智能初创公司,以其独特的艺术字徽标和欧洲历史上最大的种子轮融资而成为头条新闻,它推出了 Mistral Large——其最新、最大的企业模型——并与微软。
从今天开始,Mistral Large 被设计为一种文本生成模型,能够处理复杂的多语言推理任务,包括文本理解、转换和代码生成。
根据该公司分享的大规模多任务语言理解(MMLU)基准测试结果,它的表现相当不错,仅次于 GPT-4,是通过 API 提供的第二佳模型。
Mistral 表示,由于与微软建立了新的合作伙伴关系,大型模型将主要通过其 API 提供,但也可以通过 Azure AI 提供。
该公司还推出了 Mistral Small 的优化版本(该公司提供的较小型号)以及一款聊天应用程序,以帮助业务团队了解该公司提供的产品。
米斯特拉尔·大号:会发生什么?
作为一个多语言模型,Mistral Large 不仅可以流利地理解、推理和生成英语文本,还可以使用其他语言(从法语、西班牙语、德语和意大利语开始)。
现在,这并不是什么新鲜事,因为谷歌和 OpenAI 也提供多语言模型,但 Mistral 强调,其产品对所有语言都有“对语法和文化背景的细致理解”,这将带来更好的结果。
该模型具有 32K 个 token 的上下文窗口,这使得它能够处理大型文档并精确回忆信息。
它还具有精确的指令跟踪功能,允许开发人员设计他们的审核策略和本机函数调用。
虽然新模型在现实世界中的表现如何还有待观察,特别是针对 Gemini 1.5(支持多达 100 万个代币)等更大的产品,但 Mistral 表示,该模型在应对竞争对手产品方面做得相当不错。
例如,在 MMLU 测试中,Mistral Large 的准确率为 81.2%,仅次于 GPT-4 的 86.4%。
该基准测试不包括Gemini Pro 1.5,但Gemini Pro 1.0得分为71.8%。
Llama 2 70B也以69.9%的成绩落后。
The Meta offering even failed to beat (or match) Mistral in language-specific tests.
While similar rankings were seen in the GSM8K Math benchmark involving Llama and the GPT family, coding seemed to be a weak point for Mistral Large. In the HumanE benchmark for coding performance, the new large model performed with an accuracy of 45.1%, sitting well behind GPT-3.5, GPT-4 and Gemini Pro 1.0.
The company has also launched a new version of its smaller model, Mistral Small, with optimizations for latency and cost. It outperforms Mixtral 8x7B and serves as an intermediary solution between the company’s open-weight offering and Mistral Large.
While building models that perform well is crucial, you must ensure they reach the right customers as and when needed – an aspect critical for growth. This is where Mistral’s strategic partnership with Microsoft comes in.
Under this engagement, all open and commercial models offered by Mistral, including the new large model, will be made available on Azure AI Studio and Azure Machine Learning. This makes Mistral only the second company to make its commercial language models available on Azure.
Mistral says Azure users can tap the models with their existing credits and use them with “as seamless a user experience as with its own APIs.” The company will also provide direct access to its support team to customers coming via Azure.
“At Mistral AI, we make generative AI ubiquitous – through our open-source models and by bringing our commercial models where developers create. We are very proud to announce the availability of Mistral Large on Azure AI. Microsoft’s trust in our model is a step forward in our journey to put frontier AI in everyone’s hands,” Arthur Mensch, co-founder and CEO of Mistral AI, said in a statement.
That said, for Mistral, Microsoft will not be the only distribution partner. A few days ago, Amazon Web Services (AWS) Principal Developer Advocate Donnie Prakoso also announced that the French startup’s open models will come on Amazon Bedrock, its managed service for gen AI offerings and application development. However, he did not share when exactly it would happen.
To gain the trust of companies and eventually bring them on board via these channels, Mistral is also launching a chat app, a multilingual conversational assistant that shows what teams can build with its models and deploy in their respective business environments.
Users can create an account on Mistral’s website for beta access to Mistral Chat and interact with the models the company has on offer in a pedagogical and fun way. However, the company does caution that it will not be able to access the internet and may deliver inaccurate or outdated information in some cases. The company is also building an enterprise-centric version of the assistant with self-deployment capacities with fine-grained moderation.
该公司在另一篇博文中指出:“得益于可调节的系统级审核机制,当你将对话推向助理可能会产生敏感或有争议内容的方向时,le Chat 会以非侵入性的方式向你发出警告。”
根据 Crunchbase 的数据,Mistral 在由 Lightspeed Venture Partners 和 Andreessen Horowitz (a16z) 等知名投资者领投的种子轮和 A 轮融资中筹集了超过 5 亿美元。