福模

免费开源AI模型下载_本地AI工具资源平台

大语言模型Large Language Models

Chinchilla大语言AI模型 - 训练效率优化

Chinchilla Large Language AI Model - Training Efficiency Optimization

Chinchilla大语言AI模型,训练效率优化的语言模型。证明了缩放定律,展示了参数数量与训练数据的关系,实现了更高效的训练。

Chinchilla large language AI model, a training efficiency optimized language model. Proves the scaling law, demonstrating the relationship between parameter count and training data, achieving more efficient training.

Chinchilla大语言模型训练效率缩放定律ChinchillaLarge Language ModelTraining EfficiencyScaling Law

文件大小

25.6 GB

Upload Size

25.6 GB

上传日期

2025-02-11

Upload Date

2025-02-11

下载次数

11,200

Downloads

11,200

评分

4.7/5.0

Rating

4.7/5.0

下载资源 Download Resources

下载资源表示您同意我们的使用条款和隐私政策

By downloading this resource, you agree to our Terms of Service and Privacy Policy

相关资源推荐

Qwen 2.5 开源模型资源 - 7B/14B/72B全系列版本Qwen 2.5 Open Source Model Resources - Full Series 7B/14B/72B Versions

Qwen 2.5开源模型资源,提供7B/14B/72B全系列版本。每个版本均经过优化,支持长文本输入和多语言处理,适用于不同的应用场景和硬件配置。

Qwen 2.5 open source model resources, providing full series 7B/14B/72B versions. Each version has been optimized to support long text input and multilingual processing, suitable for different application scenarios and hardware configurations.

Qwen开源模型多语言QwenOpen Source ModelMultilingual
52.8 GB2024-01-12
ERNIE Bot 4.5语义理解AI模型 - 中文NLP优化ERNIE Bot 4.5 Semantic Understanding AI Model - Chinese NLP Optimized

ERNIE Bot 4.5语义理解AI模型,专门为中文NLP优化的语义理解模型。在中文语言理解、生成和推理方面表现出色,支持多种应用场景。

ERNIE Bot 4.5 semantic understanding AI model, a semantic understanding model specifically optimized for Chinese NLP. Excels in Chinese language understanding, generation, and reasoning, supporting multiple application scenarios.

ERNIE Bot中文NLP语义理解ERNIE BotChinese NLPSemantic Understanding
7.8 GB2025-02-13
Grok-1超大规模AI语言模型 - 330B参数稀疏专家系统Grok-1 Ultra-Large Scale AI Language Model - 330B Parameter Sparse Expert System

Grok-1超大规模AI语言模型,330B参数的稀疏专家系统。采用MoE架构,具备卓越的语言理解和生成能力,支持复杂推理和长文本处理,代表当前AI语言模型的前沿水平。

Grok-1 ultra-large scale AI language model, a 330B parameter sparse expert system. Using MoE architecture, it possesses excellent language understanding and generation capabilities, supporting complex reasoning and long text processing, representing the current frontier level of AI language models.

Grok超大模型MoEGrokUltra-Large ModelsMoE
220 GB2025-02-27