Skip to content

Commit 26f0340

Browse files
committed
fix: Correct some errors in 'The Llama 3 Herd of Models'
1 parent b4e4d11 commit 26f0340

File tree

2 files changed

+230
-232
lines changed

2 files changed

+230
-232
lines changed

_posts/2025-10-07-deepseek-v3-technical-report.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
layout: post
33
title: "DeepSeek-V3 Technical Report"
44
date: 2024-12-27 04:03:16
5-
author: "DeepSeek AI Research"
5+
author: "DeepSeek AI"
66
categories: ["Paper Reviews", "Language-Models"]
77
tags: ["Auxiliary-Loss-Free-Load-Balancing", "Multi-Token-Prediction", "Multi-Head-Latent-Attention", "DeepSeekMoE-Architecture", "FP8-Mixed-Precision-Training", "Efficient-Cross-Node-All-to-All-Communication", "Node-Limited-Routing", "Computation-Communication-Overlap", "Tile-Wise-Fine-Grained-Quantization", "Speculative-Decoding"]
88
cover: /assets/images/language-models.jpg

0 commit comments

Comments
 (0)