Zhangzhe's Blog
The projection of my life.
Home
Tags
Categories
Search
0%
LLM
Category
2025
01-07
大模型DPO入门
01-06
大模型RLHF入门
01-04
大模型RAG入门
2024
12-31
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
12-27
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
12-27
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
12-13
上手训练大模型(2)——以LlamaFactory视角看大模型微调全流程
12-13
上手训练大模型(1)——用Alpaca-cleaned指令微调Llama-3.2-3B
10-22
用 transformers 推理 Qwen2-0.5B-Instruct
10-11
LLaVA: Visual Instruction Tuning
1
2