Zhangzhe's Blog
The projection of my life.
Home
Tags
Categories
Search
0%
LLM
Tag
2024
12-27
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
12-26
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
12-26
LOMO:Full Parameter Fine-tuning for Large Language Models with Limited Resources
12-20
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
12-20
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
12-13
上手训练大模型(2)——以LlamaFactory视角看大模型微调全流程
12-13
上手训练大模型(1)——用Alpaca-cleaned指令微调Llama-3.2-3B
10-29
大模型量化/部署——在AX650上部署Qwen模型
10-22
用 transformers 推理 Qwen2-0.5B-Instruct
10-12
LoRA: Low-Rank Adaptation of Large Language Models
1
2
3
4
5