March 2025

All Articles - 2

2025

2025-03-05

An Introduction to Policy Optimization

2025-03-05

Key Concepts in Reinforcement Learning

Magnicord

Re: Deep Learning From Scratch

Recent Posts

UV: The Definitive Solution for PyTorch, flash-attn, VeRL and OpenRLHF (2025-08-10)

Policy Gradient Algorithms: From REINFORCE to PPO (2025-07-15)

The Evolution of Policy Optimization for Enhancing LLM Reasoning: From PPO to GRPO Variants (2025-07-15)

An Introduction to PPO in RLHF (2025-07-15)

X-Enhanced Contrastive Decoding Strategies for Large Language Models (2025-05-30)

Categories

Deep Learning Basics (1)
NLP (5)
Python Basics (2)
RL4LLM (2)
Reinforcement Learning Basics (3)
dev-tools (1)

Tags

data-structure uv Python reinforcement-learning RLHF environment PEFT NLP reasoning deep-learning Pytorch LLM

Archives

August 2025 (1)
July 2025 (3)
May 2025 (1)
March 2025 (2)
January 2025 (4)
January 2024 (1)
October 2023 (2)

Website Info

Article Count: 14

Total Word Count: 71.1k

© 2023 - 2025 By Magnicord

Welcome to the Journey of Deep Learning