arxiv:2606.30634
Egor Petrov
moderntalker
AI & ML interests
None yet
Recent Activity
authored a paper about 11 hours ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
updated a model 2 days ago
moderntalker/efficient_pretrain_checkpoints
Organizations
None yet