Marius Dinca
Puddings22
AI & ML interests
None yet
Recent Activity
commented on a paper about 5 hours ago: Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm
upvoted a paper about 5 hours ago: Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm
upvoted a paper about 5 hours ago: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation
Organizations
None yet