EPFL Machine Learning and Optimization Laboratory

university

https://www.epfl.ch/labs/mlo/

epfml

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

haeggee authored a paper about 3 hours ago

Training Dynamics of the Cooldown Stage in Warmup-Stable-Decay Learning Rate Scheduler

haeggee authored a paper about 3 hours ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

haeggee authored a paper about 5 hours ago

The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

View all activity

haeggee

authored 2 papers about 3 hours ago

Training Dynamics of the Cooldown Stage in Warmup-Stable-Decay Learning Rate Scheduler

Paper • 2508.01483 • Published Aug 2, 2025

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 15

haeggee

authored a paper about 5 hours ago

The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

Paper • 2601.23045 • Published 3 days ago

vsabolcec

in epfml/FineWeb-HQ about 1 month ago

Error when loading LFS files

#2 opened 2 months ago by

oligou

vsabolcec

updated a collection 4 months ago

FineWeb-HQ datasets

Collection

Collection containing FineWeb-HQ and FineWeb2-HQ quality filtered datasets. • 3 items • Updated Oct 8, 2025

mjaggi

authored a paper 4 months ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 15

NXz64Fdf8Y

updated a dataset 4 months ago

epfml/FineWeb-HQ

Viewer • Updated Sep 30, 2025 • 2.45B • 12.4k • 4

Andron00e

authored a paper 5 months ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 15

mjaggi

authored a paper 5 months ago

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published Sep 1, 2025 • 25

MatPag

authored a paper 5 months ago

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published Sep 1, 2025 • 25

Andron00e

authored 2 papers 5 months ago

Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed

Paper • 2406.04443 • Published Jun 6, 2024

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published Sep 1, 2025 • 25

NXz64Fdf8Y

published a dataset 5 months ago

epfml/FineWeb-HQ

Viewer • Updated Sep 30, 2025 • 2.45B • 12.4k • 4

haeggee

authored 2 papers 7 months ago

BaCaDI: Bayesian Causal Discovery with Unknown Interventions

Paper • 2206.01665 • Published Jun 3, 2022 • 2

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28

NXz64Fdf8Y

authored a paper 7 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 77

mjaggi

authored a paper 7 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 77

vsabolcec

authored a paper 7 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 77

AI & ML interests

Recent Activity

Team members 10

epfml's activity

Error when loading LFS files