Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

MLP

classroom

AI & ML interests

Building ChatGPT for EPFL Course Contents

cs552-mlp 's collections 4

Submitted Models

This collection includes base phi3-mini A) DPO aligned B) SFT on ARC train split (3 epochs) C) Quantised version of B) using GPTQ.

cs552-mlp/phi3-dpo

Updated May 28, 2024 • 5
cs552-mlp/phi3-lora-arc3

Updated Jun 11, 2024 • 4
cs552-mlp/phi3-lora-arc3-gptq-4bits

Text Generation • 4B • Updated Jun 12, 2024 • 6

M3: SFT for MCQA

Collection of Phi3 models aligned for MCQA for respective datasets for 3 epochs. MCQ dataset is combination of all datasets.

cs552-mlp/phi3-lora-arc3

Updated Jun 11, 2024 • 4
cs552-mlp/phi3-lora-sciq3

Updated Jun 11, 2024 • 4
cs552-mlp/phi3-lora-openbookqa3

Updated Jun 11, 2024 • 5
cs552-mlp/phi3-lora-mcq3

Updated Jun 11, 2024 • 3

M3: Quantisation

Collection of the quantised version of the best performing model finetuned for MCQA

cs552-mlp/phi3-lora-arc3-gptq-4bits

Text Generation • 4B • Updated Jun 12, 2024 • 6
cs552-mlp/phi3-lora-arc3-gptq-3bits

Text Generation • 4B • Updated Jun 12, 2024 • 7
cs552-mlp/phi3-lora-arc3-gptq-2bits

Text Generation • 4B • Updated Jun 12, 2024 • 8
cs552-mlp/phi3-lora-gptq-8bits

Text Generation • 4B • Updated Jun 11, 2024 • 7

M2: DPO Aligned Models

Collection of the QLoRA Phi-3 DPO aligned models, each having different hyper-parameters.

cs552-mlp/phi3-dpo-h4

Updated May 28, 2024 • 3

Submitted Models

This collection includes base phi3-mini A) DPO aligned B) SFT on ARC train split (3 epochs) C) Quantised version of B) using GPTQ.

cs552-mlp/phi3-dpo

Updated May 28, 2024 • 5
cs552-mlp/phi3-lora-arc3

Updated Jun 11, 2024 • 4
cs552-mlp/phi3-lora-arc3-gptq-4bits

Text Generation • 4B • Updated Jun 12, 2024 • 6

M3: Quantisation

Collection of the quantised version of the best performing model finetuned for MCQA

cs552-mlp/phi3-lora-arc3-gptq-4bits

Text Generation • 4B • Updated Jun 12, 2024 • 6
cs552-mlp/phi3-lora-arc3-gptq-3bits

Text Generation • 4B • Updated Jun 12, 2024 • 7
cs552-mlp/phi3-lora-arc3-gptq-2bits

Text Generation • 4B • Updated Jun 12, 2024 • 8
cs552-mlp/phi3-lora-gptq-8bits

Text Generation • 4B • Updated Jun 11, 2024 • 7

M3: SFT for MCQA

Collection of Phi3 models aligned for MCQA for respective datasets for 3 epochs. MCQ dataset is combination of all datasets.

cs552-mlp/phi3-lora-arc3

Updated Jun 11, 2024 • 4
cs552-mlp/phi3-lora-sciq3

Updated Jun 11, 2024 • 4
cs552-mlp/phi3-lora-openbookqa3

Updated Jun 11, 2024 • 5
cs552-mlp/phi3-lora-mcq3

Updated Jun 11, 2024 • 3

M2: DPO Aligned Models

Collection of the QLoRA Phi-3 DPO aligned models, each having different hyper-parameters.

cs552-mlp/phi3-dpo-h4

Updated May 28, 2024 • 3

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs