CityU-Zongxian commited on
Commit
2924e17
·
verified ·
1 Parent(s): ca9fc8c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -30,6 +30,13 @@ tags:
30
  Instead of focusing solely on the final answer, Med-REFL improves the model's intermediate reasoning process. It leverages a Tree-of-Thought (ToT) methodology to explore diverse reasoning paths and automatically constructs Direct Preference Optimization (DPO) data. This trains the model to identify and correct its own reasoning errors, leading to more accurate and trustworthy outputs.
31
 
32
  This repository contains the LoRA weights produced by the Med-REFL framework for various base models.
 
 
 
 
 
 
 
33
 
34
  # <span>Available Weights</span>
35
  The Med-REFL LoRA weights can be applied to the following base models to enhance their medical reasoning abilities.
 
30
  Instead of focusing solely on the final answer, Med-REFL improves the model's intermediate reasoning process. It leverages a Tree-of-Thought (ToT) methodology to explore diverse reasoning paths and automatically constructs Direct Preference Optimization (DPO) data. This trains the model to identify and correct its own reasoning errors, leading to more accurate and trustworthy outputs.
31
 
32
  This repository contains the LoRA weights produced by the Med-REFL framework for various base models.
33
+ # <span>Performance</span>
34
+ | Domain | Benchmark | Original | **+ Med-REFL** |
35
+ | :--- | :--- | :--- | :--- |
36
+ | **In-Domain** | MedQA-USMLE | 69.59 | **73.72** <span style="color: #2E8B57; font-size: small;">(+4.13)</span> |
37
+ | **Out-of-Domain**| MedMCQA | 62.13 | **64.66** <span style="color: #2E8B57; font-size: small;">(+2.53)</span> |
38
+ | **Out-of-Domain**| GPQA (Med+) | 50.67 | **56.80** <span style="color: #2E8B57; font-size: small;">(+6.13)</span> |
39
+ | **Out-of-Domain**| MMLU-Pro (Med+) | 61.87 | **64.97** <span style="color: #2E8B57; font-size: small;">(+3.10)</span> |
40
 
41
  # <span>Available Weights</span>
42
  The Med-REFL LoRA weights can be applied to the following base models to enhance their medical reasoning abilities.