Simplify model performance table

This commit is contained in:
KennyDizi
2024-06-01 08:05:37 +07:00
parent 37f6e18953
commit ea7a84901d

View File

@ -11,116 +11,22 @@ Here are the results:
<br> <br>
<br> <br>
<head> | Model name | Model size [B] | Better than gpt-4 rate, after fine-tuning [%] |
<meta charset="UTF-8"> |-----------------------------|----------------|----------------------------------------------|
<meta name="viewport" content="width=device-width, initial-scale=1.0"> | **DeepSeek 34B-instruct** | **34** | **40.7** |
<style> | DeepSeek 34B-base | 34 | 38.2 |
td, th { | Phind-34b | 34 | 38 |
font-size: 16px; /* Adjust this value to your preference */ | Granite-34B | 34 | 37.6 |
} | Codestral-22B-v0.1 | 22 | 32.7 |
table { | QWEN-1.5-32B | 32 | 29 |
width: 100%; | | | |
border-collapse: collapse; | **CodeQwen1.5-7B** | **7** | **35.4** |
} | Granite-8b-code-instruct | 8 | 34.2 |
th { | CodeLlama-7b-hf | 7 | 31.8 |
background-color: #f2f2f2; | Gemma-7B | 7 | 27.2 |
border: 1px solid #dddddd; | DeepSeek coder-7b-instruct | 7 | 26.8 |
text-align: center; | Llama-3-8B-Instruct | 8 | 26.8 |
padding: 8px; | Mistral-7B-v0.1 | 7 | 16.1 |
}
td {
border: 1px solid #dddddd;
text-align: center;
padding: 8px;
}
tr:nth-child(even) {
background-color: #f9f9f9;
text-align: center;
}
</style>
<title>Model Performance Table</title>
</head>
<body>
<table>
<tr>
<th align="center">Model name</th>
<th align="center">Model size [B]</th>
<th align="center">Better than gpt-4 rate, after fine-tuning [%]</th>
</tr>
<tr>
<td align="center"><b>DeepSeek 34B-instruct</b></td>
<td align="center"><b>34</b></td>
<td align="center"><b>40.7</b></td>
</tr>
<tr>
<td align="center">DeepSeek 34B-base</td>
<td align="center">34</td>
<td align="center">38.2</td>
</tr>
<tr>
<td align="center">Phind-34b</td>
<td align="center">34</td>
<td align="center">38</td>
</tr>
<tr>
<td align="center">Granite-34B</td>
<td align="center">34</td>
<td align="center">37.6</td>
</tr>
<tr>
<td align="center">Codestral-22B-v0.1</td>
<td align="center">22</td>
<td align="center">32.7</td>
</tr>
<tr>
<td align="center">QWEN-1.5-32B</td>
<td align="center">32</td>
<td align="center">29</td>
</tr>
<tr>
<td align="center"></td>
<td align="center"></td>
<td align="center"></td>
</tr>
<tr>
<td align="center"><b>CodeQwen1.5-7B</b></td>
<td align="center"><b>7</b></td>
<td align="center"><b>35.4</b></td>
</tr>
<tr>
<td align="center">Granite-8b-code-instruct</td>
<td align="center">8</td>
<td align="center">34.2</td>
</tr>
<tr>
<td align="center">CodeLlama-7b-hf</td>
<td align="center">7</td>
<td align="center">31.8</td>
</tr>
<tr>
<td align="center">Gemma-7B</td>
<td align="center">7</td>
<td align="center">27.2</td>
</tr>
<tr>
<td align="center">DeepSeek coder-7b-instruct</td>
<td align="center">7</td>
<td align="center">26.8</td>
</tr>
<tr>
<td align="center">Llama-3-8B-Instruct</td>
<td align="center">8</td>
<td align="center">26.8</td>
</tr>
<tr>
<td align="center">Mistral-7B-v0.1</td>
<td align="center">7</td>
<td align="center">16.1</td>
</tr>
</table>
</body>
<br> <br>
@ -191,4 +97,4 @@ why: |
are practical improvements. In contrast, Response 2 focuses more on general advice and less are practical improvements. In contrast, Response 2 focuses more on general advice and less
actionable suggestions, such as changing variable names and adding comments, which are less actionable suggestions, such as changing variable names and adding comments, which are less
critical for immediate code improvement." critical for immediate code improvement."
``` ```