generated from allenai/python-package-template
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
11 additions
and
14 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -149,7 +149,7 @@ <h2 class="subtitle is-3 publication-subtitle" style="margin-bottom: 32px;"> | |
<span class="author-block"><sup style="color:#339cff">3</sup>Insource Services Inc,</span> | ||
<span class="author-block"><sup style="color:#ffac33">4</sup>Teaching Lab,</span> | ||
<span class="author-block"><sup style="color:#d84dda">5</sup>Allen Institute for AI</span><br> | ||
<span class="paper-block"><b style="color:#f41c1c">NeurIPS 2024, Math AI Workshop</b></span> | ||
<span class="paper-block"><b style="color:#f41c1c">NeurIPS 2024, Math-AI Workshop</b></span> | ||
</div> | ||
|
||
<div class="column has-text-centered" style="margin-top: 32px;"> | ||
|
@@ -337,7 +337,7 @@ <h2 class="title is-2" style="margin-top: 128px;">Introduction</h2> | |
|
||
<h2 class="title is-2" id="leaderboard">Leaderboard on DrawEduMath</h2> | ||
<div class="content"> | ||
<p class="mt-3"> Accuracy Scores on the | ||
<p class="mt-3"> Accuracy scores on the | ||
<img src="main_static/images/logos/drawedumath_logo.png" style="width:2.0em;vertical-align: middle" | ||
alt="Logo" /> | ||
<span class="drawedumath">DrawEduMath</span> dataset. | ||
|
@@ -383,7 +383,7 @@ <h2 class="title is-2" id="leaderboard">Leaderboard on DrawEduMath</h2> | |
|
||
|
||
<div> | ||
<p>The leaderboard scores are based on the judgements using <b>Mixtral 8x22B model</b>.</p> | ||
<p>The leaderboard scores are based on similarity judgements of VLMs' answers to gold ones obtained using a <b>Mixtral 8x22B model</b>.</p> | ||
<p>🚨 To submit your results to the leaderboard, please send to <a href="mailto:[email protected]">this | ||
email</a> with your result json files.</p> | ||
</p> | ||
|
@@ -425,9 +425,7 @@ <h2 class="title is-2">Overview</h2> | |
src="main_static/images/logos/assistments_a_logo.png" style="width:1.5em;vertical-align: middle" | ||
alt="Logo" />ASSISTments</a> | ||
online learning platform, where students receive feedback from teachers on assigned work. | ||
The problems that accompany each student response are drawn from three overlapping1 open educational | ||
resources (OER): Eureka Math, Open Up | ||
Resources, and Illustrative Math. | ||
The problems that accompany each student response are drawn from three overlapping open educational resources (OER): Eureka Math, Open Up Resources, and Illustrative Math. | ||
|
||
</p> | ||
|
||
|
@@ -451,8 +449,8 @@ <h2 class="title is-2">Overview</h2> | |
|
||
|
||
<p> | ||
You can download the dataset on <a href="https://huggingface.co/datasets/Heffernan-WPI-Lab/DrawEduMath" | ||
target="_blank">Hugging Face Dataset</a>. | ||
In the future, we will release the dataset on <a href="https://huggingface.co/datasets/Heffernan-WPI-Lab/DrawEduMath" | ||
target="_blank">Hugging Face</a>, but in the meantime, fill out this <a href="https://forms.gle/KEjDq4NpSDDm2e36A">Google form</a> to express interest. | ||
</p> | ||
|
||
</div> | ||
|
@@ -488,9 +486,8 @@ <h2 class="title is-2">Overview</h2> | |
<div class="columns is-centered m-6"> | ||
<div class="column is-full has-text-centered content"> | ||
<h2 class="title is-3">Examples</h2> | ||
<p>Examples of teacher’s answers to a question asking about possible errors in students’ responses to math | ||
problems. All three examples of students’ hand-drawn responses are for the same math problem asking students | ||
to | ||
<p>Here are examples of teachers' answers to a question asking about possible errors in students’ responses to math | ||
problems. All three examples of students’ hand-drawn responses are for the same math problem asking students to | ||
draw and shade units on fraction strips to show 4 thirds, shown on the left. | ||
</p> | ||
<img src="main_static/images/TeacherQA.png" alt="Example of teachers' answers to question about erro" | ||
|
@@ -521,10 +518,10 @@ <h2 class="title is-3">Statistics</h2> | |
<div class="content has-text-centered"> | ||
<img src="main_static/images/plots/question_type_statistics.png" | ||
alt="Overall question types in our VQA benchmark" width="70%" /> | ||
<p>Qualitative examples of the most common question types in our <img | ||
<p>Examples of the most common question types in our <img | ||
src="main_static/images/logos/drawedumath_logo.png" style="width:2.0em;vertical-align: middle" | ||
alt="Logo" /> | ||
<span class="drawedumath">DrawEduMath</span> benchmark, categornized by type. <br /> | ||
<span class="drawedumath">DrawEduMath</span> benchmark, categorized by type. <br /> | ||
</p> | ||
</div> | ||
</div> | ||
|
@@ -538,7 +535,7 @@ <h2 class="title is-3">Statistics</h2> | |
<!-- RESULTS SECTION --> | ||
<section class="hero is-light is-small"> | ||
<div class="hero-body has-text-centered"> | ||
<h1 class="title is-1 mathvista">Experiment Results</h1> | ||
<h1 class="title is-1 mathvista">Experimental Results</h1> | ||
</div> | ||
</section> | ||
|
||
|