Solving Math Word Problems concerning Systems of Equations with GPT-3
Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One chal- lenging set of tasks for school students is learning to solve math word problems.
Implementation
Source publication / research team or educational organization described in paper
Learning context
Research / curriculum design context
AI role
Co-creator
Outcome signal
Conceptual understanding
Registry Facets
- Unspecified / broad education
- Math education
- LLMs
- LLM/Chat
- Generative AI
- Instructional design / AI education
- Students
- Researchers
- LLM/Chat
- Generative AI
- Research / curriculum design context
- Design / conceptual evidence
- Conceptual understanding
- Engagement / motivation
Implementing Organization
Source publication / research team or educational organization described in paper
Not specified in extracted text
Researchers, educators, instructors, or facilitators as described in the source publication
Learning Context
- Research / curriculum design context
Classroom, course, or resource-based AI education activity
Not specified in extracted text
Not specified in extracted text
LLM/Chat, Generative AI
- AI output reliability, hallucination, academic integrity, and age-appropriate use require safeguards.
Learner Profile
Unspecified / broad education
Mixed or not explicitly specified; infer from target learner group and intervention design.
Varies by intervention; not specified unless the paper explicitly describes prerequisites.
Educational Intent
- Document the AI education intervention, course, tool, or resource described in the source publication.
- Extract the learner context, AI role, pedagogy, outcomes, and constraints for AAB registry comparison.
- Researchers have been interested in developing AI tools to help students learn various mathematical subjects.
- Support AAB comparison across AI literacy, AI education, teacher training, higher education, and workforce contexts.
- Capture evidence maturity, transferability, and limitations rather than treating the publication as product endorsement.
- Not an AAB endorsement of the tool, curriculum, provider, or result.
- Not a direct replication record unless the source paper reports implementation details sufficient for replication.
AI Tool Description
LLM/Chat, Generative AI
Language context discussed in source publication
- Co-creator
- Primary interaction pattern inferred from publication: Instructional design / AI education.
- AI capability focus: LLM/Chat, Generative AI.
- Require human review of generated outputs and explicit guidance against over-reliance or answer copying.
Activity Design
- Review the publication’s reported context, learner group, AI tool or curriculum, implementation process, and outcome evidence.
- Map the case to AAB registry fields for comparison across educational levels and AI capability types.
- Use the source publication and PDF for any manual verification before public registry release.
- Human educators/researchers remain responsible for instructional design, supervision, interpretation, and ethical safeguards.
- AI systems or AI concepts provide the learning object, support tool, evaluator, simulator, or automation context depending on the paper.
- Instructional / curriculum-based learning
- Registry extraction emphasizes explicit learning goals, observed outcomes, constraints, and safety limitations.
Observed Challenges
- AI output reliability, hallucination, academic integrity, and age-appropriate use require safeguards.
Design Adaptations
- Case classified under: Published paper.
- Pedagogical pattern: Instructional / curriculum-based learning.
- Any additional adaptations should be verified against the full paper before public-facing publication.
Reported Outcomes
- Engagement evidence should be interpreted according to the source paper’s reported method and sample.
- One chal- lenging set of tasks for school students is learning to solve math word problems.
- One chal- lenging set of tasks for school students is learning to solve math word problems.
- We explore how recent advances in natural language processing, specifically the rise of power- ful transformer based models, can be applied to help math learners with such problems.
Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One chal- lenging set of tasks for school students is learning to solve math word problems.
Ethical & Privacy Considerations
- Require human review of generated outputs and explicit guidance against over-reliance or answer copying.
Evidence Type
- Design / conceptual evidence
Relevance to Research
- Can be used as an AAB evidence record for cross-case comparison, standards drafting, and evidence-maturity mapping.
- Supports identification of recurring patterns in AI literacy, AI education implementation, teacher preparation, assessment, and responsible AI learning.
- Conceptual understanding
- Engagement / motivation
- Instructional design / AI education
- LLM/Chat
- Generative AI
Case Status
- Completed
AAB Classification Tags
Unspecified / broad education
Research / curriculum design context
LLM/Chat, Generative AI
Instructional / curriculum-based learning
Medium
Medium
Source Publication
Solving Math Word Problems concerning Systems of Equations with GPT-3
- Mingyu Zong
- Bhaskar Krishnamachari
Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37 No. 13, EAAI-23
2023
10.1609/aaai.v37i13.26896
https://ojs.aaai.org/index.php/AAAI/article/view/26896
https://ojs.aaai.org/index.php/AAAI/article/view/26896/26668
087_Solving Math Word Problems Concerning Systems of Equations with GPT-3.pdf
8
Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One chal- lenging set of tasks for school students is learning to solve math word problems. We explore how recent advances in natural language processing, specifically the rise of power- ful transformer based models, can be applied to help math learners with such problems. Concretely, we evaluate the use of GPT-3, a 1.75B parameter transformer model recently re- leased by OpenAI, for three related challenges pertaining to math word problems corresponding to systems of two linear equations. The three challenges are classifying word prob- lems, extracting equations from word problems, and generat- ing word problems. For the first challenge, we define a set of problem classes and find that GPT-3 has generally very high accuracy in classifying word problems (80%-100%), for all but one of these classes. For the second challenge, we find the accuracy for extracting equations improves with number of examples provided to the model, ranging from an accu- racy of 31% for zero-shot learning to about 69% using 3-shot learning, which is further improved to a high value of 80% with fine-tuning. For the third challenge, we find that GPT-3 is able to generate problems with accuracy ranging from 33% to 93%, depending on the problem type.
Transferability
- Research / curriculum design context
- AI output reliability, hallucination, academic integrity, and age-appropriate use require safeguards.
Cost And Operations
Not specified in extracted text unless noted in duration field.
Requires educators/researchers/facilitators with sufficient AI literacy and pedagogy knowledge for the target learners.
Infrastructure depends on AI tool type, learner devices, data access, and institutional policy context.
Extraction Notes
High
- group_size
- duration
This entry was automatically extracted from the PDF text and manifest metadata. Fields should be manually verified before public registry publication, especially group size, location, duration, and outcome claims.
Empowering Children’s AI Literacy Through Co-Creating Stories with LLM
0.417
false
