Automated Content Grading Using Machine Learning

This is a primitive research experiment in automation of grading of Exam Papers in Technical domain, currently only for theoretical content in the answer papers, which constitutes for the major part. Automation of the grading can significantly reduce the time and effort required by the human graders, and bring forth other advantages such as disinterested or unbiased checking, efficient and less subjective to human errors. Machine learning models implemented in this project include Bag of words, bag of vectors, bag of centroids, on the manually build datasets from exams given by students enrolled in technical domain courses in Amity University, Noida, India.

Motivation

Many organizations and educational institutions have already introduced computer based tests, even for text based exams. Our project aims at solving the problem of automated grading of these tests. Also, text based content from hand written papers (after OCR to text processing), can similarly be checked and graded as well, although research on this is in progress.

Hypothesis & Challenges

Technical domain As many systems are developed already to solve general content, e.g. Grading of English essays, etc; our approach is to solve the content in technical domain, which had not yet been successfully attempted to be solved. Technical domain would include courses like Bachelor of Science, Master of Science, Bach. Of Technology & Engineering, Arts, etc. and not General English based, e.g. Psychology, Law etc.
Computer Based Tests The exams of whom the content is to be graded by this approach will give us text input through the computer based examination. Many organizations today are adopting computer based exams.
Data collected Only the “theoretical content” can be graded using this approach is to be graded, as it takes about 75-85% of total time for a human grader to check and grade a paper for only the theoretical content. Checking of other types of content, e.g. Illustrative, diagrammatic, numeric etc. takes a very jiffy amount of time as compared to theoretical content. As a result, we can achieve significant amount of time being reduced while checking the theoretical content i.e. an expected 60-80% reduction in time spent by a human grader as compared to a computer grading system.
Large collection of data: As the content to be checked for human graders is generally large, it is expected that this system be implemented on domains with huge data content to be processed

Software tools used

We chose to implement our model in Python 2.7.x, as there is a vast set of libraries for working with natural language processing. We have used the Natural Language Toolkit (NLTK) and textmining for most NLP tasks. Other libraries: numpy, scipy, xlrd, xlwt, word2vec etc. have been used for various tasks.

Authors

Rahul Chauhan
Ravinder Saharan
Siddhartha Singh

Advisor

Preeti Sharma

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Sample Data & Results		Sample Data & Results
.gitignore		.gitignore
300features_40minwords_10context		300features_40minwords_10context
ACGML Paper.pdf		ACGML Paper.pdf
LICENSE		LICENSE
Readme.md		Readme.md
Word2VecUtility.py		Word2VecUtility.py
Word2Vec_AverageVectors.py		Word2Vec_AverageVectors.py
Word2Vec_BagOfCentroids.py		Word2Vec_BagOfCentroids.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sample Data & Results

Sample Data & Results

.gitignore

.gitignore

300features_40minwords_10context

300features_40minwords_10context

ACGML Paper.pdf

ACGML Paper.pdf

LICENSE

LICENSE

Readme.md

Readme.md

Word2VecUtility.py

Word2VecUtility.py

Word2Vec_AverageVectors.py

Word2Vec_AverageVectors.py

Word2Vec_BagOfCentroids.py

Word2Vec_BagOfCentroids.py

requirements.txt

requirements.txt

Repository files navigation

Automated Content Grading Using Machine Learning

Motivation

Hypothesis & Challenges

Software tools used

Authors

Advisor

About

Releases

Packages

Contributors 3

Languages

License

rkc007/Automated-Content-Grading-Using-Machine-Learning

Folders and files

Latest commit

History

Repository files navigation

Automated Content Grading Using Machine Learning

Motivation

Hypothesis & Challenges

Software tools used

Authors

Advisor

About

Topics

Resources

License

Stars

Watchers

Forks

Languages