Convert MediaWiki dumps to Neo4j databases, with all nodes linked using NLP.

bevan-philip/wiki-to-neo4j

wiki-to-neo4j

wiki-to-neo4j is a project that converts a MediaWiki wiki dump into a Neo4j database.

It does this by extracting Infobox template information and by using natural language processing to pull further information from the text on each page.
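
To make the Infobox step concrete, here is a minimal sketch of how key/value pairs could be pulled out of a page's wikitext. It is an illustration only, not the project's actual code: the function name `extract_infobox` and the sample page are hypothetical, and it assumes the template name starts with `Infobox` and that the braces are balanced.

```python
def extract_infobox(wikitext):
    """Return (template_name, fields) for the first Infobox template, or None.

    Hypothetical helper for illustration; not part of wiki-to-neo4j.
    """
    start = wikitext.find("{{Infobox")
    if start == -1:
        return None

    depth = 0        # nesting level of {{ ... }}
    link_depth = 0   # nesting level of [[ ... ]]
    parts, current = [], []
    i = start
    while i < len(wikitext):
        pair = wikitext[i:i + 2]
        if pair == "{{":
            depth += 1
            if depth > 1:            # keep nested templates verbatim
                current.append(pair)
            i += 2
        elif pair == "}}":
            depth -= 1
            if depth == 0:           # end of the Infobox itself
                parts.append("".join(current))
                break
            current.append(pair)
            i += 2
        elif pair == "[[":
            link_depth += 1
            current.append(pair)
            i += 2
        elif pair == "]]":
            link_depth = max(0, link_depth - 1)
            current.append(pair)
            i += 2
        elif wikitext[i] == "|" and depth == 1 and link_depth == 0:
            # A top-level pipe separates one parameter from the next.
            parts.append("".join(current))
            current = []
            i += 1
        else:
            current.append(wikitext[i])
            i += 1
    else:
        return None  # ran off the end: unbalanced braces

    name = parts[0].strip()
    fields = {}
    for part in parts[1:]:
        key, sep, value = part.partition("=")
        if sep:  # skip positional parameters that have no "="
            fields[key.strip()] = value.strip()
    return name, fields


sample = """{{Infobox character
| name = Ada
| home = [[London|the city]]
| born = {{date|1815}}
}}
Ada lived in [[London]]."""

print(extract_infobox(sample))
```

Tracking link and template nesting matters because pipes also appear inside `[[target|label]]` links and nested templates, where they must not split a field.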


filter.py

filter.py is a filtering program that reduces the size of MediaWiki dumps by removing miscellaneous information and pages outside the selected namespace.

Run filter.py -h for usage instructions.
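
As a rough sketch of what namespace filtering involves, the example below streams a MediaWiki XML export and keeps only pages whose `<ns>` value matches the requested namespace. It is not filter.py's implementation, and the function names `local` and `iter_pages` are hypothetical. It assumes the standard export layout, in which each `<page>` holds `<title>`, `<ns>`, and a `<revision>` containing `<text>`.

```python
import io
import xml.etree.ElementTree as ET


def local(tag):
    """Strip the '{namespace-uri}' prefix ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source, keep_ns=0):
    """Yield (title, text) for each page in the chosen MediaWiki namespace.

    Hypothetical helper for illustration; `source` is a filename or a
    binary file-like object containing a MediaWiki XML export.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local(elem.tag) != "page":
            continue
        children = {local(child.tag): child for child in elem}
        ns = children.get("ns")
        if ns is not None and ns.text is not None and int(ns.text) == keep_ns:
            title_elem = children.get("title")
            title = title_elem.text if title_elem is not None else None
            text = ""
            revision = children.get("revision")
            if revision is not None:
                for child in revision:
                    if local(child.tag) == "text":
                        text = child.text or ""
            yield title, text
        # Drop the page's children once it has been handled, so large
        # dumps are not held in memory all at once.
        elem.clear()


sample_dump = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page><title>Ada</title><ns>0</ns><revision><text>Main text</text></revision></page>
  <page><title>Talk:Ada</title><ns>1</ns><revision><text>Chat</text></revision></page>
</mediawiki>"""

for title, text in iter_pages(io.BytesIO(sample_dump), keep_ns=0):
    print(title, "->", text)
```

Namespace 0 is the main article namespace in MediaWiki, so `keep_ns=0` drops talk pages, user pages, and similar content.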

wiki4j.py

wiki4j.py takes a MediaWiki database as input and converts it into a Neo4j graph database.

Run wiki4j.py -h for usage instructions.
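
The sketch below shows one way page data could be turned into graph writes, expressed as parameterized Cypher statements. It is an assumption-laden illustration, not wiki4j.py's method: explicit `[[wikilinks]]` stand in for the NLP-derived relationships the project describes, and the `Page` label, the `LINKS_TO` relationship type, and the function names are all hypothetical.

```python
import re

# Matches [[Target]], [[Target|label]] and [[Target#Section]]; captures Target.
LINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|[^\]]*)?\]\]")


def link_targets(wikitext):
    """Return unique wikilink targets in order of first appearance."""
    seen = []
    for match in LINK.finditer(wikitext):
        target = match.group(1).strip()
        if target and target not in seen:
            seen.append(target)
    return seen


def page_to_cypher(title, wikitext):
    """Build (query, parameters) pairs describing one page and its links.

    Hypothetical helper: the Page label and LINKS_TO type are
    illustrative choices, not the schema wiki4j.py produces.
    """
    statements = [("MERGE (p:Page {title: $title})", {"title": title})]
    for target in link_targets(wikitext):
        statements.append((
            "MATCH (a:Page {title: $source}) "
            "MERGE (b:Page {title: $target}) "
            "MERGE (a)-[:LINKS_TO]->(b)",
            {"source": title, "target": target},
        ))
    return statements


text = "Ada lived in [[London|the city]] and [[London]], near [[Thames#History]]."
for query, params in page_to_cypher("Ada", text):
    print(query, params)
```

Each pair could then be passed to `session.run(query, params)` with the official Neo4j Python driver. Using `MERGE` rather than `CREATE` keeps the import idempotent, so a page referenced by many others still ends up as a single node.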
