Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create an HDT in one pass. #186

Open
wants to merge 4 commits into
base: develop
Choose a base branch
from
Open

Create an HDT in one pass. #186

wants to merge 4 commits into from

Conversation

mielvds
Copy link
Member

@mielvds mielvds commented Jul 30, 2018

This PR is preliminary work to fix #47 and is not at all finished. However, I hope this starts a discussion that will lead to a possible solution. This is loosely based on what was implemented for Java.

General approach:

  • fix PlainDictionary.cpp to return inserted IDs.
  • add a loadOnePassmethod to BasicHDT.cpp to combine/replace loadDictionary()and loadTriples() which
    • builds a temporary dictionary
    • encodes all triples with IDs given by that temp. dictionary
    • create the final dictionary
    • update all triple IDs according to the new dictionary
  • update rdf2hdt.cpp to supply a 'loader type'

This is not at all performant yet, as my C++ knowledge is also so so. Therefore, I hope to receive enough help and feedback through this PR :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

How to load a gzipped N-Triples file?
1 participant