Skip to content

sg0hsmt/budoux-rs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BudouX-rs

Crates.io API reference Test License

BudouX-rs is a rust port of BudouX (machine learning powered line break organizer tool).

Note: This project contains the deliverables of the BudouX project.

Note: BudouX-rs supported plain text only, not supports html inputs.

Demo

https://sg0hsmt.github.io/budoux-rs/

Documentation

https://docs.rs/crate/budoux/

Usage

Split sentences with internal model.

let model = budoux::models::default_japanese_model();
let words = budoux::parse(model, "これはテストです。");

assert_eq!(words, vec!["これは", "テストです。"])

Load model from json file and split sentences using the loaded model.

let file = File::open(path_to_json).unwrap();
let reader = BufReader::new(file);
let model: budoux::Model = serde_json::from_reader(reader).unwrap();
let words = budoux::parse(&model, "これはテストです。");

assert_eq!(words, vec!["これは", "テストです。"])

Test

cargo test

You can use GitHub Actions locally by act.

act -j test

Generate model from original BudouX

go generate ./...

Note: Generate model is require Go 1.13 or later.

About

BudouX-rs is a rust port of BudouX (machine learning powered line break organizer tool).

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages