Sander Hautvast 742800e5cb added readme
2022-04-29 16:13:28 +02:00

RLTK

An attempt to manually port parts of NLTK to Rust.

Currently in its infancy:

  • rltk::lm::preprocessing::pad_both_ends(["a","b","c"], 2) -> ["<s>", "a", "b", "c", "</s>"]
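  • rltk::util::pad_sequence == same padding as pad_both_ends, with customisation
  • rltk::util::pad_sequence_left == same as pad_sequence, padding the left end only
  • rltk::util::pad_sequence_right == same as pad_sequence, padding the right end only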
  • rltk::util::ngrams(["a","b","c"], 2) -> [["a", "b"], ["b", "c"]]
  • rltk::util::bigrams(["a","b","c"]) == ngrams(..., 2)
  • rltk::util::trigrams(["a","b","c"]) == ngrams(..., 3)
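
The behaviour listed above can be illustrated with a minimal, standalone sketch. This is not the crate's actual code: the signatures, the slice-based inputs, and the `Vec` return types are assumptions made for illustration, and the real rltk API may well differ (for example by working on iterators). Only the input/output pairs are taken from the list above.

```rust
/// Hypothetical stand-in for `pad_both_ends`: adds n - 1 copies of "<s>"
/// on the left and n - 1 copies of "</s>" on the right, as NLTK does.
pub fn pad_both_ends(tokens: &[&str], n: usize) -> Vec<String> {
    let pad = n.saturating_sub(1); // n = 0 or 1 adds no padding
    let mut out = Vec::with_capacity(tokens.len() + 2 * pad);
    out.extend(std::iter::repeat("<s>".to_string()).take(pad));
    out.extend(tokens.iter().map(|t| t.to_string()));
    out.extend(std::iter::repeat("</s>".to_string()).take(pad));
    out
}

/// Hypothetical stand-in for `ngrams`: every contiguous window of length n.
/// Returns an empty Vec when n is 0 or longer than the input.
pub fn ngrams<T: Clone>(tokens: &[T], n: usize) -> Vec<Vec<T>> {
    if n == 0 {
        return Vec::new(); // slice::windows panics on a window size of 0
    }
    tokens.windows(n).map(|w| w.to_vec()).collect()
}

/// Assumed shorthand for `ngrams(tokens, 2)`.
pub fn bigrams<T: Clone>(tokens: &[T]) -> Vec<Vec<T>> {
    ngrams(tokens, 2)
}

/// Assumed shorthand for `ngrams(tokens, 3)`.
pub fn trigrams<T: Clone>(tokens: &[T]) -> Vec<Vec<T>> {
    ngrams(tokens, 3)
}
```

With these definitions, `pad_both_ends(&["a", "b", "c"], 2)` yields `["<s>", "a", "b", "c", "</s>"]` and `ngrams(&["a", "b", "c"], 2)` yields `[["a", "b"], ["b", "c"]]`, matching the examples in the list. Building on `slice::windows` keeps the sketch short, at the cost of requiring the whole sequence in memory.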