No description
Find a file
Sander Hautvast 9e919336be wip
2022-12-18 17:27:56 +01:00
.github/workflows Create rust.yml 2022-04-29 16:16:03 +02:00
examples delete crap 2022-05-10 21:58:25 +02:00
src wip 2022-12-18 17:27:56 +01:00
.gitignore wip 2022-12-18 17:27:56 +01:00
Cargo.lock wip 2022-12-18 17:27:56 +01:00
Cargo.toml wip 2022-12-18 17:27:56 +01:00
extract wip 2022-12-18 17:27:22 +01:00
nl.dict wip 2022-12-18 17:27:22 +01:00
README.md added new function to readme 2022-05-10 17:25:52 +02:00

RLTK

An attempt to manually port some of nltk to rust.

from https://www.nltk.org/api/nltk.lm.html:

So as to avoid re-creating the text in memory, both train and vocab are lazy iterators. They are evaluated on demand at training time.

rltk has the same philosophy: everything is done using iterators (on iterators) on string slices.

Currently in it's infancy (but growing):

  • rltk::lm::preprocessing::pad_both_ends
  • rltk::lm::preprocessing::padded_everygrams
  • rltk::util::pad_sequence
  • rltk::util::pad_sequence_left
  • rltk::util::pad_sequence_right
  • rltk::util::ngrams
  • rltk::util::bigrams
  • rltk::util::trigrams
  • rltk::util::everygrams
  • rltk::util::flatten
  • rltk::metrics::distance::edit_distance