this post was submitted on 12 Jun 2023
40 points (100.0% liked)

Rust Lang

10 readers
1 users here now

Rules [Developing]

Observe our code of conduct

Constructive criticism only

No endless relitigation

No low-effort content

No memes or image macros

No NSFW Content

founded 1 year ago
MODERATORS
 

Any project in rust you want to chat about.

you are viewing a single comment's thread
view the rest of the comments
[–] NovelKeysInk@discuss.tchncs.de 2 points 1 year ago (2 children)

I started working on a text analysis crate during grad school. I've gotten some surface-level stuff in there already, but I'd like to flesh it out and add some more true NLP functionality.

Link: https://github.com/michael-long88/rnltk

[–] jasory@lemmyrs.org 2 points 1 year ago (1 children)

Does this detect stylistic similarity between texts? I was thinking about porting a similar project that was developed to determine probable authorship. It would be quite useful as an open-source bot detector.

Nothing quite that advanced. It's mostly just stemming, basic tokenization, TF-IDF, and cosine similarity at this point.