@mdione Markov chains are extremely simple - and thus, fast. The way I put this one together also trades increased corpus size for more speed. In Nepenthes it has a depth of two, which is rather incoherent but the fastest you'll get with realistic text. I consider that extra incoherence to be a positive thing in this use case.
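For illustration, here's a minimal sketch of a depth-2 (order-2) chain in Python. This is not Nepenthes' actual code, just the idea: every pair of consecutive words maps to the words observed after that pair, and generation repeatedly samples from that mapping.

```python
import random
from collections import defaultdict

def train(words):
    """Map each consecutive word pair to the words observed after it."""
    chain = defaultdict(list)
    for a, b, c in zip(words, words[1:], words[2:]):
        chain[(a, b)].append(c)
    return chain

def babble(chain, length=50):
    """Walk the chain from a random starting pair, one sampled word at a time."""
    out = list(random.choice(list(chain)))
    for _ in range(length):
        followers = chain.get((out[-2], out[-1]))
        if not followers:                       # dead end: jump to a fresh pair
            out.extend(random.choice(list(chain)))
            continue
        out.append(random.choice(followers))
    return " ".join(out)

text = "the quick brown fox jumps over the lazy dog the quick red fox".split()
print(babble(train(text), length=30))
```

A depth of two means only the last two words constrain the next one, which is why the output drifts into incoherence so quickly, and so cheaply.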
It's slowed, however, by the fact the corpus is stored in SQLite, not RAM. That makes the bottleneck I/O throughput on disk reads, somewhat mitigated by OS buffering if you have spare memory for it.
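A rough sketch of what that looks like with the transitions in SQLite instead of RAM; the filename, table, and index here are assumptions for illustration, not Nepenthes' actual schema. Every generation step becomes a small indexed read, so throughput is bounded by the disk and whatever the OS page cache absorbs.

```python
import random
import sqlite3

# "corpus.db" and this single-table layout are illustrative assumptions.
conn = sqlite3.connect("corpus.db")
conn.execute("CREATE TABLE IF NOT EXISTS chain (w1 TEXT, w2 TEXT, w3 TEXT)")
conn.execute("CREATE INDEX IF NOT EXISTS idx_pair ON chain (w1, w2)")

def insert_triples(words):
    """Training: store one row per (pair, follower) observation."""
    conn.executemany(
        "INSERT INTO chain VALUES (?, ?, ?)",
        zip(words, words[1:], words[2:]),
    )
    conn.commit()

def next_word(w1, w2):
    """Generation step: an indexed read that hits disk unless the OS has it cached."""
    rows = conn.execute(
        "SELECT w3 FROM chain WHERE w1 = ? AND w2 = ?", (w1, w2)
    ).fetchall()
    return random.choice(rows)[0] if rows else None
```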
Holding the corpus entirely in memory is a thing I've done, but it both consumes a huge amount of RAM and requires retraining at every restart. @Workshopshed