Archives Aug. 18, 2014
Seem to need hidden-markov-models for text extraction...
Written by
on
in
Snaking,
Tuxedo.
So in order to "seed" listener with text-as-it-would-be-spoken for coding, I've built up a tokenizer that will parse through a file and attempt to produce a model of what would have been said to dictate that text. The idea being that we want to generate a few hundred megabytes of sample statements that can then ...