The fullest library for creating nonconsumptive files is the Python module nonconsumptive.
Depending on what format your texts are in, it may be very easy to create a representation without coding in python.
If you have a few hundred texts located inside a folder called
and associate metadata in ‘meta.csv’, you can run the following command to
create a set of bookstacks.
NOT IMPLEMENTEDJoin us to help develop it
pip install nonconsumptive nonconsumptive build --texts texts --metadata meta.csv --metadata-id-field filename --targets unigrams bigrams stacks srp --dir nc
Once you have done so, host it online and add the package to our registry to allow others to work with it.
For more information, see the python docs.
Nonconsumptive access in R is handled through the Apache-arrow package; we recommend tidytext for exploring the data that it produces.
The underlying data architecture here is designed to work seamlessly with a variety of other files.