Installation¶
Stable release¶
To install text_data without the extra visualization features, run this command in your terminal:
pip install text_data
If you also want the visualization features (which additionally installs altair and pandas), run this command in your terminal:
pip install "text_data[display]"
Or, using poetry, run:
poetry add text_data -E display
This is the preferred method to install Text Data, as it will always install the most recent stable release.
If you don’t have pip installed, the Python installation guide can walk you through the process.
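To confirm that the install worked, and whether the optional visualization dependencies came along, a short check like the one below may help. It is a minimal sketch that uses only the standard library; the helper name `installed_version` is made up for this illustration, and it assumes each distribution is registered under the same name used in the pip commands above.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string for `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # altair and pandas are only expected if the "display" extra was installed.
    for name in ("text_data", "altair", "pandas"):
        print(name, installed_version(name) or "not installed")
```

If altair or pandas shows up as not installed, the plain install was probably used rather than the one with the display extra.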
From sources¶
The sources for Text Data can be downloaded from the GitHub repo.
You can either clone the public repository:
git clone https://github.com/maxblee/text_data
Or download the tarball:
curl -OJL https://github.com/maxblee/text_data/tarball/master
Once you have a copy of the source, you can install it with:
poetry install -E display
Or, to leave out the development dependencies:
poetry install -E display --no-dev
Additional Tutorial Setup¶
If you want to follow along with the tutorial, you should download the Kaggle State of the Union Corpus, unzip it, and put it into a directory called sotu-data.
You should also install the latest version of text_data with the optional visualization features. The tutorial walks through an analysis using text_data; it makes extensive use of pandas and some use of the built-in visualization features.
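The corpus setup described above can also be scripted. The sketch below unzips a downloaded archive into sotu-data using only the standard library. The function name `extract_corpus` and the archive file name `archive.zip` are placeholders for this illustration; substitute the name of the file you actually downloaded from Kaggle.

```python
import zipfile
from pathlib import Path


def extract_corpus(archive_path, target_dir="sotu-data"):
    """Unzip `archive_path` into `target_dir` and return the extracted file paths, sorted."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
    return sorted(p for p in target.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Hypothetical file name: use the name of your Kaggle download instead.
    archive = Path("archive.zip")
    if archive.exists():
        files = extract_corpus(archive)
        print(f"Extracted {len(files)} files into sotu-data")
    else:
        print(f"{archive} not found; download the corpus first")
```

Depending on how the archive is packed, the text files may land in a subdirectory of sotu-data, so it is worth checking the printed file count and the directory layout before starting the tutorial.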