Installation

Installation

Stable release

To install text_data without the extra visualization features, run this command in your terminal:

pip install text_data

If you also want to install the visual features (which installs altair and pandas), run this command in your terminal

pip install text_data[display]

Or, using poetry, run

poetry add text_data -E display

This is the preferred method to install Text Data, as it will always install the most recent stable release.

If you don’t have pip installed, this Python installation guide can guide you through the process.

From sources

The sources for Text Data can be downloaded from the Github repo.

You can either clone the public repository:

git clone git://github.com/maxblee/text_data

Or download the tarball:

curl -OJL https://github.com/maxblee/text_data/tarball/master

Once you have a copy of the source, you can install it with:

poetry install -E display

Or

poetry install -E display --no-dev

Additional Tutorial Setup

If you want to follow along with the tutorial, you should download the Kaggle State of the Union Corpus, unzip it, and put it into a directory called sotu-data.

You should also install the latest version of text_data with the optional visualization features. The tutorial shows the process of conducting an analysis using text_data and makes extensive use of pandas and some use of the build-in visualization features.