Installation

Pip

The easiest way to install pliers is with pip. For the latest stable release:

pip install pliers

Or, if you want to work on the bleeding edge:

pip install git+https://github.com/psychoinformaticslab/pliers.git

Dependencies

By default, installing pliers with pip installs only the third-party libraries that pliers needs in order to function; these are listed in requirements.txt. However, because pliers provides interfaces to a large number of feature extraction tools, dozens of other, optional dependencies may be required depending on which kinds of features you plan to extract (see optional-dependencies.txt). To be on the safe side, you can install all of the optional dependencies with pip:

pip install pliers[all]

Or, if you installed from a clone of the GitHub repository:

pip install -r optional-dependencies.txt

Note, however, that some of these Python dependencies have their own (possibly platform-dependent) requirements. Most notably, python-magic requires libmagic (see the python-magic documentation for installation instructions). Without it, you will have to load every stim explicitly rather than passing in filenames: stim = VideoStim('my_video.mp4') will work fine, but passing 'my_video.mp4' directly to an Extractor will not. Additionally, the Python OpenCV bindings require OpenCV3, which can be more challenging to install. Relatively few of the feature extractors in pliers currently depend on OpenCV, though, so you may not need to bother with this. Similarly, the TesseractConverter requires the tesseract OCR library, but no other Transformer does, so unless you plan to extract text from images, you can probably skip it.
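To see which of these optional pieces are present in your environment, a quick check along the following lines may help. This is only a sketch: the mapping of import names (magic, cv2, pytesseract) to features is an assumption made for illustration and is not taken from the pliers source. It also only tests whether the Python package can be located, not whether the underlying system library (libmagic, OpenCV, tesseract) is installed correctly.

```python
import importlib.util

# Assumed mapping from import name to the feature it unlocks (illustrative only).
OPTIONAL_MODULES = {
    "magic": "filename-based stim loading (python-magic, needs libmagic)",
    "cv2": "OpenCV-based extractors",
    "pytesseract": "TesseractConverter (needs the tesseract OCR library)",
}


def check_optional_modules(modules=OPTIONAL_MODULES):
    """Return {module_name: bool}, True if the module can be located."""
    return {
        name: importlib.util.find_spec(name) is not None
        for name in modules
    }


if __name__ == "__main__":
    for name, found in check_optional_modules().items():
        status = "found" if found else "missing"
        print(f"{name:12s} {status:8s} {OPTIONAL_MODULES[name]}")
```

A module reported as missing only matters if you plan to use the corresponding feature.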

Docker

If you have Docker installed on your machine, you can run:

docker run -p 8888:8888 ghcr.io/psychoinformaticslab/pliers:unstable

The above command starts a Jupyter notebook server. To connect, copy the URL displayed in the terminal, which has the form http://127.0.0.1:8888/?token=<some_token>, and paste it into your browser. The unstable tag can be replaced with a versioned tag such as 0.4.2. If you plan to publish data or results produced with pliers, please use a versioned tag: a versioned tag always refers to the same image, whereas the unstable tag changes periodically as pliers is updated. You can see the available versioned tags on GitHub.

If you want to save your work, you will need to mount a directory into the Docker container. By default, the Jupyter notebook opens in the /work directory, so we mount a host directory at /work:

docker run -v /my/directory:/work -p 8888:8888 ghcr.io/psychoinformaticslab/pliers:unstable

where /my/directory is replaced with the path to the host directory in which you want to save your work.

Note

If you are using Windows, specify the directory like this: -v 'C:\My Dir\With Spaces:/work'. Put the quotes around the entire argument, not just the Windows path.
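If you launch the container from a script rather than by hand, one way to sidestep shell quoting is to build the command as a list of arguments. The helper below is a hypothetical convenience, not part of pliers; it only assembles the docker run command shown above, with the host directory and /work joined into a single -v argument.

```python
def docker_run_args(host_dir, tag="unstable", host_port=8888):
    """Build the argument list for the `docker run` command shown above.

    `docker_run_args` is an illustrative helper, not a pliers function.
    """
    image = f"ghcr.io/psychoinformaticslab/pliers:{tag}"
    return [
        "docker", "run",
        # The whole mount spec is one list element, so spaces in host_dir
        # need no extra quoting when the list is passed to subprocess.
        "-v", f"{host_dir}:/work",
        # Jupyter listens on port 8888 inside the container.
        "-p", f"{host_port}:8888",
        image,
    ]


args = docker_run_args(r"C:\My Dir\With Spaces", tag="0.4.2")
print(args)
# To actually start the container:
#   import subprocess; subprocess.run(args, check=True)
```

Because the arguments are passed as a list, no shell is involved and the path with spaces stays intact as a single argument.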

If you want to build your own pliers Docker image, clone the pliers repository and use docker build:

git clone https://github.com/psychoinformaticslab/pliers.git
docker build --target executable -t pliers:dev -f ./pliers/docker/Dockerfile ./pliers

The -t flag specifies the name (tag) of the image, which can be any valid Docker image name; pliers:dev is just an example.

API Keys

While installing pliers itself is usually straightforward, setting up some of the web-based feature extraction APIs that pliers interfaces with can take a bit more effort. For example, pliers includes support for face and object recognition via Google’s Cloud Vision API, and enables conversion of audio files to text transcripts via several different speech-to-text services. While some of these APIs are free to use (and virtually all provide a limited number of free monthly calls), they all require each user to register for their own API credentials. This means that, in order to get the most out of pliers, you’ll probably need to spend some time registering accounts on a number of different websites. The following table lists all of the APIs supported by pliers at the moment, along with registration URLs:

Transformer class	Web service	Environment variable(s)	Variable description	Example values
WitTranscriptionConverter	Wit.ai speech-to-text API	WIT_AI_API_KEY	Server Access Token	A27C1HPZBEDVLW1T1IJAR3L2Q2DA6K3D
IBMSpeechAPIConverter	IBM Watson speech-to-text API	IBM_USERNAME IBM_PASSWORD	API username and password	98452-bvc42-fd-42221-cv21 (username*) FJ14fns21N1f (password)
GoogleSpeechAPIConverter	Google Cloud Speech API	GOOGLE_APPLICATION_CREDENTIALS	path to service account key	path/to/credentials.json
GoogleVisionAPITextConverter	Google Cloud Vision API	GOOGLE_APPLICATION_CREDENTIALS	path to service account key	path/to/credentials.json
GoogleVisionAPIFaceExtractor	Google Cloud Vision API	GOOGLE_APPLICATION_CREDENTIALS	path to service account key	path/to/credentials.json
GoogleVisionAPILabelExtractor	Google Cloud Vision API	GOOGLE_APPLICATION_CREDENTIALS	path to service account key	path/to/credentials.json
GoogleVisionAPIPropertyExtractor	Google Cloud Vision API	GOOGLE_APPLICATION_CREDENTIALS	path to service account key	path/to/credentials.json
GoogleVisionAPIWebEntitiesExtractor	Google Cloud Vision API	GOOGLE_APPLICATION_CREDENTIALS	path to service account key	path/to/credentials.json
MicrosoftAPITextConverter	Microsoft Computer Vision API	MICROSOFT_VISION_SUBSCRIPTION_KEY MICROSOFT_SUBSCRIPTION_LOCATION	API key and registered region	152b067184e2ae03711e6439de124c27 westus
MicrosoftVisionAPIExtractor (and subclasses)	Microsoft Computer Vision API	MICROSOFT_VISION_SUBSCRIPTION_KEY MICROSOFT_SUBSCRIPTION_LOCATION	API key and registered region	152b067184e2ae03711e6439de124c27 westus
MicrosoftAPIFaceExtractor (and subclasses)	Microsoft Face API	MICROSOFT_FACE_SUBSCRIPTION_KEY MICROSOFT_SUBSCRIPTION_LOCATION	API key and registered region	152b067184e2ae03711e6439de124c27 westus
ClarifaiAPIExtractor (and subclasses, e.g., ClarifaiAPIImageExtractor)	Clarifai recognition API	CLARIFAI_ACCESS_TOKEN CLARIFAI_USER_ID CLARIFAI_APP_ID	Personal access token, user name, and application name	168ed02e137459ead66c3a661be7b784 user_name my_application
RevAISpeechAPIConverter	Rev.ai speech-to-text API	REVAI_ACCESS_TOKEN	API key	686n83674ab3989d2f5e4aa0aec9f273

* Note that this is not the plaintext e-mail or username for your IBM services account

For the Google application credentials, you’ll need to enable a Service Account on your existing Google Cloud project; Google Cloud’s documentation on service accounts explains how to do this.

Once you’ve obtained API keys for the services you intend to use, there are two ways to get pliers to recognize and use your credentials. First, each API-based Transformer can be passed the necessary values (or a path to a file containing those values) as arguments at initialization. For example:

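from pliers.extractors import ClarifaiAPIImageExtractor
# Keyword names here mirror the CLARIFAI_* environment variables in the table above
ext = ClarifaiAPIImageExtractor(access_token='my_clarifai_access_token',
                                user_id='my_clarifai_user_id',
                                app_id='my_clarifai_app_id')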

Alternatively, you can store the appropriate values as environment variables, in which case you can initialize a Transformer without any arguments. This latter approach is generally preferred, as it doesn’t require you to hardcode potentially sensitive values into your code. The mandatory environment variable names for each service are listed in the table above.

from pliers.extractors import GoogleVisionAPIFaceExtractor
# Works fine if GOOGLE_APPLICATION_CREDENTIALS is set in the environment
ext = GoogleVisionAPIFaceExtractor()
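Before initializing a Transformer without arguments, it can be useful to confirm that the variables it expects are actually set. The following is a minimal sketch using only the standard library. The variable names are copied from the table above, while the service labels ("google", "ibm", and so on) and the helper missing_env_vars are hypothetical names introduced for this example and are not part of pliers.

```python
import os

# Variable names come from the table above; the dictionary keys are
# hypothetical labels chosen for this sketch.
REQUIRED_ENV_VARS = {
    "wit": ["WIT_AI_API_KEY"],
    "ibm": ["IBM_USERNAME", "IBM_PASSWORD"],
    "google": ["GOOGLE_APPLICATION_CREDENTIALS"],
    "microsoft_vision": ["MICROSOFT_VISION_SUBSCRIPTION_KEY",
                         "MICROSOFT_SUBSCRIPTION_LOCATION"],
    "microsoft_face": ["MICROSOFT_FACE_SUBSCRIPTION_KEY",
                       "MICROSOFT_SUBSCRIPTION_LOCATION"],
    "clarifai": ["CLARIFAI_ACCESS_TOKEN", "CLARIFAI_USER_ID",
                 "CLARIFAI_APP_ID"],
    "revai": ["REVAI_ACCESS_TOKEN"],
}


def missing_env_vars(service, environ=None):
    """Return the variables for `service` that are unset or empty."""
    environ = os.environ if environ is None else environ
    return [name for name in REQUIRED_ENV_VARS[service]
            if not environ.get(name)]


if __name__ == "__main__":
    for service in REQUIRED_ENV_VARS:
        missing = missing_env_vars(service)
        print(service, "OK" if not missing else f"missing: {missing}")
```

An empty list for a given service means every variable listed for it in the table is present; it does not verify that the credentials themselves are valid.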