pliers.extractors.TextVectorizerExtractor
- class pliers.extractors.TextVectorizerExtractor(vectorizer=None, *vectorizer_args, **vectorizer_kwargs)
Bases: BatchTransformerMixin, TextExtractor
Uses a scikit-learn Vectorizer to extract bag-of-features from text.
- Parameters
vectorizer (sklearn Vectorizer or str) – a scikit-learn Vectorizer instance, or the name of one as a string, to extract with. Defaults to CountVectorizer. Any additional *vectorizer_args and **vectorizer_kwargs are passed along to the vectorizer.
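To make "bag-of-features" concrete, the following is a minimal standard-library sketch of what a count-based vectorizer computes. It is not pliers or scikit-learn code: the function name bag_of_words is hypothetical, and the tokenizer (lowercased words of two or more characters) is a simplifying assumption that only approximates what a real vectorizer does.

```python
import re
from collections import Counter


def bag_of_words(texts):
    """Return (vocabulary, rows), one row of per-term counts per text.

    Hypothetical illustration only; not part of pliers or scikit-learn.
    """
    # Simplified tokenizer (an assumption): lowercase the text and keep
    # runs of two or more word characters.
    tokenized = [re.findall(r"\b\w\w+\b", text.lower()) for text in texts]

    # The vocabulary is the sorted set of all tokens seen in any text.
    vocabulary = sorted({tok for toks in tokenized for tok in toks})

    # Each row counts how often each vocabulary term occurs in one text.
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        rows.append([counts[term] for term in vocabulary])
    return vocabulary, rows


vocab, rows = bag_of_words(["the cat sat", "the cat saw the dog"])
print(vocab)  # ['cat', 'dog', 'sat', 'saw', 'the']
print(rows)   # [[1, 0, 1, 0, 1], [1, 1, 0, 1, 2]]
```

Each column corresponds to one vocabulary term, so every text is reduced to a fixed-length vector of counts regardless of word order.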
- transform(stim, *args, **kwargs)
Executes the transformation on the passed stim(s).
- Parameters
One or more stimuli to process. Must be one of:
A string giving the path to a file that can be read in as a Stim (e.g., a .txt file, .jpg image, etc.)
A Stim instance of any type.
An iterable of stims, where each element is either a string or a Stim.
validation (str) –
String specifying how validation errors should be handled. Must be one of:
'strict': Raise an exception on any validation error
'warn': Issue a warning for all validation errors
'loose': Silently ignore all validation errors
args – Optional positional arguments to pass on to the internal _transform call.
kwargs – Optional keyword arguments to pass on to the internal _transform call.
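The three validation modes can be pictured as a simple dispatch on the mode string. The sketch below is an illustration under stated assumptions, not the pliers implementation: handle_validation_error is a hypothetical helper, and the choice of TypeError for 'strict' is an assumption, since the description above only says that an exception is raised.

```python
import warnings


def handle_validation_error(message, validation):
    """Hypothetical helper showing how the three modes differ.

    Returns False when the error is tolerated, so a caller could skip
    the offending stimulus and continue.
    """
    if validation == "strict":
        # 'strict': raise an exception (the exception type is assumed).
        raise TypeError(message)
    if validation == "warn":
        # 'warn': report the problem but keep going.
        warnings.warn(message)
        return False
    if validation == "loose":
        # 'loose': ignore the problem silently.
        return False
    raise ValueError(f"Unknown validation mode: {validation!r}")
```

Under this sketch, only 'strict' interrupts processing; 'warn' and 'loose' differ solely in whether the problem is reported.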