Calmcode - dirty cat: count vectors

Count Vectors

If you want to play around with count vectors you can run the code below.

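from sklearn.feature_extraction.text import CountVectorizer

# ml_df is assumed to be the DataFrame that holds the job titles.
# Fitting learns the vocabulary: one entry per distinct token in the column.
cv = CountVectorizer().fit(ml_df['employee_position_title'])
# Transforming gives a sparse matrix of shape (number of rows, vocabulary size).
cv.transform(ml_df['employee_position_title']).shape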

You can also inspect the learned vocabulary, a dictionary that maps each token to its column index in the matrix.

cv.vocabulary_
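
To make it clearer what the vocabulary and the matrix shape represent, here is a minimal pure-Python sketch of the same idea. It is not scikit-learn's implementation; it only mimics the default behaviour (lowercasing, tokens of two or more word characters, alphabetically ordered columns). The `titles` list and the `tokenize` helper are made up for illustration.

```python
import re
from collections import Counter

# Hypothetical job titles, standing in for ml_df['employee_position_title'].
titles = ["Police Officer III", "Office Services Coordinator", "Police Sergeant"]

def tokenize(text):
    # Lowercase, then keep runs of two or more word characters,
    # which mirrors CountVectorizer's default token pattern.
    return re.findall(r"\b\w\w+\b", text.lower())

# Map each distinct token to a column index, in alphabetical order.
tokens = sorted({tok for title in titles for tok in tokenize(title)})
vocabulary = {tok: idx for idx, tok in enumerate(tokens)}

# Build one row of counts per title, with one column per vocabulary entry.
rows = []
for title in titles:
    counts = Counter(tokenize(title))
    rows.append([counts.get(tok, 0) for tok in tokens])

print(vocabulary)
print((len(rows), len(tokens)))  # same meaning as the .shape of the matrix
```

Each title becomes one row and each distinct token becomes one column, so the shape is always (number of titles, vocabulary size). With real data most entries are zero, which is why scikit-learn returns a sparse matrix instead of nested lists.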