Distributions

Count the frequency of tokens
Compute TF-IDF scores
Return the most important words in the corpus, ranked by TF-IDF score
Return the most important words in each document, ranked by TF-IDF score
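The entries above can be illustrated with a minimal standard-library sketch. It is not this library's implementation: the function names (`token_frequencies`, `tfidf`, `top_words_per_document`, `top_words_per_corpus`) and the toy corpus are hypothetical, and the weighting assumed here is the plain tf = count / document length, idf = log(N / df) variant, whereas real libraries often add smoothing.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus: each document is already tokenized.
docs = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "chased", "the", "cat"],
    ["dogs", "and", "cats", "make", "good", "pets"],
]


def token_frequencies(documents):
    """Count how often each token occurs across all documents."""
    return Counter(token for doc in documents for token in doc)


def tfidf(documents):
    """Return one {token: score} dict per document (unsmoothed TF-IDF)."""
    n_docs = len(documents)
    # Document frequency: number of documents containing each token.
    doc_freq = Counter(token for doc in documents for token in set(doc))
    scores = []
    for doc in documents:
        counts = Counter(doc)
        scores.append({
            token: (count / len(doc)) * math.log(n_docs / doc_freq[token])
            for token, count in counts.items()
        })
    return scores


def _rank(score_map, k):
    # Highest score first; ties broken alphabetically for determinism.
    return [tok for tok, _ in sorted(score_map.items(),
                                     key=lambda kv: (-kv[1], kv[0]))[:k]]


def top_words_per_document(documents, k=3):
    """Return the k highest-scoring tokens of each document."""
    return [_rank(scores, k) for scores in tfidf(documents)]


def top_words_per_corpus(documents, k=3):
    """Return the k tokens with the highest summed score over the corpus."""
    totals = defaultdict(float)
    for scores in tfidf(documents):
        for token, score in scores.items():
            totals[token] += score
    return _rank(totals, k)


frequencies = token_frequencies(docs)
per_document = top_words_per_document(docs, k=3)
per_corpus = top_words_per_corpus(docs, k=3)
```

Summing per-document scores is only one way to rank words for a whole corpus; averaging or taking the maximum would be equally defensible choices.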
Word Vectors

Compute word vectors from corpus
Find similar words to word
Perform word vector maths
Save word vector model to file
Load word vector model from file
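As a rough illustration of these five operations, the sketch below builds vectors from raw co-occurrence counts, which is a deliberately simple stand-in for trained embeddings such as word2vec and is not claimed to be what this library does. All function names, the window size, the JSON file format, and the example sentences are assumptions made for the example.

```python
import json
import math
import os
import tempfile


def cooccurrence_vectors(sentences, window=1):
    """Map each word to a vector of co-occurrence counts over the vocabulary."""
    vocab = sorted({w for sent in sentences for w in sent})
    index = {w: i for i, w in enumerate(vocab)}
    vectors = {w: [0.0] * len(vocab) for w in vocab}
    for sent in sentences:
        for i, word in enumerate(sent):
            lo, hi = max(0, i - window), min(len(sent), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][index[sent[j]]] += 1.0
    return vectors


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_similar(vectors, query, topn=3, exclude=()):
    """Return (word, similarity) pairs closest to the query vector."""
    ranked = sorted(
        ((cosine(query, vec), w) for w, vec in vectors.items() if w not in exclude),
        key=lambda pair: (-pair[0], pair[1]),
    )
    return [(w, score) for score, w in ranked[:topn]]


def combine(vectors, positive, negative=()):
    """Vector arithmetic: sum of the positive words minus the negative words."""
    dim = len(next(iter(vectors.values())))
    out = [0.0] * dim
    for w in positive:
        out = [a + b for a, b in zip(out, vectors[w])]
    for w in negative:
        out = [a - b for a, b in zip(out, vectors[w])]
    return out


def save_vectors(vectors, path):
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(vectors, handle)


def load_vectors(path):
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


# Hypothetical tokenized sentences.
sentences = [
    ["king", "rules", "the", "kingdom"],
    ["queen", "rules", "the", "kingdom"],
    ["dog", "chases", "the", "ball"],
]
vectors = cooccurrence_vectors(sentences)
neighbours = most_similar(vectors, vectors["king"], topn=1, exclude={"king"})
mixed = combine(vectors, positive=["king", "queen"], negative=["king"])

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "vectors.json")
    save_vectors(vectors, path)
    restored = load_vectors(path)
```

JSON keeps the saved model human-readable at the cost of file size; a binary format would be the more usual choice for large vocabularies.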
Topic Modelling

LDA topic modelling with bag-of-words (BoW) input
LDA topic modelling with TF-IDF input
Print the words associated with each topic
Determine the topics of each document
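To make the workflow concrete (fit a model, list the words per topic, assign topics to documents), here is a small sketch of LDA fitted by collapsed Gibbs sampling on bag-of-words input. The inference method, the hyperparameters `alpha` and `beta`, the function names, and the toy corpus are all assumptions for illustration; this library may well rely on a different implementation, and the TF-IDF variant is not covered here.

```python
import random


def lda_gibbs(docs, n_topics=2, n_iter=100, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling; return (vocab, theta, phi)."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    n_words = len(vocab)
    word_id = {w: i for i, w in enumerate(vocab)}

    doc_topic = [[0] * n_topics for _ in docs]             # tokens per (doc, topic)
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens per (topic, word)
    topic_total = [0] * n_topics                           # tokens per topic
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic, up to a constant.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1

    # theta: topic distribution per document; phi: word distribution per topic.
    theta = [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]
    phi = [
        [(c + beta) / (topic_total[k] + n_words * beta) for c in topic_word[k]]
        for k in range(n_topics)
    ]
    return vocab, theta, phi


def top_words(vocab, phi, k=3):
    """Return the k most probable words of each topic."""
    return [
        [vocab[i] for i in sorted(range(len(vocab)), key=lambda i: -row[i])[:k]]
        for row in phi
    ]


def dominant_topics(theta):
    """Return the index of the most probable topic for each document."""
    return [max(range(len(row)), key=row.__getitem__) for row in theta]


# Hypothetical tokenized corpus.
corpus = [
    ["cat", "dog", "pet", "cat", "dog"],
    ["dog", "pet", "vet", "cat"],
    ["stock", "market", "price", "stock"],
    ["market", "price", "trade", "stock"],
]
vocab, theta, phi = lda_gibbs(corpus, n_topics=2)
for topic_index, words in enumerate(top_words(vocab, phi)):
    print(topic_index, words)
document_topics = dominant_topics(theta)
```

The sampler is randomized, so topic indices and word rankings depend on the seed; only the shape of the outputs is guaranteed.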
Sentiment Analysis

Compute the sentiment strength of English texts
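One simple way to score sentiment strength for English text is a lexicon lookup, sketched below. The lexicon, the negation list, the normalization to the range [-1, 1], and the name `sentiment_strength` are all hypothetical; how this library actually computes its score is not stated here.

```python
import re

# Tiny hypothetical lexicon: word -> signed strength.
LEXICON = {
    "good": 1.0, "great": 2.0, "excellent": 3.0,
    "bad": -1.0, "awful": -2.0, "terrible": -3.0,
}
NEGATIONS = {"not", "no", "never"}
MAX_STRENGTH = max(abs(value) for value in LEXICON.values())


def sentiment_strength(text):
    """Return a score in [-1, 1]; 0.0 when no lexicon word is found."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total, hits = 0.0, 0
    for i, token in enumerate(tokens):
        if token in LEXICON:
            value = LEXICON[token]
            # Flip the sign when the previous token is a negation.
            if i > 0 and tokens[i - 1] in NEGATIONS:
                value = -value
            total += value
            hits += 1
    if hits == 0:
        return 0.0
    # Average strength of matched words, scaled by the largest magnitude.
    return total / (MAX_STRENGTH * hits)


positive = sentiment_strength("The food was great")
negated = sentiment_strength("The service was not good")
neutral = sentiment_strength("The table is wooden")
```

A lexicon approach like this is tied to the language of its word list, which is consistent with the restriction to English texts.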
Keyword Extraction

Extract keywords from a piece of text
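As a baseline illustration, keywords can be approximated by the most frequent non-stopword tokens. The stopword list, the minimum token length, and the name `extract_keywords` are assumptions for this sketch; the library may use a different technique entirely.

```python
import re
from collections import Counter

# Small hypothetical stopword list; real lists are much longer.
STOPWORDS = {
    "a", "an", "and", "are", "for", "in", "into", "is", "of", "on",
    "the", "to", "with",
}


def extract_keywords(text, k=5):
    """Return the k most frequent tokens that are not stopwords."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(
        token for token in tokens
        if token not in STOPWORDS and len(token) > 2
    )
    # Most frequent first; ties broken alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [token for token, _ in ranked[:k]]


text = "Topic models find topics in text. A topic model groups words into topics."
keywords = extract_keywords(text, k=2)
```

Because no stemming is applied, "topic" and "topics" are counted as separate keywords in this sketch.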