enlp.understanding.distributions.important_words_per_corpus¶
-
enlp.understanding.distributions.
important_words_per_corpus
(scores, n=10)[source]¶ Based on tfidf scores, return most important words per corpus
- Parameters
- scores
pandas.DataFrame
pandas dataframe where every word is a feature and every document is an observation, computed by compute_tfidf method
- n
int
number of important words to return
- scores
- Returns
- imp_words
list
list of tuples of important word and their average tfidf score across the corpus
- imp_words