enlp.understanding.distributions.important_words_per_corpus¶
-
enlp.understanding.distributions.important_words_per_corpus(scores, n=10)[source]¶ Based on tfidf scores, return most important words per corpus
- Parameters
- scores
pandas.DataFrame pandas dataframe where every word is a feature and every document is an observation, computed by compute_tfidf method
- n
int number of important words to return
- scores
- Returns
- imp_words
list list of tuples of important word and their average tfidf score across the corpus
- imp_words