enlp.understanding.distributions.important_words_per_corpus

enlp.understanding.distributions.important_words_per_corpus(scores, n=10)[source]

Based on tfidf scores, return most important words per corpus

Parameters
scorespandas.DataFrame

pandas dataframe where every word is a feature and every document is an observation, computed by compute_tfidf method

nint

number of important words to return

Returns
imp_wordslist

list of tuples of important word and their average tfidf score across the corpus