Improve readability of results from stochastic block modeling
Visual:
- tree representation
- browseable text files / spreadsheet
- interactive html
Filter vocabulary:
- by word specificity:
d / ( n * ((1-1/n)^f) )
where d is num docs containing word, f is total appearances of word, n is num docs in corpus