Characterize subcorpora
Once we've identified an interesting subcorpus, how do we describe it?
Take, for example, the subcorpus of "most futuristic" articles.
Can we say, for instance:
- it uses certain terms (drugs, diseases, techniques, treatments) more often than the full corpus
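
One way to check a claim like this is to compare each term's relative frequency in the subcorpus against its relative frequency in the full corpus. The sketch below is only an illustration under stated assumptions: the function names (`term_counts`, `overrepresented_terms`), the naive regex tokenizer, the add-one smoothing, and the toy documents are all hypothetical and not taken from this project. A smoothed frequency ratio is just one possible measure; other keyness statistics could be substituted.

```python
import re
from collections import Counter


def term_counts(docs):
    """Count lowercase alphabetic tokens across a list of document strings."""
    counts = Counter()
    for doc in docs:
        counts.update(re.findall(r"[a-z]+", doc.lower()))
    return counts


def overrepresented_terms(subcorpus, corpus, min_count=2, smoothing=1.0):
    """Rank terms by (relative frequency in subcorpus) / (relative frequency in corpus).

    Ratios above 1 mean the term is used more often in the subcorpus.
    Additive smoothing (an assumption of this sketch) avoids division by zero.
    """
    sub = term_counts(subcorpus)
    full = term_counts(corpus)
    vocab_size = len(set(sub) | set(full))
    sub_total = sum(sub.values()) + smoothing * vocab_size
    full_total = sum(full.values()) + smoothing * vocab_size

    ratios = []
    for term, count in sub.items():
        if count < min_count:
            continue  # skip terms too rare in the subcorpus to judge
        sub_rel = (count + smoothing) / sub_total
        full_rel = (full[term] + smoothing) / full_total
        ratios.append((term, sub_rel / full_rel))
    return sorted(ratios, key=lambda pair: pair[1], reverse=True)


# Toy data (invented for illustration): the first two documents play the
# role of the "interesting" subcorpus.
corpus = [
    "gene therapy cures disease",
    "gene therapy trial",
    "aspirin treats headache",
    "aspirin dose study",
    "aspirin trial results",
]
subcorpus = corpus[:2]
ranked = overrepresented_terms(subcorpus, corpus)
for term, ratio in ranked:
    print(f"{term}: {ratio:.2f}")
```

The `min_count` threshold is there because ratios for terms that appear only once in the subcorpus are unreliable; a suitable value would depend on the size of the real subcorpus.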