eaton, Okay, #datascience and #nlp friends. I’m poking around for the “right way” to approach a problem: I want to calculate the overall homogeneity of many short snippets of text (phrases and sentences), and many large spans of text (500-1500 word documents).