
Mining for Typical Examples in a Collection

Text mining offers the analyst many ways to gain a quick understanding of a collection of similar documents. Summary word statistics describe the frequently occurring words and phrases that the documents in the collection share. Graphical displays such as scatter plots can show relationships between documents. One of the most powerful ways to understand a collection is to view its examples directly. Unfortunately, this is also very time-consuming. One way to make reading example documents more effective is automatic selection of Typical Examples.
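This excerpt does not say how Typical Examples are chosen. As a rough illustration only, the sketch below assumes one plausible approach: represent each document as a bag of word counts, average those into a centroid, and call the documents most similar to the centroid "typical". The function names `tokenize` and `typical_examples`, the cosine similarity measure, and the sample documents are all assumptions made for this example, not the author's method.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def typical_examples(docs, k=1):
    """Return the k documents closest (by cosine) to the collection centroid.

    Assumption: "typical" means "most similar to the average document".
    """
    if not docs:
        return []

    # Bag-of-words vector for each document.
    vectors = [Counter(tokenize(d)) for d in docs]

    # Centroid: mean count of every term across the collection.
    totals = Counter()
    for v in vectors:
        totals.update(v)
    centroid = {term: count / len(docs) for term, count in totals.items()}

    def cosine(vec):
        dot = sum(count * centroid.get(term, 0.0) for term, count in vec.items())
        norm_v = math.sqrt(sum(c * c for c in vec.values()))
        norm_c = math.sqrt(sum(c * c for c in centroid.values()))
        return dot / (norm_v * norm_c) if norm_v and norm_c else 0.0

    # Rank document indices from most to least similar to the centroid.
    ranked = sorted(range(len(docs)), key=lambda i: cosine(vectors[i]), reverse=True)
    return [docs[i] for i in ranked[:k]]


if __name__ == "__main__":
    sample = [
        "printer will not print",
        "printer paper jam will not print",
        "printer jam",
        "password reset email",
    ]
    print(typical_examples(sample, k=2))
```

With raw counts, very common words dominate the centroid. A weighting scheme such as TF-IDF would be a natural refinement, but that too is an assumption beyond what the post states.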


By Scott Spangler, August 24, 2007
Topics: Computer Science

More data isn't always a good thing.

One of the questions we face in mining text data is how much data we really need to draw useful conclusions.


By Scott Spangler, August 15, 2007
Topics: Computer Science