Simple Text Mining

University of Michigan
posted on 08/27/2010

Detailed Description

Background
Text and network mining software is largely out of reach for the typical analyst end-user. Generally available text mining algorithms require extensive programming to implement, and the more complex ones have a steep learning curve that demands a long-term commitment of professional software developer resources. As a result, such solutions usually cannot be implemented by the typical analyst or small business.
Technology
The University of Michigan has developed an Excel-based tool and algorithm for text mining. The tool 'reads' each block of unstructured text for every word in a user-supplied lexicon and assembles the words found into a common network analysis data structure called an "edge list." The output also includes descriptive data on the weight of each lexicon word found, which supports analysis of the terms themselves. The network output supports analysis of term "adjacency" (terms appearing together in the same block of unstructured text), the computation of network analysis measures, and the production of network visualizations. Outputs also include user-specified data dimensions, carried over from the text input, so that results are more descriptive and easier to cross-reference.
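The general idea of lexicon matching and edge-list assembly can be illustrated with a minimal Python sketch. This is not the Excel tool's embedded algorithm, which is not described in detail here. The function name mine_blocks, the sample text, the use of a simple occurrence count as the term "weight," and the use of a block identifier as the carried-over data dimension are all assumptions made for illustration.

```python
import re
from collections import Counter
from itertools import combinations


def mine_blocks(blocks, lexicon):
    """Scan (block_id, text) pairs for single-word lexicon terms.

    Returns two lists:
      term_weights: (block_id, term, count) rows, where count is an
                    assumed stand-in for the tool's "weight" output
      edge_list:    (term_a, term_b, n_blocks) rows, one per pair of
                    terms found together in at least one block
    """
    terms = {t.lower() for t in lexicon}
    term_weights = []
    edge_counts = Counter()
    for block_id, text in blocks:
        # Crude tokenizer (assumption): lowercase runs of letters, digits, apostrophes.
        tokens = Counter(re.findall(r"[a-z0-9']+", text.lower()))
        found = sorted(t for t in terms if tokens[t] > 0)
        for term in found:
            term_weights.append((block_id, term, tokens[term]))
        # Every pair of terms found in the same block is one "adjacency".
        for a, b in combinations(found, 2):
            edge_counts[(a, b)] += 1
    edge_list = sorted((a, b, n) for (a, b), n in edge_counts.items())
    return term_weights, edge_list


# Hypothetical input: two text blocks, each tagged with an identifier.
blocks = [
    ("doc1", "Gene expression in the mouse model. Mouse gene data."),
    ("doc2", "A mouse model of protein folding."),
]
lexicon = ["gene", "mouse", "protein"]

term_weights, edge_list = mine_blocks(blocks, lexicon)
for row in edge_list:
    print(row)
```

Each edge-list row names two lexicon terms and the number of blocks in which both appear, a form that common network analysis and visualization software can typically import.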
Applications and Advantages
Applications
• Analysis of unstructured text for a large number of known lexical terms
• Analysis of occurrence and adjacency (co-occurrence) of terms in papers, abstracts, etc.
Advantages
• Approachability/ease-of-use (single-click processing of input text)
• Easy copy/paste of input/output data

File Number: 4730 


License Online

4730 Opensource License

Item type: Software

This is an Excel document with embedded algorithms. It is licensed for use under a type of Opensource License. By continuing to download and use, y...
$0.00

People

Case Manager: Doug Hockstad
