Data Mining in Business Research Executive Summary Term Paper

Data Mining in Business Research

Executive Summary

A perfect storm is brewing over analytics and their use to define the

future of data mining and its associated techniques and technologies

(Columbus, 1). Data mining, its adjunct technologies for text mining, and

the ability to interpret, analyze and create linguistic models from

unstructured content are revolutionizing the concept of data mining, moving

it away from purely structured content in data warehouses to also

encompass unstructured content found throughout organizations globally.

Text mining's growth is attributable to many factors, which include but

are not limited to the following. First, there is the rapid growth and

popularity of Google, its search engine technologies, and its many patents

on taking unstructured text content and turning it into intelligence

(Google, Their most famous patents surround latent semantic

indexing, an approach where a model is constructed from the similarities of

linguistic content to create threads of intelligence on specific topics.

There is also development by some of the world's largest software

companies in the area of Natural Language Processing (IBM Natural Language

Processing, 3), where unstructured or unorganized content is captured,

analyzed for trends and assumed interrelationships, and then put to use,

for example in interpreting the "chatter" of terrorists in intercepted cell

phone, e-mail, and instant messaging traffic between cells in the United States and

other nations. This technology can in effect create a hierarchy of senders

and receivers of content and what their key messages are, which allows

intelligence organizations worldwide to plot the organization charts of

terrorist organizations. Second, text mining is growing rapidly due to the

burgeoning growth of blogs, and the insights obtainable from these and

other forms of consumer-generated media. Companies are actively mining

blogs to discover what consumers are saying about them. Third, text mining

is becoming pervasive because companies of all types have their most

important information stored in systems as comments, and they need to

interpret these comments and form a common thread of knowledge from them.

Fourth, compliance in its many forms is making text mining a critical

ingredient of every manufacturer's mix of tools for getting a forward look

at what is happening in their installed base of customers. Warranties

specifically have been a focus for American automobile

manufacturers given the TREAD Act, which was prompted by the string of

Firestone tire accidents on Ford Explorers (Mixer, 1). The Sarbanes-Oxley

Act is also forcing the issue of text mining (Hagerty, 2). Taken

together, all these factors and more are making text mining a critical link

in both government and commercial strategies for defense, security,

insight, and growth.

Data mining's growth has been equally impressive, yet can be attributed to

different factors than text mining's. First, there is demand from within

many companies for creating a centralized repository of all content so the

data mart or data warehouse can be quickly accessed for insights and

intelligence (Columbus and Murphy, 4). The future of data mining is in

predictive analytics, which is the ability to interpret and then define the

future of key areas of any company based on their previous performance.

The intent of this paper is to provide insights into these areas of data

mining and into the high levels of growth that analytics use and

applications software are currently experiencing as a result.

Using Data Mining in Business Research

