<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Online Marketing Analytics SEO SEM &#187; data mining</title>
	<atom:link href="http://www.praveenkodur.com/blog/tag/data-mining/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.praveenkodur.com/blog</link>
	<description>Contact praveen.kodur@gmail.com</description>
	<lastBuildDate>Thu, 22 Dec 2011 11:13:38 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Marketing Data Mining &#8211; It&#8217;s easy to do badly</title>
		<link>http://www.praveenkodur.com/blog/2009/08/marketing-data-mining-its-easy-to-do-badly/</link>
		<comments>http://www.praveenkodur.com/blog/2009/08/marketing-data-mining-its-easy-to-do-badly/#comments</comments>
		<pubDate>Sat, 01 Aug 2009 11:12:51 +0000</pubDate>
		<dc:creator>kpraveenkumars</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Google Products]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[marketing data modeling]]></category>

		<guid isPermaLink="false">http://www.praveenkodur.com/blog/?p=126</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p>Companies are slowly adopting data mining as a concept, and they are looking for quick and easy solutions to their problems. At the other end, a whole host of companies offer software and tools for data mining.</p>
<p>There is a danger here. Easy-to-use GUI tools applied to the large amounts of data now available tempt users to rely on the black-box methodologies in those tools to solve their business problems. They mistakenly assume that data mining is only a matter of running data through a tool. This can be hazardous, because business decisions are taken on the results.</p>
<p>A little knowledge is dangerous when applying powerful models.</p>
<p>The underlying logic that automated tools use for model building is often unknown, and their internal assumptions are unclear. It is therefore very easy to do the work the wrong way while assuming it is the right solution. Such mistakes are still made in companies.</p>
<p>People sometimes argue that there is always a simpler solution to a problem. I believe in Occam&#8217;s Razor, which holds that the simplest explanation is generally the best one. This means the interpretation of the model and the final solution should be as simple as possible. However, the method of arriving at the solution must be as detailed as possible, taking every factor and element into consideration. Analysts must not make the mistake of oversimplifying the method and making too many assumptions.</p>
<p>Many knowledge workers believe that the predictive modeling process can be automated with tools and statistical software. Such software is certainly useful, but it cannot replace the intellect of the analyst.</p>
<p>The process of building the model must be as detailed as possible. The model has to be tried and tested on various data sets and measured against various parameters. All possible factors must be considered while building it.</p>
<p>The analyst must therefore understand the underlying algorithm, the process of arriving at the score, the model design and so on.</p>
<p>More articles on online marketing, data mining and data modeling can be found at <a title="Online Marketing India" href="http://www.praveenkodur.com/blog" target="_blank">http://www.praveenkodur.com/blog</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.praveenkodur.com/blog/2009/08/marketing-data-mining-its-easy-to-do-badly/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Mathematical Models &amp; Data Mining Models</title>
		<link>http://www.praveenkodur.com/blog/2009/07/mathematical-models-data-mining-models/</link>
		<comments>http://www.praveenkodur.com/blog/2009/07/mathematical-models-data-mining-models/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 17:33:13 +0000</pubDate>
		<dc:creator>kpraveenkumars</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Google Products]]></category>
		<category><![CDATA[Internet Marketing]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[mathematical models]]></category>
		<category><![CDATA[statistical problems]]></category>

		<guid isPermaLink="false">http://www.praveenkodur.com/blog/?p=120</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p>Decision makers and knowledge workers look for information and knowledge of various kinds to make strategic or even mundane decisions. The process of extracting information and knowledge from data is called data mining. This knowledge can be represented in various formats: it could be as simple as a mean, median, mode or count, or it could be graphical, such as a histogram, line chart, pie chart, trend line or moving average. Advanced techniques such as learning models and business optimization are the next level of knowledge requirement for the organization.</p>
<p>Even a tool as simple as a spreadsheet can be extremely helpful in providing a mental representation of the business situation. The most commonly used statistical techniques can be implemented in a spreadsheet such as MS Excel.</p>
<p>There are some very important techniques in business intelligence analysis. The first is defining the objective and the performance indicators, which are the metrics used to estimate the performance of an object (entity). The next is developing mathematical relationships between variables and metrics by finding patterns. The last is What-If analysis, which determines the variation in the output metric as the input variables change.</p>
<p>The advantage of using mathematical models goes beyond increasing performance and ROI. They help knowledge workers analyse the business and the underlying product or domain more deeply. This increases awareness in the company, knowledge transfer within the company, and the desire to learn. It encourages intellectual thinking within the company and promotes people with good analytical skills who can offer great value to the company.</p>
<p>Regression and classification are some of the popular mathematical models; however, predictive analytics is not limited to these methods.</p>
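<p>The What-If step can be sketched in a few lines of Python. This is only an illustration: the profit formula, the function names <code>campaign_profit</code> and <code>what_if</code>, and all the numbers are assumptions made up for the example, not figures from a real campaign.</p>

```python
def campaign_profit(clicks, conversion_rate, margin_per_sale, cost_per_click):
    """Hypothetical output metric: profit of a paid campaign."""
    revenue = clicks * conversion_rate * margin_per_sale
    cost = clicks * cost_per_click
    return revenue - cost


def what_if(model, baseline, variable, values):
    """Re-run the model, changing one input and holding the others fixed."""
    results = {}
    for value in values:
        scenario = dict(baseline, **{variable: value})
        results[value] = model(**scenario)
    return results


# Made-up baseline inputs for the illustration.
baseline = {
    "clicks": 1000,
    "conversion_rate": 0.02,
    "margin_per_sale": 50.0,
    "cost_per_click": 0.5,
}

scenarios = what_if(campaign_profit, baseline, "conversion_rate", [0.01, 0.02, 0.03])
for rate, profit in scenarios.items():
    print(f"conversion_rate={rate:.2f} -> profit={profit:.2f}")
```

<p>The same loop can be pointed at any other input, for example <code>cost_per_click</code>, to see which variable the output metric is most sensitive to.</p>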
<p>Regression:      Linear Regression, kNN, CART, Neural Net</p>
<p>Classification: Logistic Regression, Bayesian Methods, Discriminant Analysis, Neural Net, kNN, CART.</p>
<p>Each method has limitations and advantages. The right model and the right mathematical technique have to be chosen for each problem, along with the underlying business value that each technique is meant to increase.</p>
<p>The best techniques are formulated after testing and evaluating each approach and measuring its impact on success or performance. All mathematical models use simple statistical techniques; the value, however, lies in mapping the business problem into a mathematical problem, and this requires some intellectual talent.</p>
<p>The models developed can be extremely useful in business-critical processes such as sales, marketing and product.</p>
<p>More on such topics will be published on <a title="Online Marketing India" href="http://www.praveenkodur.com/blog/ " target="_self">http://www.praveenkodur.com/blog/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.praveenkodur.com/blog/2009/07/mathematical-models-data-mining-models/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Analytics Solution for Business</title>
		<link>http://www.praveenkodur.com/blog/2009/07/analytics-solution-for-business/</link>
		<comments>http://www.praveenkodur.com/blog/2009/07/analytics-solution-for-business/#comments</comments>
		<pubDate>Tue, 28 Jul 2009 18:19:14 +0000</pubDate>
		<dc:creator>kpraveenkumars</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Google Products]]></category>
		<category><![CDATA[Internet Marketing]]></category>
		<category><![CDATA[Web Analytics]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[predictive analytics]]></category>

		<guid isPermaLink="false">http://www.praveenkodur.com/blog/?p=112</guid>
		<description><![CDATA[techniques ]]></description>
			<content:encoded><![CDATA[<p>Analytics plays an important role in customer relationship management (CRM). CRM is an area that covers a whole host of activities, using tools that range from CRM software to campaign management, call tracking and database marketing tools. Each tool holds customer information and in some way has a touch point with the customer. To build a good business, it is important to deal with each customer individually rather than to focus on the competitor.</p>
<p>The focus for many companies, such as banks, insurance companies and telecommunication companies, is to track the customer end to end: from the time money is spent on customer acquisition, through the time revenue is generated from the customer for a product or service, until the customer attrites from the firm. Every activity in the chain must be closely observed to evaluate the true value of the customer. Companies worldwide are creating processes to deal with customers individually. This can help them devote more attention to customers who are more valuable to the business and let go of customers who are not.</p>
<p>Data mining requires a lot of effort, technique and focus from companies that centre their business around the customer rather than a product. Companies have to keep a constant watch on what their customers are doing, keep their past actions in mind, discover knowledge from those actions, and finally use that knowledge cleverly to make profitable decisions.</p>
<p>However, data mining is not always beneficial for the user. Suppose a model recommends a service to a user instead of product A, but the business makes more profit on the product and a negligible amount on the service. The model's recommendation may then not be implemented. It still helps in understanding such customers.</p>
<p>A good question to ask is how a consumer company with a large customer base can deal with each consumer individually. This can be accomplished by deploying effective technology solutions that are customisable on the basis of data mining models and techniques. Nowadays the customer is the data entry operator, entering data at various points where it is captured by the system. Consider your bank: your touch points are the ATM, the bank branch, the customer care staff who respond by phone, e-mail and postal mail, and lastly the account, loan or credit card that you hold with the bank. Each transaction records your behaviour and so adds to what the bank knows about you. The bank can use this knowledge to learn more about you and customise the next interaction or touch point with you.</p>
<p>The transaction system records every transaction of the user, enabling the bank to analyse the nature of each one and update your profile with rich insights. Knowledge discovery doesn't end here. It needs the support of a data warehousing system along with extremely good data mining models in order to take action, make decisions and deal with each customer. This will be covered in more detail in coming articles.</p>
<p>The blog <a title="Online Marketing India" href="http://www.praveenkodur.com/blog/">http://www.praveenkodur.com/blog/</a> will be updated with more analytics and online marketing articles.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.praveenkodur.com/blog/2009/07/analytics-solution-for-business/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Marketing Analytics &#8211; Predictive Modeling(I)</title>
		<link>http://www.praveenkodur.com/blog/2009/07/marketing-analytics-i-predictive-modeling/</link>
		<comments>http://www.praveenkodur.com/blog/2009/07/marketing-analytics-i-predictive-modeling/#comments</comments>
		<pubDate>Sat, 11 Jul 2009 11:51:51 +0000</pubDate>
		<dc:creator>kpraveenkumars</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Google Products]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[Online Business]]></category>
		<category><![CDATA[Product Marketing]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[ppc analytics]]></category>
		<category><![CDATA[predictive analytics]]></category>

		<guid isPermaLink="false">http://www.praveenkodur.com/blog/?p=82</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p>Predictive modeling is the most widely used area of data mining in sales and marketing, where it is applied to understand customers and their behavior towards the various products offered by the company. It helps companies understand and predict customer behavior in each specific situation and therefore introduces sophistication in targeting. A lot of information is generally collected about the customer, such as demographic, geographic, lifestyle, attitudinal and behavioral data. This data can be used efficiently to model customer behavior under different circumstances. Through effective models we can improve the ROI of marketing efforts and campaigns and so make an impact on the overall profitability of the business.</p>
<p>Some of the common uses of analytical models are:</p>
<p><strong><span style="text-decoration: underline;">Acquiring new customers:</span></strong> Acquiring new customers is generally a big cost for the company. With prices rising on various acquisition channels, companies find it hard to reduce the cost of acquiring good, profitable customers. Predictive modeling is, to a certain extent, very useful in reducing the cost of acquiring the right customers and increasing profitability. It also helps in designing marketing offers and special campaigns to reach out to customers.</p>
<p>Predictive models use historical data on customer attributes to understand the relationships between those attributes and a specific response or behavior. The output of a predictive model is generally a prediction of the customer's future response given their present data. These models are generally used to rank a list of prospective customers by the likelihood of their predicted response. This is very useful because it helps with present decisions. Other factors can also be fitted alongside the response, such as the risk of acquiring a customer (for example credit risk) or the cost of retaining the customer. We can also predict the response to each specific product, which helps in targeting the customer better and increases the chances of acquiring them or of cross-selling products</p>
<p>(<span style="text-decoration: underline;">http://www.stochasticsolutions.com/pdf/CrossSell.pdf</span>).</p>
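<p>Ranking prospects by predicted response can be sketched as follows. This is a minimal illustration, assuming a logistic scoring function; the intercept, the weights and the attribute names <code>site_visits</code> and <code>past_purchases</code> are hypothetical stand-ins for a model fitted on historical data.</p>

```python
import math

# Hypothetical coefficients, standing in for a model fitted on historical data.
INTERCEPT = -3.0
WEIGHTS = {"site_visits": 0.4, "past_purchases": 0.8}


def response_probability(prospect):
    """Logistic score: estimated likelihood that the prospect responds."""
    z = INTERCEPT + sum(WEIGHTS[name] * prospect[name] for name in WEIGHTS)
    return 1.0 / (1.0 + math.exp(-z))


def rank_prospects(prospects):
    """Order prospects from most to least likely to respond."""
    return sorted(prospects, key=response_probability, reverse=True)


# Made-up prospects for the illustration.
prospects = [
    {"id": "A", "site_visits": 1, "past_purchases": 0},
    {"id": "B", "site_visits": 4, "past_purchases": 2},
    {"id": "C", "site_visits": 3, "past_purchases": 0},
]

ranked = rank_prospects(prospects)
for prospect in ranked:
    print(prospect["id"], round(response_probability(prospect), 3))
```

<p>A campaign with a limited budget would then target only the top of the ranked list. Risk or cost factors could be added as further terms, but that is beyond this sketch.</p>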
<p>There may be situations where it is not easy to connect the customer with the desired response, and situations where several different customer responses are valuable. For example, a customer might purchase a product after visiting the site four times, each time performing an action such as searching, sending an enquiry, contacting a user or reading knowledge material. Each action may be valuable to the business and therefore needs to be included in the model. In all such cases we need to use proxies to understand purchase behavior. Model building is a tedious task, but a very worthwhile effort for increasing profitability.</p>
<p>In the process, additional outputs are generated, such as customer profiling, customer segmentation, clustering and affinity pairs, which are very valuable in developing products for customers.</p>
<p><a href="http://en.wikipedia.org/wiki/Predictive_analytics">http://en.wikipedia.org/wiki/Predictive_analytics</a></p>
<p><strong><span style="text-decoration: underline;">Customer Retention</span>:</strong> The second problem is retaining existing customers, and companies are willing to provide offers to do so. The data mining problem translates into finding the customers at risk (those who are looking to switch) and additionally identifying those customers who are more likely to change their behaviour because of the marketing offer made by the company. (<span style="text-decoration: underline;">http://www.stochasticsolutions.com/pdf/FinanceRetention.pdf</span>).</p>
<p>The useful data will be the behavior of customers over a certain period before attrition, along with their demographic, attitudinal and behavioral patterns.</p>
<p>Predicting customer attrition is separate from predicting a change in behavior caused by a promotional offer. These models are extremely useful where markets are saturated and acquiring new customers becomes increasingly difficult.</p>
<p>Marketing techniques for retaining customers can sometimes backfire, resulting in the loss of a customer because of persuasion from the company. There are also customers who would attrite irrespective of any marketing offered to them. This behavioral difference can be captured through Control Groups, Test Groups and Hold Groups. The method is called Differential Response modeling, Incremental Impact modeling, Uplift modeling or Net modeling. Here is more information: <a href="http://en.wikipedia.org/wiki/Uplift_modelling">http://en.wikipedia.org/wiki/Uplift_modelling</a>, and a white paper on the same: <span style="text-decoration: underline;">http://www.stochasticsolutions.com/pdf/SavedAndDrivenAway.pdf</span>. Customers in the Control Group are targeted at random, customers in the Test Group are targeted on the basis of the model, and Hold Group customers are not considered for the offer. The results of the test and experiment are used in building the Net Model.</p>
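<p>The comparison between the groups can be sketched as a difference in retention rates. This is a minimal illustration using the group definitions given above (Hold: no offer, Control: offer to a random selection, Test: offer to a model-selected group); the function names and the outcome lists are made up for the example and are not results from any real campaign.</p>

```python
def retention_rate(outcomes):
    """Share of customers retained; outcomes holds 1 (retained) or 0 (attrited)."""
    return sum(outcomes) / len(outcomes)


def incremental_impact(targeted, reference):
    """Difference in retention rate between two groups."""
    return retention_rate(targeted) - retention_rate(reference)


# Made-up outcomes for the illustration.
hold_group = [1, 1, 1, 0, 0, 1, 0, 1, 1, 0]      # no offer sent
control_group = [1, 1, 1, 0, 1, 1, 0, 1, 1, 0]   # offer sent to a random selection
test_group = [1, 1, 1, 1, 1, 1, 0, 1, 1, 1]      # offer sent to a model-selected group

# Effect of the offer itself: random targeting versus no targeting.
offer_uplift = incremental_impact(control_group, hold_group)

# Additional effect of choosing recipients with the model.
model_uplift = incremental_impact(test_group, control_group)

print(f"offer uplift: {offer_uplift:+.2f}")
print(f"model uplift: {model_uplift:+.2f}")
```

<p>A negative offer uplift would indicate the backfire case described above, where the offer drives away more customers than it saves. With groups this small the differences would not be statistically reliable; real tests need far larger samples.</p>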
<p>The industries where these techniques will be highly useful are Financial Services, Retail, Telecom, Internet Companies and Software Houses.</p>
<p>Do check back for more articles on this topic.</p>
<p><span style="text-decoration: underline;"><strong>List of resources to find more information about Analytics</strong></span></p>
<p>1) http://www.destinationcrm.com/Articles/CRM-News/Daily-News/Predictive-Analytics-Can-Pinpoint-Profitable-Customers-52164.aspx</p>
<p>2) http://scientificmarketer.com/search/label/response</p>
<p>3) http://www.redclaymedia.com/response_modeling.php</p>
<p>4) http://stochasticsolutions.com/retention.html</p>
<p>5) http://www.information-management.com/specialreports/2008_62/10000747-1.html?ET=dmreview:e323:1015879a:&amp;st=email</p>
<p>6) http://www.information-management.com/issues/2007_52/10001990-1.html</p>
<p>7) http://www.predictiveanalyticsinsight.com/articles/callcenter.htm</p>
<p>8) http://www.predictiveanalyticsworld.com/predictive_analytics.php</p>
<p>9) http://www.marketingprofs.com/4/shearer1.asp</p>
<p>10) http://semphonic.blogs.com/semangel/2009/01/predictive-analytics-getting-a-legup-on-where-analytics-is-headed-.html</p>
<p>The original article is published at <a title="Online Marketing India" href="http://www.praveenkodur.com/blog" target="_blank">http://www.praveenkodur.com/blog</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.praveenkodur.com/blog/2009/07/marketing-analytics-i-predictive-modeling/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Missing Value Imputation &#8211; Data Analytics</title>
		<link>http://www.praveenkodur.com/blog/2009/07/data-analytics-missing-value-imputation/</link>
		<comments>http://www.praveenkodur.com/blog/2009/07/data-analytics-missing-value-imputation/#comments</comments>
		<pubDate>Tue, 07 Jul 2009 17:06:04 +0000</pubDate>
		<dc:creator>kpraveenkumars</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Internet Marketing]]></category>
		<category><![CDATA[Yahoo Products]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[predictive analytics]]></category>

		<guid isPermaLink="false">http://www.praveenkodur.com/blog/?p=74</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p>Data analytics involves a lot of transformations and therefore requires careful attention to detail. Data generally contains many inconsistencies, and the most common is missing values. Even a modest number of missing values scattered throughout the data set will cause a significant reduction in the usable sample when incomplete records are dropped. There are various methods for handling missing values in the data. Replacing a missing value with an estimate is known as imputation.</p>
<p>1) When the dependent variable contains missing values, simply eliminate those records.</p>
<p>2) Identify the right slices of the data and substitute a measure of central tendency such as the median, mean or mode. Identifying the right slice is important: you can group by various parameters and take the central tendency within each group. Choose the grouping that shows the strongest association with the variable, for example as measured by a chi-square statistic.</p>
<p>3) If the variable follows a normal distribution, generate the missing value with the normal inverse function, using the mean and standard deviation of the observed values.</p>
<p>4) Treat the variable with missing values as the dependent variable in a regression equation and use multiple linear regression to impute it. Instead of regression you can try other methods such as classification or decision trees.</p>
<p>5) Use business logic to understand the missing values.</p>
<p>6) Check the data capture process; there could be an error at the source of data entry. This also helps identify whether data points are missing at random or not. If values are missing at random you can use simple imputation; if not, you need advanced techniques to impute them. Also look at the bias in the particular column: if the bias is significant you need advanced techniques, and if it is minimal you can proceed with simple imputation.</p>
<p>7) Identify the list of possible values for the missing data. Substitute each possible value in turn, create a separate data set for each and build the model on it. Calculate the differences in accuracy and consistency across the substitutes. This way you can add the variation of the values to the missing element and reduce bias.</p>
<p>8) Use regression to determine the distribution of the values in place of the missing values. Create a What-If scenario by imputing every range of values.</p>
<p>9) Do not impute: remove the records with missing values and duplicate records from the sample to restore the size of the data set.</p>
<p>10) Measure the similarity of records by treating them as vectors. The similarity is the cosine of the angle between two records; find the records most similar to the one with missing values and take the value from them.</p>
<p>11) Use logistic regression to estimate the likelihood of a value being observed or missing. The output is 0 if the value is missing and 1 otherwise, and the rest of the (non-missing) variables act as independent variables. This does not predict the value itself, only the likelihood of finding the variable missing. Records with the same or the closest probability are considered similar, and the missing value is donated from them.</p>
<p>Multiple imputation generally yields better results, but it is computationally demanding and in practice requires statistical software.</p>
<p>This article was originally published on Praveen Kodur's blog, <a title="Online Marketing India" href="http://www.praveenkodur.com/blog/" target="_blank">http://www.praveenkodur.com/blog/</a>.</p>
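<p>Method 2 in the list above, substituting a central tendency within a slice of the data, can be sketched like this. It is a minimal illustration, assuming records are dictionaries and missing values are stored as <code>None</code>; the function name <code>impute_group_median</code>, the field names and the sample data are hypothetical. Falling back to the overall median when a group has no observed values is a choice made for this sketch.</p>

```python
from collections import defaultdict
from statistics import median


def impute_group_median(records, group_key, value_key):
    """Fill missing values with the median of the record's group.

    Assumes at least one record has an observed value. Groups with no
    observed values fall back to the overall median.
    """
    observed = defaultdict(list)
    for record in records:
        if record[value_key] is not None:
            observed[record[group_key]].append(record[value_key])

    overall = median(v for values in observed.values() for v in values)

    imputed = []
    for record in records:
        filled = dict(record)  # copy, so the input is left untouched
        if filled[value_key] is None:
            group_values = observed.get(filled[group_key])
            filled[value_key] = median(group_values) if group_values else overall
        imputed.append(filled)
    return imputed


# Made-up customer records for the illustration.
customers = [
    {"segment": "retail", "income": 30.0},
    {"segment": "retail", "income": 50.0},
    {"segment": "retail", "income": None},
    {"segment": "corporate", "income": 120.0},
    {"segment": "corporate", "income": None},
    {"segment": "student", "income": None},
]

completed = impute_group_median(customers, "segment", "income")
for row in completed:
    print(row)
```

<p>Swapping <code>median</code> for <code>mean</code> or <code>mode</code> from the same module gives the other central tendencies mentioned in the list.</p>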
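<p>The cosine-similarity approach (method 10 in the list above) can be sketched as follows. It is a minimal illustration, assuming every record is a list of numbers on comparable scales and that a single field is missing; the function names and the sample records are hypothetical.</p>

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def impute_from_most_similar(record, complete_records, missing_index):
    """Copy the missing field from the complete record that is most similar
    on the remaining fields."""
    def known(values):
        return [v for i, v in enumerate(values) if i != missing_index]

    donor = max(
        complete_records,
        key=lambda r: cosine_similarity(known(record), known(r)),
    )
    filled = list(record)
    filled[missing_index] = donor[missing_index]
    return filled


# Made-up records; the third field is the one to impute.
complete = [
    [1.0, 5.0, 2.0],
    [4.0, 1.0, 7.0],
    [3.0, 3.0, 4.0],
]
incomplete = [4.2, 0.9, None]

filled = impute_from_most_similar(incomplete, complete, missing_index=2)
print(filled)
```

<p>Cosine similarity compares direction rather than magnitude, so fields with very different ranges should be rescaled first; otherwise the largest field dominates the comparison.</p>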
]]></content:encoded>
			<wfw:commentRss>http://www.praveenkodur.com/blog/2009/07/data-analytics-missing-value-imputation/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

