Currently, we can only view the feature weights (or coefficient estimates) of a trained linear regression through 'visualize' option but not possible to save this as a table or dataset.
I am experimenting on a market-mix model to understand the incremental sales lift by each media variable, so I need to save the regression estimates.
Is there any workaround for this other than to use 'Execute R' module.
Related
I am currently working on univariate GARCH models with different specifications and got stuck on including the exponential term in the variance equation:
mean model (setting ω4 = 0)
variance model
I am using the rugarch package in R and (unsuccessfully) tried the 'eGARCH' model type and external regressor option for the recession dummy INBER to get the estimates. Is this generally the correct way for including the exponential part or am I completely off?
I usually use R to make my own statistical models based on data that I have.
However, I have recently read about a logistic regression model in a scientific publication and I want to replicate this model to make predictions on some of my own data, which includes the same variables.
Is there a way to "declare" a model in R, based on the coefficients published in the paper?
In Azure ML, I have a predictive regression model using boosted decision tree regression and it is reasonably accurate.
The input dataset has over 450 columns and the model has done a good job of predicting against test data sets, without over-fitting.
To report on the result i need to know what features/columns the model mainly used to make predictions but i cant find this information easily when looking at the trained model data.
How do i identify this information? Im happy to import the result dataset into R to help find this but I just need pointers on what direction to start working in.
Mostly, in using Microsoft Azure Machine Learning, when looking at the features that is mainly used to make predictions, it is found on the output of the Train Model module.
But on using Decision Trees as your algorithm, the output of your Train Model module would be the constructed 'trees' of the algorithm, and it looks like this:
To know the features that made impact on predictions while using Decision Trees algorithms, you can use the Permutation Feature Importance module. Look at the sample experiment below:
The parameters of Permutation Feature Importance are Random Seed and Metric for Measuring Performance (in this case, Regression - Coefficient of Determination)
The left input of Permutation Feature Importance is your trained model, and the right input is your test data.
The output of Permutation Feature Importance looks like this:
You can add Execute R Script to extract the Features and Scores from Permutation Feature Importance module.
I have data that seems to fit a polynomial regression much better than a linear regression, but Azure Machine Learning Studio doesn't have native support for polynomial regressions. Is there a way to transform data and produce a web service capable of predicting values that fit this type of data?
I found a nice article here that talks about how to train a model to fit a polynomial regression even though Azure ML doesn't currently natively supports only linear regressions. The TL;DR of the article calls for adding one or more columns to the data with the square, cube, etc. of the label column in order to improve the accuracy of the model. That approach has improved my model's coefficient of determination to 0.973346. However, when I create the "predictive experiment" and the web service to predict new values, and ask the service to predict a new value, the accuracy is horrendous, like not even in the ballpark.
How can I train a predictive model to data that fits a polynomial regression using Azure Machine Learning Studio?
How does Azure ML handle categorical columns during training a linear regression model? A linear regression model takes continuous values. However, even though I haven't changed anything of those categorical columns, Azure ML trains linear and logistic regression without error. So I would like to know how Azure ML manages to process categorical columns behind the scene. Thanks!
It depends upon the model you are using, but you can get clues to how it's done by right-clicking on the "Train Model" element in your experiment, then clicking "Trained Model" -> "Visualize". The visualization will show you how it's used the supplied data.
The linear regression module will only take numeric independent variables. Are you sure you had this working with categoricals in the linear regression?
https://msdn.microsoft.com/en-us/library/azure/dn905978.aspx