Decision Tree to predict the value of a continuous target variable
Predict the value of a continuous variable such as price, turn around time, or mileage using
wink-regression-tree. It is a part of wink — a growing family of high quality packages for Statistical Analysis, Natural Language Processing and Machine Learning in NodeJS.
Use npm to install:
npm install wink-regression-tree --save
Here is an example of predicting car’s mileage (miles per gallon - mpg) from attributes like displacement, horsepower, acceleration, country of origin, and few more. A sample data row is given for quick reference:
|Toyota Mark II||20||6||large displacement||high power||high weight||slow||73||Japan|
The code below provides a potential configuration to predict the value of miles per gallon:
// Load wink-regression-tree.var regressionTree = ;// Load cars training data set.// In practice an async mechanism may be used to// read data asynchronously and call `ingest()` on// every row of data read.var cars = ;// Create a sample data to test prediction for// Ford Gran Torino, having "mpg of 14.5", very// large displacement, extremely high power, very// high weight, slow, and with origin as US.var input =model: 'Ford Gran Torino'weight: 'very high weight'displacement: 'very large displacement'horsepower: 'extremely high power'origin: 'US'acceleration: 'slow';// Above record is not the part of training data.// Create an instance of the regression tree.var rt = ;// Specify columns of the training data.var columns =name: 'model' categorical: true exclude: truename: 'mpg' categorical: false target: truename: 'cylinders' categorical: true exclude: falsename: 'displacement' categorical: true exclude: falsename: 'horsepower' categorical: true exclude: falsename: 'weight' categorical: true exclude: falsename: 'acceleration' categorical: true exclude: falsename: 'year' categorical: true exclude: truename: 'origin' categorical: true exclude: false;// Specify configuration for learning.var treeParams =minPercentVarianceReduction: 05minLeafNodeItems: 10minSplitCandidateItems: 30minAvgChildrenItems: 2;// Define the regression tree configuration using// `columns` and `treeParams`.rt;// Ingest the data.cars;// Data ingested! Now time to learn from data!!console;// -> 16 (Number of Rules Learned)// Predict the **mean** value.var mean = rt;console;// -> 14.3 ( compare with actual mpg of 14.5 )// In practice one may like to compute a range// or upper limit using the `modifier` function// during prediction. Note `size`, `mean`, and `stdev`// values, passed to this function, can be used// for computing the range or the upper limit.
Try experimenting with this example on Runkit in the browser.
For detailed API docs, check out http://winkjs.org/wink-regression-tree/ URL!
If you spot a bug and the same has not yet been reported, raise a new issue or consider fixing it and sending a pull request.
Copyright & License
wink-regression-tree is copyright 2017 GRAYPE Systems Private Limited.
It is licensed under the under the terms of the GNU Affero General Public License as published by the Free Software Foundation, version 3 of the License.