- Machine Learning with Python
- What is Machine Learning?
- Data Preprocessing in Data Science and Machine Learning
- Feature Selection in Machine Learning
- Train-Test Datasets in Machine Learning
- Evaluate Model Performance - Loss Function
- Model Selection in Machine Learning
- Bias Variance Trade Off
- Supervised Learning Models
- Multiple Linear Regression
- Logistic Regression
- Logistic Regression in Python using scikit-learn Package
- Decision Trees in Machine Learning
- Random Forest Algorithm in Python
- Support Vector Machine Algorithm Explained
- Multivariate Linear Regression in Python with scikit-learn Library
- Classifier Model in Machine Learning Using Python
- Cross Validation to Avoid Overfitting in Machine Learning
- K-Fold Cross Validation Example Using Python scikit-learn
- Unsupervised Learning Models
- K-Means Algorithm Python Example
- Neural Networks Overview
Supervised Learning Models
As we pointed out earlier, both classification and regression models are in the field of Supervised Learning. These models are characterized by having a group of features or independent variables and a target variable that is the variable that the model aims to predict.
This target variable is called the labelled data and is the main property of the Supervised Learning models because it acts as the orientation for constructing the model in the training phase and to evaluate model performance.
In a classification problem the target variable to predict (y) is a categorical variable and can take a finite set of possible choices, K. On the other hand, in the regression problem, the target variable y is a real value rather than categorical.
The classification problem has the goal of estimating membership for a set of features into a particular group. A common classification problem in the financial sector is to determine the price direction for the next day based on N days of asset price history.
The regression problem involves estimate a real value response with a set of features or predictors as the independent variables. In the financial field, an example is to estimate tomorrows asset price based on the historical prices (or other features) of the price. The regression problem would estimate the real value of the price and not just its direction.
Common classification algorithms include Logistic Regression, Naïve Bayes Classifiers, Support Vector Machines, Decision Tree, and Deep Convolution Neural Networks. Common regression techniques include Linear Regression, Support Vector Regression and Random Forest.
In the next lessons, we will explain relevant concepts of the most popular algorithms used for Supervised Learning. These algorithms are the following:
- Multiple Linear Regression
- Logistic Regression
- Decision Tree-Random Forest
- Support Vector Machine
- Linear Discriminant Analysis
Data Science in Finance: 9-Book Bundle
Master R and Python for financial data science with our comprehensive bundle of 9 ebooks.
What's Included:
- Getting Started with R
- R Programming for Data Science
- Data Visualization with R
- Financial Time Series Analysis with R
- Quantitative Trading Strategies with R
- Derivatives with R
- Credit Risk Modelling With R
- Python for Data Science
- Machine Learning in Finance using Python
Each book includes PDFs, explanations, instructions, data files, and R code for all examples.
Get the Bundle for $29 (Regular $57)Free Guides - Getting Started with R and Python
Enter your name and email address below and we will email you the guides for R programming and Python.