Credit Risk Modelling - Case Study- Lending Club Data

To build a good model, it is important to use high quality data. For the purpose of this course, we will use the loan data available From LendingClub's website. LendingClub is a US peer-to-peer lending company which matches borrowers with investors willing to fund their loans.

LendingClub provides rich datasets a which are available to public for analysis and can be downloaded from the following link:

https://www.lendingclub.com/info/download-data.action

We have downloaded the data for 2016 Quarter 1, which contains complete loan data for all loans issued during Q1 2016, including the current loan status (Current, Late, Fully Paid, etc.) and latest payment information. When you download the data file (csv), the original file is named LoanStats_2016Q1.csv. We have renamed the file to loan_data_2017.csv for simplicity. (Note: This is a big file - 109MB)

Before we begin building our model, our first step would be to explore the data, understand more about it, and also transform the data where necessary so that it is suitable for analysis. We have intentionally chosen to demonstrate things here starting with the raw data file in order to emulate a real model building scenario.

Download the data file either directly from the LendingClub website or from the link below:

Downloads

Once the file is downloaded, move it to your R working directory. This tutorial is creating on a MacBook, so the paths are set according the MacBook’s folder structure.

> setwd("~/Dropbox/Finance Train/Courses/Credit_Risk_Modelling_R")

The above command sets the given folder as the working directory where I have stored the data file.

Lesson Resources

All Users

Related Downloads

Data Science in Finance: 9-Book Bundle

Data Science in Finance Book Bundle

Master R and Python for financial data science with our comprehensive bundle of 9 ebooks.

What's Included:

  • Getting Started with R
  • R Programming for Data Science
  • Data Visualization with R
  • Financial Time Series Analysis with R
  • Quantitative Trading Strategies with R
  • Derivatives with R
  • Credit Risk Modelling With R
  • Python for Data Science
  • Machine Learning in Finance using Python

Each book includes PDFs, explanations, instructions, data files, and R code for all examples.

Get the Bundle for $39 (Regular $57)
JOIN 30,000 DATA PROFESSIONALS

Free Guides - Getting Started with R and Python

Enter your name and email address below and we will email you the guides for R programming and Python.

Data Science in Finance: 9-Book Bundle

Data Science in Finance Book Bundle

Master R and Python for financial data science with our comprehensive bundle of 9 ebooks.

What's Included:

  • Getting Started with R
  • R Programming for Data Science
  • Data Visualization with R
  • Financial Time Series Analysis with R
  • Quantitative Trading Strategies with R
  • Derivatives with R
  • Credit Risk Modelling With R
  • Python for Data Science
  • Machine Learning in Finance using Python

Each book comes with PDFs, detailed explanations, step-by-step instructions, data files, and complete downloadable R code for all examples.