Import Credit Data Set in R

We are using the German Credit Scoring Data Set in numeric format which contains information about 21 attributes of 1000 loans.

Downloads

First, setup a working directory and place this data file in that directory. Then, import the data into your R session using the following command:

Attribute Details

20 attributes are used in judging a loan applicant. The goal is the classify the applicant into one of two categories, good or bad, which is the first attribute, Creditability. This represents whether the credit applicant is creditable or not (Credit Worthiness).

Some more notes about the data:

  • Creditability
    • 0: Bad credit
    • 1: Good credit
  • Account Balance
    • 1: < 0 DM
    • 2: < 200 DM
    • 3: >= 200 DM
    • 4: No existing Account
  • Duration of Credit Month - Loan Duration in Months
  • Payment Status of Previous Credit - Credit History
    • 0: No credits taken so far
    • 1: All credit in this Bank paid back duly
    • 2: Existing credits paid back dully till now
    • 3: Delay in paying off in the past,
    • 4: Credits existing in other banks
  • Purpose - Loan Purpose
    • 0:new car purchase
    • 1: used car purchase
    • 2: furniture or equipment purchase
    • 3: radio or television purchase
    • 4: domestic appliances purchase
    • 5: repairs
    • 6: education
    • 7: vacation
    • 8: retraining
    • 9: Business
    • 10: others
  • Credit Amount (In DM - Deutsche Mark)
  • Value of Savings/Stocks
    • 1: < 100 DM
    • 2: >= 100 and < 500 DM
    • 3: >= 500 DM and 1000 DM
    • 4: >= 1000 DM
    • 5: no savings / bonds
  • Length of Current Employment
    • 1: unemployed
    • 2: < 1 year
    • 3: >= 1 and < 4 years
    • 4: >= 4 and < 7 years
    • 5: >= 7 years
  • Instalment Percent: Instalment rate in percentage of disposable income
  • Sex & Marital Status
    • 1: Divorced Male
    • 2: Divorced/Married Female
    • 3: Male Single
    • 4: Married Male
    • 5: Female Single
  • Guarantors
    • 1: None
    • 2: Co-applicant
    • 3: Guarantor
  • Duration in Current Address (In Years)
  • Most Valuable Available Asset
    • 1: Real Estate
    • 2: Life Insurance
    • 3: Car or others
    • 4: No property
  • Age in Years
  • Concurrent Credits
  • Type of Apartment
    • 1: Rented
    • 2: Owned
    • 3: For Free
  • No. of existing credits at this bank
  • Occupation (Job Status)
    • 1: Unemployed non-resident
    • 2: Unemployed resident
    • 3: Skilled Employee
    • 4: Self-Employed
  • No. of dependents
  • Telephone: German phone rates are very high, so fewer people own telephones
    • 1: Available
    • 2: Not Available
  • Foreign worker: There are millions of foreign worker working in Germany
    • 1: No
    • 2: Yes

This dataset is typical of data used in data mining: we have 1000 records.

Lesson Resources

All Users

Related Downloads

Membership
Learn the skills required to excel in data science and data analytics covering R, Python, machine learning, and AI.
I WANT TO JOIN
JOIN 30,000 DATA PROFESSIONALS

Free Guides - Getting Started with R and Python

Enter your name and email address below and we will email you the guides for R programming and Python.

Saylient AI Logo

Take the Next Step in Your Data Career

Join our membership for lifetime unlimited access to all our data analytics and data science learning content and resources.