10 datasets found
  1. Lending Club

    • figshare.com
    txt
    Updated Mar 6, 2023
    Deepchecks Data (2023). Lending Club [Dataset]. http://doi.org/10.6084/m9.figshare.22121477.v4
    Available download formats
    txt
    Dataset updated
    Mar 6, 2023
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Deepchecks Data
    License

    CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
    License information was derived automatically

    Description

    This dataset is a modified version of the Kaggle Lending Club dataset found at https://www.kaggle.com/datasets/wordsforthewise/lending-club, including a model trained on the training set.

    The data covers Lending Club accepted and rejected loans from 2007 through 2018.

    The dataset is licensed under the CC0 1.0 Universal (CC0 1.0) Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/

  2. Lending Club Loan Data

    • find.data.gov.scot
    • dtechtive.com
    csv
    Updated Mar 14, 2018
    Deloitte Datathon 2018 (uSmart) (2018). Lending Club Loan Data [Dataset]. https://find.data.gov.scot/datasets/39183
    Available download formats
    csv (421.3062 MB), csv (472.0285 MB)
    Dataset updated
    Mar 14, 2018
    Dataset provided by
    Deloitte Datathon 2018 (uSmart)
    Description

    Data related to the peer-to-peer lending company Lending Club. For more info, see: https://www.kaggle.com/wendykan/lending-club-loan-data/data

  3. Lending Club Loan Dataset

    • kaggle.com
    zip
    Updated Jul 21, 2021
    syamalakumar (2021). Lending Club Loan Dataset [Dataset]. https://www.kaggle.com/syamalakumar/lending-club-loan-dataset
    Available download formats
    zip (101852257 bytes)
    Dataset updated
    Jul 21, 2021
    Authors
    syamalakumar
    Description

    Dataset

    This dataset was created by syamalakumar

    Contents

    It contains the following files:

  4. Lending Club Dataset (Practise)

    • kaggle.com
    Updated Dec 6, 2022
    AlvinChow (2022). Lending Club Dataset (Practise) [Dataset]. https://www.kaggle.com/datasets/alvinchowcm/lending-club-dataset-practise
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Dec 6, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    AlvinChow
    Description

    Dataset

    This dataset was created by AlvinChow

    Contents

  5. Lending club train

    • figshare.com
    txt
    Updated Jun 7, 2022
    Deepchecks Data (2022). Lending club train [Dataset]. http://doi.org/10.6084/m9.figshare.20015915.v1
    Available download formats
    txt
    Dataset updated
    Jun 7, 2022
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Deepchecks Data
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description
  6. Data from: Lending Club loan dataset for granting models

    • portalcientifico.uah.es
    • produccioncientifica.ucm.es
    Updated 2024
    Ariza-Garzón, Miller Janny; Sanz-Guerrero, Mario; Arroyo Gallardo, Javier; Lending Club (2024). Lending Club loan dataset for granting models [Dataset]. https://portalcientifico.uah.es/documentos/668fc499b9e7c03b01be2366
    Dataset updated
    2024
    Authors
    Ariza-Garzón, Miller Janny; Sanz-Guerrero, Mario; Arroyo Gallardo, Javier; Lending Club
    Description

    Lending Club offers peer-to-peer (P2P) loans through a technological platform for various personal finance purposes and is today one of the companies that dominate the US P2P lending market. The original dataset is publicly available on Kaggle and corresponds to all the loans issued by Lending Club between 2007 and 2018. The present version of the dataset is for constructing a granting model, that is, a model designed to decide whether to grant a loan based on information available at the time of the loan application. Consequently, our dataset contains only a selection of variables from the original one: those known at the moment the loan request is made. Furthermore, the target variable of a granting model represents the final status of the loan, which is either "default" or "fully paid". Thus, we filtered out of the original dataset all the loans in transitory states. Our dataset comprises 1,347,681 records or obligations (approximately 60% of the original), and it was also cleaned for completeness and consistency (less than 1% of our dataset was filtered out).

    TARGET VARIABLE

    The dataset includes a target variable based on the final resolution of the credit: the default category corresponds to the event charged off and the non-default category to the event fully paid. It does not consider other values in the loan status variable since this variable represents the state of the loan at the end of the considered time window. Thus, there are no loans in transitory states. The original dataset includes the target variable “loan status”, which contains several categories ('Fully Paid', 'Current', 'Charged Off', 'In Grace Period', 'Late (31-120 days)', 'Late (16-30 days)', 'Default'). However, in our dataset, we just consider loans that are either “Fully Paid” or “Default” and transform this variable into a binary variable called “Default”, with a 0 for fully paid loans and a 1 for defaulted loans.
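
    The filtering and binarization described above can be sketched as follows. This is a minimal illustration, not the authors' code. It assumes each loan is a plain dict with a "loan_status" key, and it assumes that "Charged Off" is the raw label mapped to default, following the first sentence of this section. The names FINAL_STATUSES and build_default_target are hypothetical.

```python
# Assumption: raw status labels as listed in the description above;
# "Charged Off" is treated as the default event, "Fully Paid" as non-default.
FINAL_STATUSES = {"Fully Paid": 0, "Charged Off": 1}


def build_default_target(loans):
    """Keep only loans in a final state and attach a binary "Default" field."""
    labelled = []
    for loan in loans:
        status = loan.get("loan_status")
        if status in FINAL_STATUSES:  # loans in transitory states are dropped
            labelled.append({**loan, "Default": FINAL_STATUSES[status]})
    return labelled


sample = [
    {"id": 1, "loan_status": "Fully Paid"},
    {"id": 2, "loan_status": "Current"},
    {"id": 3, "loan_status": "Charged Off"},
]
result = build_default_target(sample)
```

    With this sample, the "Current" loan is discarded and the two remaining loans receive Default values of 0 and 1 respectively.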

    EXPLANATORY VARIABLES

    The explanatory variables that we use correspond only to the information available at the time of the application. Variables such as the interest rate, grade, or subgrade are generated by the company as a result of a credit risk assessment process, so they were filtered out of the dataset, as they must not be used in risk models that predict default at the time credit is granted.

    FULL LIST OF VARIABLES

    Loan identification variables:

    id: Loan id (unique identifier).

    issue_d: Month and year in which the loan was approved.

    Quantitative variables:

    revenue: Borrower's self-declared annual income during registration.

    dti_n: Indebtedness ratio for obligations excluding mortgage. Monthly information. This ratio has been calculated considering the indebtedness of the whole group of applicants. It is estimated as the ratio calculated using the co-borrowers’ total payments on the total debt obligations divided by the co-borrowers’ combined monthly income.

    loan_amnt: Amount of credit requested by the borrower.

    fico_n: Defined between 300 and 850, reported by Fair Isaac Corporation as a risk measure based on historical credit information reported at the time of application. This value has been calculated as the average of the variables “fico_range_low” and “fico_range_high” in the original dataset.

    experience_c: Binary variable that indicates whether the borrower is new to the entity. This variable is constructed from the credit date of the previous obligation in LC and the credit date of the current obligation; if the difference between dates is positive, it is not considered as a new experience with LC.
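
    As a small illustration of how fico_n is derived from the original columns, the sketch below averages the two range bounds. It assumes each loan is a dict carrying the original "fico_range_low" and "fico_range_high" fields; the helper names are hypothetical.

```python
def fico_n(fico_range_low, fico_range_high):
    """Average of the two FICO range bounds, as described for fico_n."""
    return (fico_range_low + fico_range_high) / 2


def add_fico_n(loan):
    """Return a copy of the loan record with the derived fico_n field."""
    return {**loan, "fico_n": fico_n(loan["fico_range_low"], loan["fico_range_high"])}


loan = {"id": 1, "fico_range_low": 680, "fico_range_high": 684}
enriched = add_fico_n(loan)
print(enriched["fico_n"])  # 682.0
```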

    Categorical variables:

    emp_length: Categorical variable with the employment length of the borrower (includes a "no information" category).

    purpose: Credit purpose category for the loan request.

    home_ownership_n: Homeownership status provided by the borrower in the registration process. Categories defined by LC: “mortgage”, “rent”, “own”, “other”, “any”, “none”. We merged the categories “other”, “any” and “none” as “other”.

    addr_state: Borrower's residence state from the USA.

    zip_code: Zip code of the borrower's residence.
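
    The category merge described for home_ownership_n could look like the sketch below. It assumes the raw values may differ in letter case or surrounding whitespace (an assumption, not stated in the description), so they are normalized before merging. The names are hypothetical.

```python
# Categories that the description says were merged into "other".
MERGED_INTO_OTHER = {"other", "any", "none"}


def home_ownership_n(raw_value):
    """Normalize a raw home-ownership value and merge the rare categories."""
    value = raw_value.strip().lower()
    return "other" if value in MERGED_INTO_OTHER else value


print([home_ownership_n(v) for v in ["MORTGAGE", "RENT", "ANY", "none"]])
# ['mortgage', 'rent', 'other', 'other']
```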

    Textual variables:

    title: Title of the credit request description provided by the borrower.

    desc: Description of the credit request provided by the borrower.

    We cleaned the textual variables. First, we removed all those descriptions that contained the default description provided by Lending Club on its web form (“Tell your story. What is your loan for?”). Moreover, we removed the prefix “Borrower added on DD/MM/YYYY >” from the descriptions to avoid any temporal background on them. Finally, as these descriptions came from a web form, we replaced all the HTML entities with the characters they represent (e.g. “&amp;” was replaced by “&”, “&lt;” was replaced by “<”, etc.).
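
    The three cleaning steps could be approximated as below, using only the Python standard library. This is a sketch under assumptions: the exact prefix format (two-digit day and month, two- or four-digit year) is guessed from the "DD/MM/YYYY" pattern quoted above, and returning None is just one way to represent a removed description. The names are hypothetical.

```python
import html
import re

DEFAULT_PROMPT = "Tell your story. What is your loan for?"
# Assumption: the prefix looks like "Borrower added on 03/05/2012 > ".
PREFIX = re.compile(r"Borrower added on \d{2}/\d{2}/\d{2,4} > ?")


def clean_description(desc):
    """Return the cleaned description, or None if it holds the default prompt."""
    if DEFAULT_PROMPT in desc:
        return None  # description is dropped entirely
    desc = PREFIX.sub("", desc)       # strip the dated prefix
    return html.unescape(desc).strip()  # decode entities such as &amp; and &lt;


raw = "Borrower added on 03/05/2012 > Paying off cards &amp; a car loan"
print(clean_description(raw))  # Paying off cards & a car loan
```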

    RELATED WORKS

    This dataset has been used in the following academic articles:

    Sanz-Guerrero, M., Arroyo, J. (2024). Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending. arXiv preprint arXiv:2401.16458. https://doi.org/10.48550/arXiv.2401.16458

    Ariza-Garzón, M.J., Arroyo, J., Caparrini, A., Segovia-Vargas, M.J. (2020). Explainability of a machine learning granting scoring model in peer-to-peer lending. IEEE Access 8, 64873 - 64890. https://doi.org/10.1109/ACCESS.2020.2984412

  7. Lending Club Data

    • kaggle.com
    zip
    Updated Jul 29, 2021
    Srujana (2021). Lending Club Data [Dataset]. https://www.kaggle.com/srujanakandula/lending-club-data
    Available download formats
    zip (8786978 bytes)
    Dataset updated
    Jul 29, 2021
    Authors
    Srujana
    Description

    Dataset

    This dataset was created by Srujana

    Contents

  8. Lending club test

    • figshare.com
    txt
    Updated Jun 7, 2022
    Deepchecks Data (2022). Lending club test [Dataset]. http://doi.org/10.6084/m9.figshare.20015996.v1
    Available download formats
    txt
    Dataset updated
    Jun 7, 2022
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Deepchecks Data
    License

    Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
    License information was derived automatically

    Description
  9. Lending_Club

    • kaggle.com
    zip
    Updated Sep 5, 2019
    Rachit (2019). Lending_Club [Dataset]. https://www.kaggle.com/datasets/rachitsuccess/lending-club
    Available download formats
    zip (8654354 bytes)
    Dataset updated
    Sep 5, 2019
    Authors
    Rachit
    Description

    Context

    LendingClub is a US peer-to-peer lending company, headquartered in San Francisco, California. It was the first peer-to-peer lender to register its offerings as securities with the Securities and Exchange Commission, and to offer loan trading on a secondary market.

    Content

    42K rows and 144 features

  10. Loan Data for Dummy Bank

    • kaggle.com
    Updated Aug 4, 2018
    MuhammadNadeemFerozi (2018). Loan Data for Dummy Bank [Dataset]. https://www.kaggle.com/datasets/mrferozi/loan-data-for-dummy-bank/discussion
    Croissant (a format for machine-learning datasets; learn more at mlcommons.org/croissant)
    Dataset updated
    Aug 4, 2018
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    MuhammadNadeemFerozi
    Description

    Company Information:

    The data set is based upon Lending Club Information (https://www.kaggle.com/prateikmahendra/loan-data). The Irish Dummy Bank is a peer-to-peer lending bank based in Ireland, in which the bank provides funds to potential borrowers and earns a profit depending on the risk it takes (the borrower's credit score). The Irish Fake Bank provides loans to its loyal customers. The complete data set is borrowed from Lending Club. For more basic information about the company, please check out the Wikipedia article about the company: https://en.wikipedia.org/wiki/Lending_Club. This dataset was copied from Kaggle and cleaned, but it has been changed. Any similarity is for learning purposes only. I have no intention of plagiarism; I just want to be clear about that.

    The central idea and code are drawn from Kevin Markham's YouTube video series, Introduction to machine learning with scikit-learn. You can find the link under the resources section.

    Data Description

    • LoanStatNew Description

    • addr_state The state provided by the borrower in the loan application

    • annual_inc The self-reported annual income provided by the borrower during registration.

    • annual_inc_joint The combined self-reported annual income provided by the co-borrowers during registration

    • application_type Indicates whether the loan is an individual application or a joint application with two co-borrowers

    • collection_recovery_fee post charge off collection fee

    • collections_12_mths_ex_med Number of collections in 12 months excluding medical collections

    • delinq_2yrs The number of 30+ days past-due incidences of delinquency in the borrower's credit file for the past 2 years

    • desc Loan description provided by the borrower

    • dti A ratio calculated using the borrower’s total monthly debt payments on the total debt obligations, excluding mortgage and the requested LC loan, divided by the borrower’s self-reported monthly income.

    • dti_joint A ratio calculated using the co-borrowers' total monthly payments on the total debt obligations, excluding mortgages and the requested LC loan, divided by the co-borrowers' combined self-reported monthly income

    • earliest_cr_line The month the borrower's earliest reported credit line was opened

    • emp_length Employment length in years. Possible values are between 0 and 10 where 0 means less than one year and 10 means ten or more years.

    • emp_title The job title supplied by the Borrower when applying for the loan.

    • fico_range_high The upper boundary range the borrower’s FICO at loan origination belongs to.

    • fico_range_low The lower boundary range the borrower’s FICO at loan origination belongs to.

    • funded_amnt The total amount committed to that loan at that point in time.

    • funded_amnt_inv The total amount committed by investors for that loan at that point in time.

    • grade LC assigned loan grade

    • home_ownership The home ownership status provided by the borrower during registration. Our values are: RENT, OWN, MORTGAGE, OTHER.
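
    To make the dti and dti_joint definitions in the list above concrete, here is a minimal sketch. It assumes the payments passed in already exclude mortgage and the requested LC loan, and it returns a plain ratio; whether the published columns store a ratio or a percentage is not stated here. The function signatures are hypothetical.

```python
def dti(monthly_debt_payments, monthly_income):
    """Monthly debt payments divided by self-reported monthly income."""
    if monthly_income <= 0:
        raise ValueError("monthly income must be positive")
    return monthly_debt_payments / monthly_income


def dti_joint(payments_per_borrower, incomes_per_borrower):
    """Joint ratio: combined payments over combined monthly income."""
    return dti(sum(payments_per_borrower), sum(incomes_per_borrower))


single = dti(500, 4000)                      # one borrower
joint = dti_joint([500, 300], [4000, 4000])  # two co-borrowers
```

    For the sample values, the single-borrower ratio is 500 / 4000 and the joint ratio is 800 / 8000.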
