freMTPL2freq: French Motor Third-Party Liability Claims and Risk Features
arff
Description
677,991 motor third-party liability insurance policies observed over one year, with associated risk features and claim numbers. The dataset contains 11 columns plus a policy ID, covering variables like vehicle power, driver age, bonus-malus score, and population density. It originates from the R package CASDatasets, version 1.0-6 (2016), and is used in actuarial data science tutorials.
Use Cases
Predicting claim frequency based on driver age, vehicle age, and bonus-malus score.
Analyzing regional risk differences based on the Area and Region codes.
Modeling the impact of vehicle characteristics like power, brand, and fuel type on claims.
Assessing exposure and risk for policy pricing using the provided Exposure period and claim counts.
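The frequency use cases above rest on one piece of arithmetic: claim counts are divided by exposure (in policy-years), not by the number of policies. The following is a minimal sketch of that calculation per group. The rows are made-up sample values, and the column names `Area`, `ClaimNb`, and `Exposure` are assumptions that should be checked against the downloaded file.

```python
from collections import defaultdict

# Hypothetical sample rows, not taken from the dataset.
# Column names (Area, ClaimNb, Exposure) are assumed.
policies = [
    {"Area": "A", "ClaimNb": 0, "Exposure": 0.5},
    {"Area": "A", "ClaimNb": 1, "Exposure": 1.0},
    {"Area": "B", "ClaimNb": 0, "Exposure": 0.25},
    {"Area": "B", "ClaimNb": 2, "Exposure": 0.75},
]


def claim_frequency_by(rows, key):
    """Return claims per policy-year for each level of `key`."""
    claims = defaultdict(float)
    exposure = defaultdict(float)
    for row in rows:
        claims[row[key]] += row["ClaimNb"]
        exposure[row[key]] += row["Exposure"]
    # Skip levels with zero total exposure to avoid division by zero.
    return {
        level: claims[level] / exposure[level]
        for level in claims
        if exposure[level] > 0
    }


frequencies = claim_frequency_by(policies, "Area")
for level, freq in sorted(frequencies.items()):
    print(f"{level}: {freq:.3f} claims per policy-year")
```

Weighting by exposure matters because a policy observed for three months contributes less information than one observed for a full year. A frequency model such as a Poisson regression would typically carry the same idea through an exposure offset.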
Strengths
Contains 677,991 policy records, providing a substantial sample for analysis.
Includes 11 columns beyond the policy ID, among them driver age, vehicle age, and population density.
Explicitly designed for actuarial modeling, as indicated by its use in professional society tutorials.
Limitations
The row count of the distributed file has not been verified against the 677,991 policies cited in the description, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Last update date is unknown; freshness unverified.
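Because the file is in ARFF format, the column names, declared types, and row count can be read from the file itself after download. The following is a minimal sketch of such a check using only the standard library. The sample text and its attribute names are hypothetical, and the parser is simplified: it assumes attribute names without spaces and a dense (non-sparse) data section.

```python
def read_arff_header(lines):
    """Return ([(attribute name, declared type), ...], number of data rows)."""
    attributes = []
    n_rows = 0
    in_data = False
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and ARFF comments, which start with '%'.
        if not line or line.startswith("%"):
            continue
        if in_data:
            n_rows += 1
        elif line.lower().startswith("@attribute"):
            # Simplification: the attribute name is assumed to contain no spaces.
            _, name, dtype = line.split(None, 2)
            attributes.append((name.strip("'\""), dtype))
        elif line.lower().startswith("@data"):
            in_data = True
    return attributes, n_rows


# Hypothetical miniature file, for illustration only.
sample = """\
% example header
@relation example
@attribute IDpol numeric
@attribute Area {A,B}
@attribute Exposure numeric
@data
1,A,0.5
2,B,1.0
"""

attributes, n_rows = read_arff_header(sample.splitlines())
for name, dtype in attributes:
    print(f"{name}: {dtype}")
print(f"rows: {n_rows}")
```

For the real file, the same function can be applied to an open file handle in place of `sample.splitlines()`.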
Provenance
Source
R-Package CASDatasets, Version 1.0-6 (2016) by Christophe Dutang, Arthur Charpentier
Collection Method
Records cover motor third-party liability policies observed over one year.
Geography
France
License
GPL 2, which may impose specific redistribution requirements.