Table 1 Individual variables.

From: A dataset for health insurance analysis: Integrating individual and area-based contextual variables

Variable

Description

Data type

ID

Identification code.

Factor

ID_policy

(Generic) Policy identification code.

Integer

ID_insured

Insured identification code.

Integer

period

Calendar year (YYYY).

Date

date_effect_insured

Effective date of insured policy in the company (DD/MM/YYYY).

Date

date_lapse_insured

Lapse date of insured policy in the company (DD/MM/YYYY).

Date

date_effect_policy

Effective date of generic policy in the company (DD/MM/YYYY).

Date

date_lapse_policy

Lapse date of generic policy in the company (DD/MM/YYYY).

Date

year_effect_insured

year of date_effect_insured variable (YYYY).

Date

year_lapse_insured

year of date_lapse_insured variable (YYYY).

Date

year_effect_policy

year of date_effect_policy variable (YYYY).

Date

year_lapse_policy

year of date_lapse_policy variable (YYYY).

Date

exposure_time

Time, measured in years, representing the insured’s risk

 
 

exposure (value between zero and one).

Continuous

lapse

Policy Status Code

Factor

 

1: Lapse before expiration,

 
 

2: Active,

 
 

3: Lapse at (expiration) YYYY/12/31.

 

seniority_insured

Total number of years that the insured policy has been associated

Integer

 

with the insurance entity.

 

seniority_policy

Total number of years that the generic policy has been associated with

Integer

 

the insurance entity.

 

type_policy

(Generic) policy type, indicating whether the policy is

Factor

 

I: Individual,

 
 

C: Collective.

 

type_policy_dg

(Generic) policy type, disaggregated by group collectives

Factor

 

S: Self-Employed,

 
 

I: Individual,

 
 

C1: Collective 1,

 
 

C2: Collective 2,

 
 

C3: Collective 3,

 
 

C4: Collective 4.

 

type_product

(Generic) policy coverage

Factor

 

D: Dental,

 
 

P: Premium,

 
 

S: Standard,

 
 

I: International.

 

reimbursement

The act of reimbursing an amount of money to the individual

 
 

who originally paid it (yes or no)

Factor

new_business

Indicates whether the insured policy is new business or not (yes or no).

Binary

distribution_channel

Distribution channel through which the generic policy was processed.

Factor

 

A: Agency,

 
 

D: Direct business,

 
 

I: Insurance Intermediary.

 

gender

Gender of the insured.

Factor

 

M: Male,

 
 

F: Female.

 

age

Age of the insured, measured in whole years,

Integer

 

obtained by rounding to the closest integer

 
 

the exact (decimal) age of the insured.

Integer

premium

Net premium amount associated with the insured policy during the

 
 

current year.

Continuous

cost_claims_year

Total cost of claims for the insured policy during the current

Continuous

 

year.

 

n_medical_services

Total number of medical services for the insured policy

Integer

 

policy during the current year.