Overview

Dataset info

Number of variables 21
Number of observations 9610684
Total Missing (%) 10.8%
Total size in memory 1.5 GiB
Average record size in memory 168.0 B
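
The overview figures can be cross-checked against the per-variable counts listed further down. The sketch below only redoes the arithmetic from numbers printed in this report; it does not read the dataset. Treating type_declaration as having no missing values is an assumption, since the report gives no missing count for it.

```python
# Figures copied from this report; nothing here re-reads the underlying data.
n_variables = 21
n_observations = 9_610_684

# "Missing (n)" per variable, as listed in the Variables section.
# Variables reported with 0 missing are omitted. type_declaration has no
# reported missing count, so counting it as 0 is an assumption.
missing_by_variable = {
    "identifiant": 1_483_270,
    "identifiant_convention": 6_516_405,
    "nom": 60_580,
    "nom_prénom": 60_382,
    "profession": 539_443,
    "prénom": 63_706,
    "specialité": 3_435_115,
    "structure_bénéficiaire": 9_543_213,
}

total_cells = n_variables * n_observations
missing_pct = 100 * sum(missing_by_variable.values()) / total_cells
print(f"Total missing: {missing_pct:.1f}%")  # prints "Total missing: 10.8%"

# Average record size derived from the rounded total of 1.5 GiB.
avg_record_bytes = 1.5 * 1024**3 / n_observations
print(f"Average record size: {avg_record_bytes:.1f} B")  # prints "Average record size: 167.6 B"
```

The record size comes out slightly under the reported 168.0 B because the 1.5 GiB total is itself rounded.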

Variable types

Numeric 2
Categorical 17
Boolean 0
Date 0
Text (Unique) 1
Rejected 1
Unsupported 0

Variables

annee
Numeric

Distinct count 20
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2015.3
Minimum 1980
Maximum 2018
Zeros (%) 0.0%

Quantile statistics

Minimum 1980
5-th percentile 2013
Q1 2014
Median 2015
Q3 2017
95-th percentile 2018
Maximum 2018
Range 38
Interquartile range 3

Descriptive statistics

Standard deviation 1.5695
Coef of variation 0.00077879
Kurtosis -0.74074
Mean 2015.3
MAD 1.3291
Skewness -0.04441
Sum 19367992963
Variance 2.4632
Memory size 73.3 MiB
Value Count Frequency (%)  
2015 1960052 20.4%
 
2014 1900217 19.8%
 
2016 1883933 19.6%
 
2017 1686752 17.6%
 
2013 1177194 12.2%
 
2018 745494 7.8%
 
2012 255799 2.7%
 
2011 814 0.0%
 
2006 186 0.0%
 
2004 66 0.0%
 
Other values (10) 177 0.0%
 

Minimum 5 values

Value Count Frequency (%)  
1980 3 0.0%
 
2000 9 0.0%
 
2001 19 0.0%
 
2002 3 0.0%
 
2003 6 0.0%
 

Maximum 5 values

Value Count Frequency (%)  
2014 1900217 19.8%
 
2015 1960052 20.4%
 
2016 1883933 19.6%
 
2017 1686752 17.6%
 
2018 745494 7.8%
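
The derived statistics for annee follow from the basic ones. As a rough check using only the values printed above (so results agree with the report only up to rounding), the relationships can be written out as:

```python
# Values copied from the annee block of this report.
n_observations = 9_610_684
total = 19_367_992_963      # "Sum"
std_dev = 1.5695            # "Standard deviation"
q1, q3 = 2014, 2017
minimum, maximum = 1980, 2018

mean = total / n_observations          # about 2015.26, shown as 2015.3
variance = std_dev ** 2                # about 2.4633, reported as 2.4632
coef_of_variation = std_dev / mean     # about 0.00077881, reported as 0.00077879
value_range = maximum - minimum        # 38
interquartile_range = q3 - q1          # 3

print(round(mean, 1), round(variance, 3), value_range, interquartile_range)
```

The small differences in the last digits come from starting with the rounded standard deviation rather than the full-precision value.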
 

categorie_generale
Categorical

Distinct count 6
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
Contribution au coût d'événements promotionnels, scientifique ou professionel 8889393 92.5%
 
Service et Conseil 377963 3.9%
 
Sans classe 233081 2.4%
 
Cadeaux 51486 0.5%
 
Dons et Subventions 47836 0.5%
 
Formation 10925 0.1%
 

categorie_precise
Categorical

Distinct count 16
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
Restauration 6697420 69.7%
 
Hospitalité 629789 6.6%
 
Transport 576380 6.0%
 
Hébergement 518319 5.4%
 
Contribution au coût d'événements promotionnels, scientifique ou professionel 462666 4.8%
 
Rémunération 360350 3.7%
 
Association à une catégorie non réussie 177402 1.8%
 
Vide, Autre 55679 0.6%
 
Cadeaux 51486 0.5%
 
Dons de sommes d'argent 21531 0.2%
 
Other values (6) 59662 0.6%
 

catégorie_beneficiaire
Categorical

Distinct count 14
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
Professionnel de santé 9064954 94.3%
 
Etudiant 456060 4.7%
 
Vétérinaire 27578 0.3%
 
Association professionnel de santé 21514 0.2%
 
Etablissement de santé 13911 0.1%
 
Personnes morales assurant la formation initiale ou continue des professionnels de santé 10694 0.1%
 
Académies, Fondation, sociétés savantes, organismes de conseils 6738 0.1%
 
Association usager de santé 4764 0.0%
 
Presse et média 4060 0.0%
 
Editeur de logiciel 246 0.0%
 
Other values (4) 165 0.0%
 

date
Categorical

Distinct count 2562
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
10/04/2014 21539 0.2%
 
24/11/2016 18828 0.2%
 
28/11/2013 18771 0.2%
 
04/12/2014 18684 0.2%
 
26/03/2015 18673 0.2%
 
20/03/2014 18523 0.2%
 
19/11/2015 18478 0.2%
 
10/12/2015 18326 0.2%
 
20/11/2014 18260 0.2%
 
17/11/2016 18227 0.2%
 
Other values (2552) 9422375 98.0%
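
The date variable is profiled as Categorical, which suggests it is stored as text. Values such as 28/11/2013 can only be read as day/month/year, so a minimal sketch for converting them to real dates, assuming every value follows that format, could look like this (`parse_report_date` is a hypothetical helper name):

```python
from datetime import date, datetime

def parse_report_date(text: str) -> date:
    """Parse a day/month/year string such as '10/04/2014'."""
    return datetime.strptime(text.strip(), "%d/%m/%Y").date()

# The three most frequent values from the table above.
sample = ["10/04/2014", "24/11/2016", "28/11/2013"]
parsed = [parse_report_date(s) for s in sample]

for original, value in zip(sample, parsed):
    print(original, "->", value.isoformat())
```

`datetime.strptime` raises `ValueError` on anything that does not match the format, which is a convenient way to surface malformed entries before analysis.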
 

declaration_id
Categorical, Unique

First 3 values
Avantage | IIXRHSGB | RPN-2015-22253_PS-059671...
Avantage | PTBTSRUS | 2018-E10830-02-561-1_A_5...
Avantage | QRUSPVYJ | RP11076AP1923493
Last 3 values
Avantage | LFSWZPFF | 6H22015CON3568
Avantage | JUJJFUKK | AVANT-050268
Avantage | XCKYTDRO | 2013S2_PPA_08090

First 10 values

Value Count Frequency (%)  
Avantage | AADEBGMH | 01032018_AVT_1 1 0.0%
 
Avantage | AADEBGMH | 01032018_AVT_10 1 0.0%
 
Avantage | AADEBGMH | 01032018_AVT_11 1 0.0%
 
Avantage | AADEBGMH | 01032018_AVT_12 1 0.0%
 
Avantage | AADEBGMH | 01032018_AVT_13 1 0.0%
 

Last 10 values

Value Count Frequency (%)  
Avantage | ZZSYDGYD | MANUEL_103455 1 0.0%
 
Avantage | ZZSYDGYD | MANUEL_103465 1 0.0%
 
Avantage | ZZSYDGYD | MANUEL_103480 1 0.0%
 
Avantage | ZZSYDGYD | MANUEL_103483 1 0.0%
 
Avantage | ZZSYDGYD | MANUEL_103492 1 0.0%
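
Each declaration_id shown here has three parts separated by " | ": a declaration type, an eight-letter code that matches identifiant_entreprise in the sample rows, and a company-specific reference. A small sketch for splitting the value, assuming that layout holds for every row (the function and field names are hypothetical), might be:

```python
from typing import NamedTuple

class DeclarationId(NamedTuple):
    declaration_type: str
    company_code: str
    reference: str

def split_declaration_id(value: str) -> DeclarationId:
    """Split 'Avantage | CODE | reference' into its three parts."""
    # maxsplit=2 keeps any further '|' characters inside the reference.
    parts = [part.strip() for part in value.split("|", 2)]
    if len(parts) != 3:
        raise ValueError(f"unexpected declaration_id layout: {value!r}")
    return DeclarationId(*parts)

example = split_declaration_id("Avantage | QRUSPVYJ | RP11076AP1923493")
print(example.company_code)  # prints "QRUSPVYJ"
```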
 

detail
Categorical

Distinct count 5675
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
REPAS 5991843 62.3%
 
HOSPITALITE 614322 6.4%
 
TRANSPORT 563587 5.9%
 
HEBERGEMENT 501268 5.2%
 
RESTAURATION 372926 3.9%
 
ENQUETE 316012 3.3%
 
PARTICIPATION EVENEMENT SCIENTIFIQUE 173389 1.8%
 
INSCRIPTION 172518 1.8%
 
Non renseigné 55679 0.6%
 
CADEAUX 47895 0.5%
 
Other values (5665) 801245 8.3%
 

entreprise_émmetrice
Categorical

Distinct count 1638
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
Novartis 507800 5.3%
 
Astrazeneca 469510 4.9%
 
SANOFI SA 456822 4.8%
 
MSD 418472 4.4%
 
GlaxoSmithKline 325903 3.4%
 
Johnson & Johnson 253466 2.6%
 
ICOMED 233472 2.4%
 
Roche 221988 2.3%
 
Bayer 213101 2.2%
 
Bristol-Myers Squibb 206782 2.2%
 
Other values (1628) 6303368 65.6%
 

filiale_déclarante
Categorical

Distinct count 1817
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
ASTRAZENECA 469510 4.9%
 
NOVARTIS PHARMA SAS 444943 4.6%
 
MSD France 413926 4.3%
 
SANOFI AVENTIS FRANCE 413210 4.3%
 
LABORATOIRE GLAXOSMITHKLINE 321600 3.3%
 
ICOMED 233472 2.4%
 
JANSSEN-CILAG 218701 2.3%
 
Bayer HealthCare SAS 211734 2.2%
 
BRISTOL-MYERS SQUIBB 206103 2.1%
 
AbbVie 199837 2.1%
 
Other values (1807) 6477648 67.4%
 

identifiant
Categorical

Distinct count 477155
Unique (%) 5.0%
Missing (%) 15.4%
Missing (n) 1483270
Value Count Frequency (%)  
[SO] 810488 8.4%
 
SO 127594 1.3%
 
0 30352 0.3%
 
[AUTRE] 10182 0.1%
 
10000000000 9528 0.1%
 
00000000000 8677 0.1%
 
BENEF_IDENTIFIANT_VALEUR 7297 0.1%
 
[BR] 5655 0.1%
 
NON RENSEIGNE 3765 0.0%
 
[0] 3718 0.0%
 
Other values (477144) 7110158 74.0%
 
(Missing) 1483270 15.4%
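
Several of the most frequent identifiant values ([SO], SO, 0, NON RENSEIGNE and similar) look like placeholders rather than real identifiers, so the true share of unusable values is probably higher than the 15.4% reported as missing. A minimal sketch for folding them into missing values follows. The placeholder list is an assumption read off the table above; 10000000000 is left out because the report alone does not show whether it is a placeholder.

```python
# Assumed placeholder spellings, taken from the frequency table above.
PLACEHOLDERS = {
    "[SO]", "SO", "0", "[0]", "[AUTRE]", "[BR]",
    "NON RENSEIGNE", "BENEF_IDENTIFIANT_VALEUR", "00000000000",
}

def clean_identifier(value):
    """Return the identifier as a stripped string, or None if it is unusable."""
    # None and float NaN (NaN is the only value not equal to itself) are missing.
    if value is None or value != value:
        return None
    text = str(value).strip()
    if not text or text.upper() in PLACEHOLDERS:
        return None
    return text

print(clean_identifier("[SO]"))         # prints "None"
print(clean_identifier("10004148077"))  # prints "10004148077"
```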
 

identifiant_convention
Categorical

Distinct count 1203698
Unique (%) 12.5%
Missing (%) 67.8%
Missing (n) 6516405
Value Count Frequency (%)  
Convention | YLWGMHGC | Réunion Scientifique - Participant 185315 1.9%
 
Convention | YLWGMHGC | RÉUNION SCIENTIFIQUE - PARTICIPANT 122701 1.3%
 
Convention | SYCNVYRX | convention liée : RP Formation 108065 1.1%
 
Convention | YLWGMHGC | Réunion scientifique - Participant 56292 0.6%
 
Convention | NROJFJET | ; 54797 0.6%
 
Convention | OUMNRESV | " 50286 0.5%
 
Convention | YLWGMHGC | REUNION SCIENTIFIQUE - PARTICIPANT 35683 0.4%
 
Convention | SYCNVYRX | Convention liée : RP Formation 28337 0.3%
 
Convention | SYCNVYRX | convention liée : congrès/symposium 14077 0.1%
 
Convention | PTBTSRUS | - 10821 0.1%
 
Other values (1203687) 2427905 25.3%
 
(Missing) 6516405 67.8%
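
Four of the top identifiant_convention values differ only in letter case and accents ("Réunion Scientifique", "RÉUNION SCIENTIFIQUE", "Réunion scientifique", "REUNION SCIENTIFIQUE"). The sketch below shows one possible way to merge such variants by stripping accents and case-folding; whether these labels should really be treated as the same convention is an assumption that needs checking against the source data.

```python
import unicodedata
from collections import Counter

def normalize_label(text: str) -> str:
    """Strip accents, fold case and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    without_accents = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return " ".join(without_accents.casefold().split())

# Counts copied from the frequency table above.
variants = {
    "Convention | YLWGMHGC | Réunion Scientifique - Participant": 185_315,
    "Convention | YLWGMHGC | RÉUNION SCIENTIFIQUE - PARTICIPANT": 122_701,
    "Convention | YLWGMHGC | Réunion scientifique - Participant": 56_292,
    "Convention | YLWGMHGC | REUNION SCIENTIFIQUE - PARTICIPANT": 35_683,
}

merged = Counter()
for label, count in variants.items():
    merged[normalize_label(label)] += count

print(merged)
# Counter({'convention | ylwgmhgc | reunion scientifique - participant': 399991})
```

Merged this way, the four spellings account for 399,991 rows, which would make the combined label by far the most common non-missing value.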
 

identifiant_entreprise
Categorical

Distinct count 1833
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
SYCNVYRX 469510 4.9%
 
YLWGMHGC 444943 4.6%
 
NROJFJET 413926 4.3%
 
IIXRHSGB 413210 4.3%
 
JUJJFUKK 321600 3.3%
 
TJQKCKUX 233472 2.4%
 
XCKYTDRO 218701 2.3%
 
SOWWIAEU 211734 2.2%
 
RIZLLUIF 206103 2.1%
 
LFSWZPFF 199837 2.1%
 
Other values (1823) 6477648 67.4%
 

montant_ttc
Numeric

Distinct count 10711
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 137.35
Minimum 10
Maximum 2784832
Zeros (%) 0.0%

Quantile statistics

Minimum 10
5-th percentile 14
Q1 23
Median 40
Q3 60
95-th percentile 400
Maximum 2784832
Range 2784822
Interquartile range 37

Descriptive statistics

Standard deviation 3039.5
Coef of variation 22.129
Kurtosis 251800
Mean 137.35
MAD 170.17
Skewness 401.1
Sum 1320056680
Variance 9238600
Memory size 73.3 MiB
Value Count Frequency (%)  
60 745337 7.8%
 
20 389984 4.1%
 
57 319152 3.3%
 
55 299522 3.1%
 
50 290245 3.0%
 
30 283317 2.9%
 
25 269310 2.8%
 
23 220294 2.3%
 
19 209663 2.2%
 
18 197174 2.1%
 
Other values (10701) 6386686 66.5%
 

Minimum 5 values

Value Count Frequency (%)  
10 139763 1.5%
 
11 103054 1.1%
 
12 105952 1.1%
 
13 107766 1.1%
 
14 127590 1.3%
 

Maximum 5 values

Value Count Frequency (%)  
2000000 1 0.0%
 
2054000 1 0.0%
 
2400000 1 0.0%
 
2464769 1 0.0%
 
2784832 1 0.0%
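
With a skewness of 401.1 and a maximum of 2784832 against a median of 40, montant_ttc is dominated by a few extreme values. One common way to flag them is Tukey's 1.5 × IQR rule. This is an illustration applied to the quantiles printed above, not a method used by the report itself:

```python
# Quantiles copied from the montant_ttc block of this report.
q1, q3 = 23, 60
p95 = 400
maximum = 2_784_832

iqr = q3 - q1                    # 37, matches "Interquartile range"
lower_fence = q1 - 1.5 * iqr     # -32.5
upper_fence = q3 + 1.5 * iqr     # 115.5

print(f"upper fence: {upper_fence}")             # prints "upper fence: 115.5"
print(f"95th percentile above fence: {p95 > upper_fence}")  # prints "... True"
```

Because the 95th percentile (400) already lies above the upper fence, at least 5% of the records would be flagged by this rule, so it marks a heavy tail rather than a handful of isolated errors.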
 

nom
Categorical

Distinct count 347699
Unique (%) 3.6%
Missing (%) 0.6%
Missing (n) 60580
Value Count Frequency (%)  
MARTIN 20466 0.2%
 
BERNARD 12910 0.1%
 
THOMAS 11164 0.1%
 
DURAND 10954 0.1%
 
NGUYEN 10643 0.1%
 
SIMON 10395 0.1%
 
LAURENT 10193 0.1%
 
PETIT 10108 0.1%
 
ROBERT 9362 0.1%
 
BONNET 9293 0.1%
 
Other values (347688) 9434616 98.2%
 
(Missing) 60580 0.6%
 

nom_prénom
Categorical

Distinct count 958069
Unique (%) 10.0%
Missing (%) 0.6%
Missing (n) 60382
Value Count Frequency (%)  
BOUHARAOUA Ahmed 930 0.0%
 
DIEUZAIDE Pierre 916 0.0%
 
KHATTAR Pierre 914 0.0%
 
DEFAYE Pascal 867 0.0%
 
CHEVALIER Nicolas 824 0.0%
 
HAGER Francois Xavier 819 0.0%
 
DOBI Erion 816 0.0%
 
ANSELME Frederic 815 0.0%
 
BAH Thierno 790 0.0%
 
FLIPO Rene Marc 789 0.0%
 
Other values (958058) 9541822 99.3%
 
(Missing) 60382 0.6%
 

profession
Categorical

Distinct count 25
Unique (%) 0.0%
Missing (%) 5.6%
Missing (n) 539443
Value Count Frequency (%)  
Médecin 6932699 72.1%
 
Infirmier 767486 8.0%
 
Pharmacien 603950 6.3%
 
Préparateur en pharmacie et préparateur en pharmacie hospitalière 277542 2.9%
 
Chirurgien-dentiste 135839 1.4%
 
Manipulateur d’électroradiologie médicale 87590 0.9%
 
Opticien-lunetier 58473 0.6%
 
Audioprothésiste 48484 0.5%
 
Sage-femme 34633 0.4%
 
Aide soignant 30245 0.3%
 
Other values (14) 94300 1.0%
 
(Missing) 539443 5.6%
 

prénom
Categorical

Distinct count 58379
Unique (%) 0.6%
Missing (%) 0.7%
Missing (n) 63706
Value Count Frequency (%)  
Philippe 198407 2.1%
 
Pierre 120048 1.2%
 
Catherine 108896 1.1%
 
Isabelle 108675 1.1%
 
Olivier 106280 1.1%
 
Michel 98588 1.0%
 
Francois 98534 1.0%
 
Alain 89518 0.9%
 
Dominique 89196 0.9%
 
Eric 87364 0.9%
 
Other values (58368) 8441472 87.8%
 

specialité
Categorical

Distinct count 65
Unique (%) 0.0%
Missing (%) 35.7%
Missing (n) 3435115
Value Count Frequency (%)  
Médecine Générale 1364017 14.2%
 
Autre 1355481 14.1%
 
Cardiologie et maladies vasculaires 483837 5.0%
 
Pneumologie 243835 2.5%
 
Ophtalmologie 214177 2.2%
 
Neurologie 172528 1.8%
 
Gastro-entérologie et hépatologie 165952 1.7%
 
Rhumatologie 161480 1.7%
 
Endocrinologie et métabolisme 150379 1.6%
 
Dermatologie et vénéréologie 141079 1.5%
 
Other values (54) 1722804 17.9%
 
(Missing) 3435115 35.7%
 

structure_bénéficiaire
Categorical

Distinct count 27332
Unique (%) 0.3%
Missing (%) 99.3%
Missing (n) 9543213
Value Count Frequency (%)  
ADARE INTERNATIONAL LIMITED 1467 0.0%
 
MCCANN HEALTHCARE 323 0.0%
 
VIDAL ASSOCIES 235 0.0%
 
GERS 190 0.0%
 
SDWA 168 0.0%
 
FULLSIX 156 0.0%
 
HOPITAL HENRI MONDOR 140 0.0%
 
LA FONDERIE 139 0.0%
 
HOPITAL FOCH 133 0.0%
 
CHILTERN INTERNATIONAL SARL 131 0.0%
 
Other values (27321) 64389 0.7%
 
(Missing) 9543213 99.3%
 

type_declaration
Constant

This variable is constant and should be ignored for analysis

Constant value Avantage
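
A constant column carries no information for analysis and can be dropped. As a small sketch of how such columns can be detected, assuming rows are available as dictionaries (`constant_columns` is a hypothetical helper, and the three rows are a subset of the Sample section of this report):

```python
def constant_columns(rows):
    """Return the names of columns that hold a single distinct value."""
    distinct = {}
    for row in rows:
        for name, value in row.items():
            distinct.setdefault(name, set()).add(value)
    return [name for name, values in distinct.items() if len(values) == 1]

rows = [
    {"annee": 2013, "type_declaration": "Avantage", "montant_ttc": 50},
    {"annee": 2017, "type_declaration": "Avantage", "montant_ttc": 155},
    {"annee": 2015, "type_declaration": "Avantage", "montant_ttc": 60},
]

to_drop = constant_columns(rows)
print(to_drop)  # prints "['type_declaration']"

trimmed = [{k: v for k, v in row.items() if k not in to_drop} for row in rows]
```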

type_identifiant
Categorical

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)  
RPPS 6814127 70.9%
 
AUTRE 2670425 27.8%
 
ORDRE 109746 1.1%
 
SIREN 15363 0.2%
 
FINESS 1023 0.0%
 

Sample

,annee,categorie_generale,categorie_precise,catégorie_beneficiaire,date,declaration_id,detail,entreprise_émmetrice,filiale_déclarante,identifiant,identifiant_convention,identifiant_entreprise,montant_ttc,nom,nom_prénom,profession,prénom,specialité,structure_bénéficiaire,type_declaration,type_identifiant
0,2013,Contribution au coût d'événements promotionnel...,Restauration,Professionnel de santé,28/03/2013,Avantage | PDPRJEHO | 120267,REPAS,ETHICON,ETHICON,10004148077,NaN,PDPRJEHO,50,BEAUMONT,BEAUMONT Cecile,Pharmacien,Cecile,Autre,NaN,Avantage,RPPS
1,2017,Contribution au coût d'événements promotionnel...,Hébergement,Professionnel de santé,03/03/2017,Avantage | INJMXEAE | RPC-2017-0024_PS-0127692...,HEBERGEMENT,Amgen,AMGEN SAS,NaN,Convention | INJMXEAE | RPC-2017-0024_PS-0127692,INJMXEAE,155,DANDOY,DANDOY Simon,Médecin,Simon,NaN,NaN,Avantage,AUTRE
2,2015,Contribution au coût d'événements promotionnel...,Hospitalité,Professionnel de santé,29/08/2015,Avantage | SOWWIAEU | A_1-IIIVTU67890_1-8K4U-8566,HOSPITALITE,Bayer,Bayer HealthCare SAS,10003407714,Convention | SOWWIAEU | C_1-IIIVTU67890_1-8K4U...,SOWWIAEU,60,BERTRAND,BERTRAND Jean Henri,Médecin,Jean Henri,NaN,NaN,Avantage,RPPS
3,2017,Sans classe,Association à une catégorie non réussie,Professionnel de santé,04/03/2017,Avantage | INJMXEAE | RPC-2017-0024_PS-0127692...,PAUSE,Amgen,AMGEN SAS,NaN,Convention | INJMXEAE | RPC-2017-0024_PS-0127692,INJMXEAE,10,DANDOY,DANDOY Simon,Médecin,Simon,NaN,NaN,Avantage,AUTRE
4,2017,Contribution au coût d'événements promotionnel...,Restauration,Professionnel de santé,23/03/2017,Avantage | INJMXEAE | RPC-2017-0059_PS-0161147...,DINER REUNION,Amgen,AMGEN SAS,10100100360,Convention | INJMXEAE | RPC-2017-0059_PS-0161147,INJMXEAE,54,SAAD,SAAD Hussam,Médecin,Hussam,NaN,NaN,Avantage,RPPS