Overview

Dataset info

Number of variables 21
Number of observations 4124479
Total Missing (%) 22.4%
Total size in memory 660.8 MiB
Average record size in memory 168.0 B

Variables types

Numeric 2
Categorical 14
Boolean 0
Date 0
Text (Unique) 2
Rejected 3
Unsupported 0

Variables

annee
Numeric

Distinct count 34
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2015.2
Minimum 1976
Maximum 2018
Zeros (%) 0.0%

Quantile statistics

Minimum 1976
5-th percentile 2013
Q1 2014
Median 2015
Q3 2017
95-th percentile 2018
Maximum 2018
Range 42
Interquartile range 3

Descriptive statistics

Standard deviation 1.6606
Coef of variation 0.00082403
Kurtosis 0.11606
Mean 2015.2
MAD 1.3996
Skewness -0.22643
Sum 8311847329
Variance 2.7577
Memory size 31.5 MiB
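
The derived figures above appear to follow from the primary ones: the coefficient of variation is the standard deviation divided by the mean, the variance is the square of the standard deviation, and the range and interquartile range are differences of quantiles. A minimal sketch that recomputes them from the rounded values printed in this report (the function name `derived_stats` is illustrative and not part of the profiling tool):

```python
def derived_stats(mean, std, q1, q3, minimum, maximum):
    """Recompute derived summary figures from the primary ones."""
    return {
        "coef_of_variation": std / mean,
        "variance": std ** 2,
        "interquartile_range": q3 - q1,
        "range": maximum - minimum,
    }

# Values copied from the annee summary above (already rounded by the report).
annee = derived_stats(mean=2015.2, std=1.6606, q1=2014, q3=2017,
                      minimum=1976, maximum=2018)
print(annee)
```

Because the inputs are rounded, the recomputed coefficient of variation and variance agree with the reported 0.00082403 and 2.7577 only to within rounding error.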
Value Count Frequency (%)
2014 791601 19.2%
2015 789256 19.1%
2017 768940 18.6%
2016 753403 18.3%
2013 495886 12.0%
2018 353278 8.6%
2012 158148 3.8%
2011 8306 0.2%
2010 2072 0.1%
2009 1672 0.0%
Other values (24) 1917 0.0%

Minimum 5 values

Value Count Frequency (%)
1976 1 0.0%
1983 1 0.0%
1984 1 0.0%
1986 1 0.0%
1988 2 0.0%

Maximum 5 values

Value Count Frequency (%)
2014 791601 19.2%
2015 789256 19.1%
2016 753403 18.3%
2017 768940 18.6%
2018 353278 8.6%

categorie_generale
Constant

This variable is constant and should be ignored for analysis

Constant value

categorie_precise
Constant

This variable is constant and should be ignored for analysis

Constant value
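
A column flagged as constant carries no information that distinguishes one row from another, which is why the report suggests ignoring it. A minimal sketch of detecting and dropping such columns, assuming the records are held as a list of dicts; the helper `constant_columns` and the three-row excerpt are hypothetical and only illustrate the idea:

```python
def constant_columns(rows):
    """Return the names of columns that hold a single distinct value.

    `rows` is a list of dicts sharing the same keys; None stands for a
    missing value and is counted like any other value.
    """
    if not rows:
        return []
    return [col for col in rows[0]
            if len({row[col] for row in rows}) == 1]

# Hypothetical excerpt, not real records from the dataset.
rows = [
    {"annee": 2013, "categorie_generale": None, "type_declaration": "Convention"},
    {"annee": 2016, "categorie_generale": None, "type_declaration": "Convention"},
    {"annee": 2017, "categorie_generale": None, "type_declaration": "Convention"},
]
to_drop = constant_columns(rows)
cleaned = [{k: v for k, v in row.items() if k not in to_drop} for row in rows]
print(to_drop)
```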

catégorie_beneficiaire
Categorical

Distinct count 14
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
Professionnel de santé 3691151 89.5%
Etudiant 164512 4.0%
Association professionnel de santé 70040 1.7%
Personnes morales assurant la formation initiale ou continue des professionnels de santé 69005 1.7%
Académies, Fondation, sociétés savantes, organismes de conseils 51409 1.2%
Etablissement de santé 39511 1.0%
Presse et média 23356 0.6%
Vétérinaire 8233 0.2%
Association usager de santé 5039 0.1%
Editeur de logiciel 939 0.0%
Other values (4) 1284 0.0%

date
Categorical

Distinct count 3743
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
01/10/2013 10973 0.3%
10/04/2014 10136 0.2%
01/07/2013 9454 0.2%
01/12/2016 9392 0.2%
20/03/2014 9279 0.2%
15/10/2015 9034 0.2%
01/10/2015 8980 0.2%
17/11/2016 8871 0.2%
05/12/2013 8630 0.2%
22/05/2014 8615 0.2%
Other values (3733) 4031115 97.7%
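
The date column is profiled as Categorical because its values are strings. Values such as 20/03/2014 and 17/11/2016 have a first field above 12, which indicates a day-first DD/MM/YYYY layout. A minimal sketch of converting such a string to a real date with the standard library; the helper name `parse_report_date` is an assumption made for illustration:

```python
from datetime import date, datetime

def parse_report_date(text):
    """Parse a day-first 'DD/MM/YYYY' string into a date object."""
    return datetime.strptime(text, "%d/%m/%Y").date()

# An ambiguous-looking value from the table above: 10 April, not 4 October.
parsed = parse_report_date("10/04/2014")
print(parsed.isoformat())
```

Parsing with an explicit format avoids a month-first guess on values where both fields are 12 or lower.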
 

declaration_id
Categorical, Unique

First 3 values
Convention | IIXRHSGB | RPC-2014-01709_PS-0718142
Convention | CKJHICBF | 140521734-005875
Convention | SYCNVYRX | CO_2017_01_P_VEE_E-265...
Last 3 values
Convention | JLSLDVXR | RPS-2018-00119_PS-0337494
Convention | OPFMIKBW | 229069-CONV
Convention | NROJFJET | CTRMSD-20799_150323_C_...

First 10 values

Value Count Frequency (%)
Convention | AADEBGMH | MANUEL_118201 1 0.0%
Convention | AADEBGMH | MANUEL_118215 1 0.0%
Convention | AADEBGMH | MANUEL_118236 1 0.0%
Convention | AADEBGMH | MANUEL_118239 1 0.0%
Convention | AADEBGMH | MANUEL_118242 1 0.0%

Last 10 values

Value Count Frequency (%)
Convention | ZZSYDGYD | MANUEL_93179 1 0.0%
Convention | ZZSYDGYD | MANUEL_93199 1 0.0%
Convention | ZZSYDGYD | MANUEL_94120 1 0.0%
Convention | ZZSYDGYD | MANUEL_94124 1 0.0%
Convention | ZZSYDGYD | MANUEL_94163 1 0.0%
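
Every declaration_id value shown here has three parts separated by " | ". The first matches the type_declaration value (Convention), and the middle codes seen in this report (for example IIXRHSGB, SYCNVYRX, NROJFJET) also appear under identifiant_entreprise. A minimal sketch of splitting the identifier on that basis; the function name and the labels given to the three parts are assumptions, not documented field names:

```python
def split_declaration_id(value):
    """Split 'TYPE | COMPANY_CODE | REFERENCE' into its three parts.

    The split is limited to the first two separators so that a reference
    which itself contains ' | ' would stay in one piece.
    """
    parts = value.split(" | ", 2)
    if len(parts) != 3:
        raise ValueError(f"unexpected declaration_id layout: {value!r}")
    declaration_type, company_code, reference = parts
    return declaration_type, company_code, reference

example = split_declaration_id("Convention | AADEBGMH | MANUEL_118201")
print(example)
```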
 

detail
Categorical

Distinct count 357318
Unique (%) 8.7%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
FORMATION - 344404 8.4%
MARKETING - 141595 3.4%
HOSPITALITE - 141273 3.4%
ENQUETE / ETUDE / ETUDE DE MARCHE HORS RECHERCHE - 136196 3.3%
ETUDE DE MARCHE - 114488 2.8%
CONVENTION D HOSPITALITE - 108658 2.6%
RECHERCHE SCIENTIFIQUE - 65290 1.6%
Non renseigné - 40219 1.0%
CONVENTION HOSPITALITE - 32409 0.8%
ACHAT D ESPACE - 30804 0.7%
Other values (357308) 2969143 72.0%

entreprise_émmetrice
Categorical

Distinct count 1593
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
Astrazeneca 312840 7.6%
SANOFI SA 296424 7.2%
MSD 286500 6.9%
GlaxoSmithKline 162365 3.9%
Bayer 120496 2.9%
Novartis 118979 2.9%
Roche 118598 2.9%
Johnson & Johnson 100306 2.4%
A+A 99986 2.4%
Boiron 96943 2.4%
Other values (1583) 2411042 58.5%

filiale_déclarante
Categorical

Distinct count 1750
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
ASTRAZENECA 312840 7.6%
MSD France 284243 6.9%
SANOFI AVENTIS FRANCE 272849 6.6%
LABORATOIRE GLAXOSMITHKLINE 159655 3.9%
Bayer HealthCare SAS 119588 2.9%
ROCHE SAS 116620 2.8%
A+A 99986 2.4%
BOIRON 95078 2.3%
JANSSEN-CILAG 90964 2.2%
WorldOne Group B.V. 85481 2.1%
Other values (1740) 2487175 60.3%

identifiant
Categorical

Distinct count 360090
Unique (%) 8.7%
Missing (%) 12.2%
Missing (n) 504919
Value Count Frequency (%)
[SO] 339798 8.2%
SO 91895 2.2%
0 11275 0.3%
[AUTRE] 3912 0.1%
508331303 1810 0.0%
10003836375 1464 0.0%
10002393691 1375 0.0%
10100107985 1310 0.0%
10001589422 1293 0.0%
10002437373 1213 0.0%
Other values (360079) 3164215 76.7%
(Missing) 504919 12.2%
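
Besides the 504919 missing values, the most frequent entries in this column ([SO], SO, 0 and [AUTRE], 446880 rows in total) do not look like identifiers. A minimal sketch of folding them into a single missing marker before counting distinct identifiers. Treating these four strings as placeholders is an assumption; the report itself does not say what they mean, and the names `PLACEHOLDERS` and `normalize_identifier` are illustrative:

```python
# Assumed placeholder strings, taken from the top of the frequency table.
PLACEHOLDERS = {"[SO]", "SO", "0", "[AUTRE]"}

def normalize_identifier(value):
    """Return a cleaned identifier, or None for missing/placeholder input."""
    if value is None:
        return None
    text = value.strip()
    if text == "" or text in PLACEHOLDERS:
        return None
    return text

raw = ["[SO]", "SO", "0", "[AUTRE]", " 10003836375 ", None]
cleaned = [normalize_identifier(v) for v in raw]
print(cleaned)
```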
 

identifiant_convention
Categorical, Unique

First 3 values
Convention | IIXRHSGB | RPC-2014-01709_PS-0718142
Convention | CKJHICBF | 140521734-005875
Convention | SYCNVYRX | CO_2017_01_P_VEE_E-265...
Last 3 values
Convention | JLSLDVXR | RPS-2018-00119_PS-0337494
Convention | OPFMIKBW | 229069-CONV
Convention | NROJFJET | CTRMSD-20799_150323_C_...

First 10 values

Value Count Frequency (%)
Convention | AADEBGMH | MANUEL_118201 1 0.0%
Convention | AADEBGMH | MANUEL_118215 1 0.0%
Convention | AADEBGMH | MANUEL_118236 1 0.0%
Convention | AADEBGMH | MANUEL_118239 1 0.0%
Convention | AADEBGMH | MANUEL_118242 1 0.0%

Last 10 values

Value Count Frequency (%)
Convention | ZZSYDGYD | MANUEL_93179 1 0.0%
Convention | ZZSYDGYD | MANUEL_93199 1 0.0%
Convention | ZZSYDGYD | MANUEL_94120 1 0.0%
Convention | ZZSYDGYD | MANUEL_94124 1 0.0%
Convention | ZZSYDGYD | MANUEL_94163 1 0.0%

identifiant_entreprise
Categorical

Distinct count 1765
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
SYCNVYRX 312840 7.6%
NROJFJET 284243 6.9%
IIXRHSGB 272849 6.6%
JUJJFUKK 159655 3.9%
SOWWIAEU 119588 2.9%
HBBUBQTG 116620 2.8%
WSVLSWEO 99986 2.4%
ZJFVYRIK 95078 2.3%
XCKYTDRO 90964 2.2%
MDRHGABI 85481 2.1%
Other values (1755) 2487175 60.3%

montant_ttc
Numeric

Distinct count 11133
Unique (%) 0.3%
Missing (%) 89.2%
Missing (n) 3679119
Infinite (%) 0.0%
Infinite (n) 0
Mean 2095.7
Minimum -149
Maximum 33656000
Zeros (%) 0.0%

Quantile statistics

Minimum -149
5-th percentile 15
Q1 40
Median 60
Q3 460
95-th percentile 3290
Maximum 33656000
Range 33656000
Interquartile range 420

Descriptive statistics

Standard deviation 85266
Coef of variation 40.686
Kurtosis 113180
Mean 2095.7
MAD 3438.5
Skewness 306.38
Sum 933350000
Variance 7270300000
Memory size 31.5 MiB
Value Count Frequency (%)
60.0 54322 1.3%
57.0 42207 1.0%
25.0 13475 0.3%
50.0 12851 0.3%
55.0 11177 0.3%
30.0 10458 0.3%
58.0 8652 0.2%
20.0 7140 0.2%
500.0 5256 0.1%
23.0 5196 0.1%
Other values (11122) 274626 6.7%
(Missing) 3679119 89.2%

Minimum 5 values

Value Count Frequency (%)
-149.0 1 0.0%
1.0 593 0.0%
2.0 1331 0.0%
3.0 1367 0.0%
4.0 790 0.0%

Maximum 5 values

Value Count Frequency (%)
6123611.0 1 0.0%
6700665.0 1 0.0%
12909753.0 2 0.0%
13892937.0 1 0.0%
33655638.0 2 0.0%

nom
Categorical

Distinct count 222869
Unique (%) 5.4%
Missing (%) 6.3%
Missing (n) 258605
Value Count Frequency (%)
MARTIN 8207 0.2%
BERNARD 5324 0.1%
THOMAS 5188 0.1%
LAURENT 4600 0.1%
MICHEL 4316 0.1%
SIMON 4289 0.1%
NGUYEN 4262 0.1%
LEVY 4192 0.1%
COHEN 3912 0.1%
PETIT 3840 0.1%
Other values (222858) 3817744 92.6%
(Missing) 258605 6.3%

nom_prénom
Categorical

Distinct count 560681
Unique (%) 13.6%
Missing (%) 6.3%
Missing (n) 258585
Value Count Frequency (%)
COLMARD Patrick 1432 0.0%
CHIDIAC Jean 1334 0.0%
ULUSAKARYA Ayhan 1098 0.0%
ALMOHAMAD Wathek 1098 0.0%
CRUMBACH Jean Pierre 1091 0.0%
ORFEUVRE Hubert 1089 0.0%
HAYDAR Mazen 999 0.0%
LUPORSI Elisabeth 977 0.0%
LAMURAGLIA Michele 969 0.0%
ASSAF Elias 961 0.0%
Other values (560670) 3854846 93.5%
(Missing) 258585 6.3%

profession
Categorical

Distinct count 25
Unique (%) 0.0%
Missing (%) 10.4%
Missing (n) 430960
Value Count Frequency (%)
Médecin 3049899 73.9%
Pharmacien 242845 5.9%
Infirmier 242311 5.9%
Préparateur en pharmacie et préparateur en pharmacie hospitalière 71210 1.7%
Sage-femme 17007 0.4%
Chirurgien-dentiste 16002 0.4%
Manipulateur d’électroradiologie médicale 14002 0.3%
Aide soignant 8029 0.2%
Masseur-kinésithérapeute 7630 0.2%
Opticien-lunetier 7158 0.2%
Other values (14) 17426 0.4%
(Missing) 430960 10.4%

prénom
Categorical

Distinct count 47074
Unique (%) 1.1%
Missing (%) 6.3%
Missing (n) 261114
Value Count Frequency (%)
Philippe 86783 2.1%
Pierre 50686 1.2%
Michel 46447 1.1%
Catherine 43307 1.0%
Olivier 41901 1.0%
Isabelle 41843 1.0%
Francois 41121 1.0%
Alain 41027 1.0%
Patrick 38397 0.9%
Dominique 38062 0.9%
Other values (47063) 3393791 82.3%
(Missing) 261114 6.3%

specialité
Categorical

Distinct count 65
Unique (%) 0.0%
Missing (%) 46.7%
Missing (n) 1925737
Value Count Frequency (%)
Médecine Générale 554784 13.5%
Autre 472160 11.4%
Cardiologie et maladies vasculaires 138280 3.4%
Pneumologie 86128 2.1%
Rhumatologie 65622 1.6%
Ophtalmologie 63738 1.5%
Neurologie 61308 1.5%
Psychiatrie 60288 1.5%
Gastro-entérologie et hépatologie 59480 1.4%
Endocrinologie et métabolisme 52348 1.3%
Other values (54) 584606 14.2%
(Missing) 1925737 46.7%

structure_bénéficiaire
Categorical

Distinct count 72787
Unique (%) 1.8%
Missing (%) 93.7%
Missing (n) 3863441
Value Count Frequency (%)
ADARE INTERNATIONAL LIMITED 2162 0.1%
TERRE NEUVE 1011 0.0%
CONSEIL MEDIA SANTE 845 0.0%
IMS HEALTH 807 0.0%
VIVACTIS INNOVATIONS 806 0.0%
EDIMARK 770 0.0%
HOSPICES CIVILS DE LYON 736 0.0%
MCCANN HEALTHCARE 728 0.0%
ELSEVIER MASSON 632 0.0%
VIDAL ASSOCIES 601 0.0%
Other values (72776) 251940 6.1%
(Missing) 3863441 93.7%

type_declaration
Constant

This variable is constant and should be ignored for analysis

Constant value Convention

type_identifiant
Categorical

Distinct count 5
Unique (%) 0.0%
Missing (%) 0.0%
Missing (n) 0
Value Count Frequency (%)
RPPS 2910708 70.6%
AUTRE 1089711 26.4%
ORDRE 61185 1.5%
SIREN 56999 1.4%
FINESS 5876 0.1%

Sample

index ; annee ; categorie_generale ; categorie_precise ; catégorie_beneficiaire ; date ; declaration_id ; detail ; entreprise_émmetrice ; filiale_déclarante ; identifiant ; identifiant_convention ; identifiant_entreprise ; montant_ttc ; nom ; nom_prénom ; profession ; prénom ; specialité ; structure_bénéficiaire ; type_declaration ; type_identifiant
0 ; 2013 ; NaN ; NaN ; Professionnel de santé ; 11/12/2013 ; Convention | RHIKHDHD | 2013_S2_1626 ; MANIFESTAION A CARACTERE PROFESSIONNEL ET SCIE... ; VITALAIRE ; VITALAIRE ; 80000000789 ; Convention | RHIKHDHD | 2013_S2_1626 ; RHIKHDHD ; NaN ; GANOUN ; GANOUN Adila ; Médecin ; Adila ; Pneumologie ; NaN ; Convention ; AUTRE
1 ; 2013 ; NaN ; NaN ; Etablissement de santé ; 09/09/2013 ; Convention | SWJKGAWY | NET-22 ; PRESTATION ANALYSE STATISTIQUE - ; GAMBRO INDUSTRIES ; GAMBRO INDUSTRIES ; 210780581 ; Convention | SWJKGAWY | NET-22 ; SWJKGAWY ; NaN ; NaN ; NaN ; NaN ; NaN ; NaN ; CHU DIJON ; Convention ; FINESS
2 ; 2013 ; NaN ; NaN ; Personnes morales assurant la formation initia... ; 18/11/2013 ; Convention | MQKQLNIC | MANUEL_11383 ; Non renseigné - Manifestation : 'SUIVI DES VEN... ; SIGVARIS MANAGEMENT AG ; SIGVARIS ; 302695432 ; Convention | MQKQLNIC | MANUEL_11383 ; MQKQLNIC ; NaN ; NaN ; NaN ; NaN ; NaN ; NaN ; IMS HEALTH SAS ; Convention ; SIREN
3 ; 2016 ; NaN ; NaN ; Professionnel de santé ; 07/10/2016 ; Convention | ZHCILBYG | BARDVS201603141 ; MANIFESTATION PROFESSIONNELLE - Manifestation ... ; BARD FRANCE SAS ; BARD FRANCE SAS ; 10100677847 ; Convention | ZHCILBYG | BARDVS201603141 ; ZHCILBYG ; NaN ; LEROUX ; LEROUX Geoffroy ; Médecin ; Geoffroy ; Chirurgie générale ; NaN ; Convention ; RPPS
4 ; 2017 ; NaN ; NaN ; Etudiant ; 12/05/2017 ; Convention | ZJFVYRIK | NET-2017-S1-27464 ; FORMATION - ; Boiron ; BOIRON ; [SO] ; Convention | ZJFVYRIK | NET-2017-S1-27464 ; ZJFVYRIK ; NaN ; MUNO ; MUNO Anaelle ; Sage-femme ; Anaelle ; NaN ; NaN ; Convention ; AUTRE