The Data Factory, a team within the Data Office

Created in 2017 at Institut Curie, the Data Office has a tranversal mission on the data governance. The department works closely with local teams and external partners for optimizing data access and data exploration in order to create value. Within the department, the Data Factory is in charge of the implementation of all the innovative projects on health data from big data warehouses building to research services development including artificial intelligence projects.


The Data Factory


Julien Guérin

Chief Data Officer

"After a master degree in Bioinformatics, I spent several years in IT consulting (Capgemini) working on big IT architectures for public ministries. I joined Institut Curie in 2013 to develop an institutional strategy for biological and clinical data integration and analysis. Since september 2022, I'm leading the Data Office at Institut Curie."

Flavien Gilles

Head of Data Factory

"After specializing in data science at the end of my engineering degree, I worked at a startup (Lifen) where I was first responsible for designing machine learning pipelines (mostly NLP) to extract data from medical documents. That went from defining the labelling strategy to experimenting with algorithms, evaluating and serving models in production. I eventually got to build and maintain data services around a data warehouse to support the data driven strategy of the company. After 5 years I decided to cross the bridge and joined Institut Curie as Head of Data Factory in May 2023."


Thomas Balezeau

Lead Data Engineer

"I joined the Data Factory in 2016 with a Master of Bioinformatics. As a Data Engineer, I'm involved in the development of a data ecosystem in oncology and the growth of services for Institut Curie's stakeholders and beyond. Our objectives are to build and promote innovative solutions for healthcare and cancer research."

Aurélien Legros

Data Engineer / Data Scientist

"I've got a Master degree in Econometrics and Statistics. I've been working in different environments ( bank, insurance, Ville de Paris) before joining Institut Curie in 2021. I bring my experience in data quality processes and I'm involved in the development of several key data projects which deal with collaborative science (Fairspace) and data pseudonymisation (Octopus). "


Victor Nguyen

Data Engineer

"I worked as Data Engineer apprentice at the French Ministry for Armed Forces, learning data analysis and processing for 3 years during my Engineering degree. At the end of my degree, I joined the Data Factory in september 2021 to work on the implementation of the CbioPortal in Curie and help the team to develop new products for innovation in healthcare."

Jessica Henao Henao

Data Scientist / Student

"I'm following a Master degree in Big Data and Data mining at University of Paris 8. In the meantime, I'm working within the Data Factory on the NEOSTRUCT project which aims to implement AI algorithms to predict pathologic complete response (pCR) for patients with a breast cancer and treated by neoadjuvant chemotherapy. I'm also involved in data structuration and data quality control processes."



Former members


  • Johan Archinard
  • Pier-Francesco Rocci
  • Armand Léopold
  • Oliver Hijano-Cubelos
  • Amel Yahou


Publications


Digital phenotyping in young breast cancer patients treated with neoadjuvant chemotherapy (the NeoFit Trial): protocol for a national, multicenter single-arm trial
Delrieu L, Hamy AS, Coussy F, Kassara A, Asselain B, Antero J, De Villèle P, Dumas E, Forstmann N, Guérin J, Hotton J, Jouannaud C, Milder M, Leopold A, Sedeaud A, Soibinet P, Toussaint JF, Vercamer V, Laas E, Reyal F. BMC Cancer. 2022 May 4;22(1):493. doi: 10.1186/s12885-022-09608-y.

OSIRIS: A Minimum Data Set for Data Sharing and Interoperability in Oncology
Guerin J, Laizet Y, Le Texier V, Chanas L, Rance B, Koeppel F, Lion F, Gourgou S, Martin AL, Tejeda M, Toulmonde M, Cox S, Hess E, Rousseau-Tsangaris M, Jouhet V, Saintigny P. JCO Clinical Cancer Informatics. 2021 :5; 256-265. doi: 10.1200/CCI.20.00094.

Comedications influence immune infiltration and pathological response to neoadjuvant chemotherapy in breast cancer
Hamy AS, Derosa L, Valdelièvre C, Yonekura S, Opolon P, Priour M, Guerin J, Pierga JY, Asselain B, De Croze D, Pinheiro A, Lae M, Talagrand LS, Laas E, Darrigues L, Grandal B, Marangoni E, Montaudon E, Kroemer G, Zitvogel L, Reyal F. Oncoimmunology. 2019 Nov 14;9(1):1677427. doi: 10.1080/2162402X.2019.1677427.

No impact of smoking status on breast cancer tumor infiltrating lymphocytes, response to neoadjuvant chemotherapy and prognosis
Simon V, Laot L, Laas E, Rozette S, Guerin J, Balezeau T, Nicolas M, Pierga JY, Coussy F, Laé M, De Croze D, Grandal B, Abecassis J, Dumas E, Lerebours F, Reyal F, Hamy AS. Cancers (Basel). 2020 Oct 12;12(10):2943. doi: 10.3390/cancers12102943.

Comedications influence immune infiltration and pathological response to neoadjuvant chemotherapy in breast cancer
Hamy AS, Derosa L, Valdelièvre C, Yonekura S, Opolon P, Priour M, Guerin J, Pierga JY, Asselain B, De Croze D, Pinheiro A, Lae M, Talagrand LS, Laas E, Darrigues L, Grandal B, Marangoni E, Montaudon E, Kroemer G, Zitvogel L, Reyal F. Oncoimmunology. 2019 Nov 14;9(1):1677427. doi: 10.1080/2162402X.2019.1677427.

Text mining in electronic medical records enables quick and efficient identification of pregnancy cases occurring after breast cancer
Labrosse J, Lam GT, Sebbag C, Benque M, Abdennebi I, Merckelbagh H, Osdoit M, Priour M, Guerin J, Balezeau T, Grandal B, Coussy F, Bobrie A, Ferrer L, Laas E, Feron JG, Reyal F, Hamy AS. JCO Clin Cancer Inform. 2019 Oct;3:1-12. doi: 10.1200/CCI.19.00031.

Lymphovascular invasion after neoadjuvant chemotherapy is strongly associated with poor prognosis in breast carcinoma
Hamy AS, Lam GT, Laas E, Darrigues L, Balezeau T, Guerin J, Livartowski A, Sadacca B, Pierga JY, Vincent-Salomon A, Coussy F, Becette V, Bonsang-Kitzis H, Rouzier R, Feron JG, Benchimol G, Laé M, Reyal F. Breast Cancer Res Treat. 2018 Jun;169(2):295-304. doi: 10.1007/s10549-017-4610-0.