The Data Factory, a team within the Data Office
Created in 2017 at Institut Curie, the Data Office has a tranversal mission on the data governance. The department works closely with local teams and external partners for optimizing data access and data exploration in order to create value. Within the department, the Data Factory is in charge of the implementation of all the innovative projects on health data from big data warehouses building to research services development including artificial intelligence projects.
The Data Factory
Julien Guérin
Chief Data Officer
"After a master degree in Bioinformatics, I spent several years in
IT consulting (Capgemini) working on big IT architectures for
public ministries. I joined Institut Curie in 2013 to develop an
institutional strategy for biological and clinical data
integration and analysis. Since september 2022, I'm leading the Data Office at Institut Curie."
Flavien Gilles
Head of Data Factory
"After specializing in data science at the end of my engineering degree, I worked at a startup (Lifen) where I was first responsible for designing machine learning pipelines (mostly NLP)
to extract data from medical documents. That went from defining the labelling strategy to experimenting with algorithms, evaluating and serving models in production.
I eventually got to build and maintain data services around a data warehouse to support the data driven strategy of the company. After 5 years I decided to cross the bridge
and joined Institut Curie as Head of Data Factory in May 2023."
Thomas Balezeau
Lead Data Engineer
"I joined the Data Factory in 2016 with a Master of Bioinformatics. As a Data Engineer,
I'm involved in the development of a data ecosystem in oncology and the growth of services
for Institut Curie's stakeholders and beyond. Our objectives are to build and promote innovative
solutions for healthcare and cancer research."
Aurélien Legros
Data Engineer / Data Scientist
"I've got a Master degree in Econometrics and Statistics. I've been working in different environments (
bank, insurance, Ville de Paris) before joining Institut Curie in 2021. I bring my experience in data quality processes
and I'm involved in the development of several key data projects which deal with collaborative science (Fairspace) and
data pseudonymisation (Octopus).
"
Victor Nguyen
Data Engineer
"I worked as Data Engineer apprentice at the French Ministry for Armed Forces, learning data analysis and processing for 3 years during my Engineering degree.
At the end of my degree, I joined the Data Factory in september 2021 to work on the implementation of the CbioPortal in Curie and help the team
to develop new products for innovation in healthcare."
Jessica Henao Henao
Data Scientist / Student
"I'm following a Master degree in Big Data and Data mining at University of Paris 8. In the meantime, I'm working within
the Data Factory on the NEOSTRUCT project which aims to implement AI algorithms to predict pathologic complete response (pCR)
for patients with a breast cancer and treated by neoadjuvant chemotherapy. I'm also involved in data structuration and data
quality control processes."
Publications
Delrieu L, Hamy AS, Coussy F, Kassara A, Asselain B, Antero J, De Villèle P, Dumas E, Forstmann N, Guérin J, Hotton J, Jouannaud C, Milder M, Leopold A, Sedeaud A, Soibinet P, Toussaint JF, Vercamer V, Laas E, Reyal F. BMC Cancer. 2022 May 4;22(1):493. doi: 10.1186/s12885-022-09608-y.
OSIRIS: A Minimum Data Set for Data Sharing and Interoperability in Oncology
Guerin J, Laizet Y, Le Texier V, Chanas L, Rance B, Koeppel F, Lion F, Gourgou S, Martin AL, Tejeda M, Toulmonde M, Cox S, Hess E, Rousseau-Tsangaris M, Jouhet V, Saintigny P. JCO Clinical Cancer Informatics. 2021 :5; 256-265. doi: 10.1200/CCI.20.00094.
Comedications influence immune infiltration and pathological response to neoadjuvant chemotherapy in breast cancer
Hamy AS, Derosa L, Valdelièvre C, Yonekura S, Opolon P, Priour M, Guerin J, Pierga JY, Asselain B, De Croze D, Pinheiro A, Lae M, Talagrand LS, Laas E, Darrigues L, Grandal B, Marangoni E, Montaudon E, Kroemer G, Zitvogel L, Reyal F. Oncoimmunology. 2019 Nov 14;9(1):1677427. doi: 10.1080/2162402X.2019.1677427.
No impact of smoking status on breast cancer tumor infiltrating lymphocytes, response to neoadjuvant chemotherapy and prognosis
Simon V, Laot L, Laas E, Rozette S, Guerin J, Balezeau T, Nicolas M, Pierga JY, Coussy F, Laé M, De Croze D, Grandal B, Abecassis J, Dumas E, Lerebours F, Reyal F, Hamy AS. Cancers (Basel). 2020 Oct 12;12(10):2943. doi: 10.3390/cancers12102943.
Comedications influence immune infiltration and pathological response to neoadjuvant chemotherapy in breast cancer
Hamy AS, Derosa L, Valdelièvre C, Yonekura S, Opolon P, Priour M, Guerin J, Pierga JY, Asselain B, De Croze D, Pinheiro A, Lae M, Talagrand LS, Laas E, Darrigues L, Grandal B, Marangoni E, Montaudon E, Kroemer G, Zitvogel L, Reyal F. Oncoimmunology. 2019 Nov 14;9(1):1677427. doi: 10.1080/2162402X.2019.1677427.
Text mining in electronic medical records enables quick and efficient identification of pregnancy cases occurring after breast cancer
Labrosse J, Lam GT, Sebbag C, Benque M, Abdennebi I, Merckelbagh H, Osdoit M, Priour M, Guerin J, Balezeau T, Grandal B, Coussy F, Bobrie A, Ferrer L, Laas E, Feron JG, Reyal F, Hamy AS. JCO Clin Cancer Inform. 2019 Oct;3:1-12. doi: 10.1200/CCI.19.00031.
Lymphovascular invasion after neoadjuvant chemotherapy is strongly associated with poor prognosis in breast carcinoma
Hamy AS, Lam GT, Laas E, Darrigues L, Balezeau T, Guerin J, Livartowski A, Sadacca B, Pierga JY, Vincent-Salomon A, Coussy F, Becette V, Bonsang-Kitzis H, Rouzier R, Feron JG, Benchimol G, Laé M, Reyal F. Breast Cancer Res Treat. 2018 Jun;169(2):295-304. doi: 10.1007/s10549-017-4610-0.