Expertise

Hub members Have many expertise, covering most of the fields in bioinformatics and biostatistics. You'll find below a non-exhaustive list of these expertise

Search by keywords | Search by organisms

Searched keyword : Scientific computing

Related people (22)

Thomas BIGOT

Group : GIPhy - Embedded : Biology of Infection

I joined the C3BI Hub in 2016 after a curriculum widely dedicated to Bioinformatics studies, and more precisely to Phylogeny and Evolution, topics of my PhD thesis. At Institut Pasteur, I am involved in projects dealing with sequences homology : alignments, hmm profiles, making homologous family databases, kmers signatures. I am also a developer (Python / C++) with a solid interest in optimization as well as in developing usable tools for final user such as automated pipeline for metagenomics sequence analysis. I’m currently embedded in Marc Eloit’s team (80% of my work time). My main task in this team is to develop strategies to identify, in their metagenomics samples, new pathogens, or new combination pathogen / symptoms. The rest of my time, I manage small projects and participate to the Hub life. I am currently experimenting with functional programming (for now, using Python) and its applicability to bioinformatics issues.


Keywords
AlgorithmicsScientific computingSofware development and engineeringParallel computingGraph theory and analysis
Organisms
BacteriaFungiVirus
Projects (8)

Freddy CLIQUET


One of my projects consists in developing GRAVITY, a java tool based on Cytoscape to integrate genetic variants within protein-protein interaction networks to allow the visual and statistical interpretation of next-generation sequencing data, ultimately helping geneticists and clinicians to identify causal variants and better diagnose their patients. I’m also involved in several other projects in the lab, taking part in the design of pipelines for the processing and the analysis of genomics data, including SNP arrays, whole-exome and whole-genome sequencing data. This means being confronted to the big data problematic, the unit having to manage hundreds of terabytes of genomics data. Finally, I am now analysing these data in order to identify possible causes for autism, to help clinicians with their diagnosis but also to better understand the biological mechanisms at play in this complex disease. This is done through the project aiming at understanding the genetic architecture of autism in the Faroe Islands, and also with the newly starting IMI2 European project AIMS2-Trials.


Keywords
AlgorithmicsData managementData VisualizationGenomicsMachine learningProteomicsGenome analysisBiostatisticsProgram developmentScientific computingApplication of mathematics in sciencesExploratory data analysisSofware development and engineeringData and text miningGenetics
Organisms

Projects (0)

    Thomas COKELAER

    Group : DETACHED - Detached : Biomics

    I joined the Bioinformatics and Biostatistics Hub at Institut Pasteur in 2016 where I am currently developing pipelines related to NGS for the Biomics Pôle. I have an interdisciplinary research experience: after a PhD in Astronomy (gravitational wave data analysis), I joined several research institute to work in the fields of plant modelling (INRIA, Montpellier, 2008-2011), System Biology — in particular logical modelling (EMBL-EBI Cambridge, U.K., 2011-2015), and drug discovery (Sanger Institute, Cambridge, U.K.), 2015). On a daily basis, I use data analysis and machine learning techniques within high-quality software to tackle scientific problems.


    Keywords
    AlgorithmicsData managementData VisualizationGenome assemblyGenomicsMachine learningModelingScientific computingDatabases and ontologiesSofware development and engineeringData and text miningIllumina HiSeqGraph theory and analysisIllumina MiSeq
    Organisms

    Projects (2)

    Anđela DAVIDOVIĆ

    Group : SINGLE - Embedded :

    I have a joint MSc degree in Mathematical Modelling from three European universities: University of L’Aquila (Italy), University of Nice-Sophia Antipolis (France) and Autonomous University of Barcelona (Spain). I also hold a PhD degree in Applied Mathematics and Scientific Computing from University of Bordeaux, France. I have done my PhD and one year of post-doc at INRIA Bordeaux Sud-Ouest, and partially at IHU-Liryc. During this time I studied how electrical signals propagate through the cardiac tissue under certain diseased conditions. My model of interest was the bidomain model, which is a system of partial differential equations that takes into account physiological properties of the cardiac cells and the spatial organization of the cardiac tissue. I worked on the mathematical multiscale analysis and numerical simulations of the problem to understand how structural changes of the tissue affect the propagation of the signal on the heart level. I collaborated with biologists and engineers of the IHU-Liryc to apply my model on a rat heart using high-resolution MRI data. For this I also worked on image analysis and image processing. I’ve joined the Institute Pasteur in February 2018 as a member of the HUB in Bioinformatics and Biostatistics. Currently I am working on stochastic mathematical modeling and inference for systems biology, gene expression, RNA transcription, etc.


    Keywords
    ModelingScientific computingApplication of mathematics in sciencesGraphics and Image Processing
    Organisms
    BacteriaFungiInsect or arthropodEscherichia coliSaccharomyces cerevisiaeFly
    Projects (2)

    Olivia DOPPELT-AZEROUAL

    Group : WINTER - Hub Core

    ONGOING PROJECTS Galaxy administration/Maintenance (https://galaxy.web.pasteur.fr) Bioweb: Future directory of bioinformatics resources at the Institut Pasteur ELIXIR Registry SKILLS Galaxy: administration, API/Bioblend expertise Programming: Python, Javascript, Lua, R, Development tools: GIT, Subversion, Emacs Database: NoSQL (couchdb), MySQL, PostgreSQL Bioinformatics: Preprocessing NGS data, MED-SuMo, Protein surface comparison, Protein functional annotation. OTHER ACTIVITIES C3BI seminars and meetings management Involved in Galaxy France Working Group (IFB) FORMER PROJECTS MetaGenSense(https://metagensense.web.pasteur.fr) Disco-Bac (https://disco-bac.web.pasteur.fr)


    Keywords
    Data managementSequence analysisStructural bioinformaticsDatabaseProgram developmentScientific computingLIMS
    Organisms

    Projects (5)

    Amine GHOZLANE

    Group : SINGLE - Detached : Biomics

    After a PhD in informatics on graph analysis (metabolic networks and sRNA-mRNA interaction graphs) at the LaBRI (Université de Bordeaux), I joined the DSIMB team (INTS) for a post-doc on structural modeling. Then, I performed a second post-doc at Metagenopolis – INRA Jouy-en-Josas, where I was initiated to the analysis of metagenomic data. I was recruited at the HUB in 2015, and since I pursue the development of methods dedicated to the treatment of metagenomic data by combining either the treatment of sequencing data, the statistics, the protein structural modeling and the graph analysis.


    Keywords
    AlgorithmicsClusteringGenome assemblyGenomicsMetabolomicsModelingNon coding RNASequence analysisStructural bioinformaticsTargeted metagenomicsDatabaseGenome analysisBiostatisticsProgram developmentScientific computingDatabases and ontologiesExploratory data analysisData and text miningIllumina HiSeqComparative metagenomicsRead mappingIllumina MiSeqSequence homology analysisGene predictionMultidimensional data analysisSequencingShotgun metagenomics
    Organisms

    Projects (21)

    Bernd JAGLA

    Group : PLATEFORM - Detached : Biomarker Discovery

    Bernd Jagla received his PhD in bioinformatics (department of Biology, Chemistry, and Parmacy) from the Free University in Berlin, Germany in 1999. Before joining the Institut Pasteur, he worked for almost ten years in New York City, including as an associate research scientist in the Joint Centers for System Biology (Columbia University) and at the Columbia University Screening Center led by Dr J.E. Rothman. He joined the Institut Pasteur in 2009 to take charge of the bioinformatic needs at the Transcriptome et Epigenome platform, focusing on Next Generation Sequencing. As of 2016 he is member of the C3BI – HUB Team detached to the Human immunology center (CIH) and provides support for cytometry, next generation sequencing, and microarray data analysis. His areas of interest include the quality assurance and data analysis and visualization at the facility. He also has strong expertise in developing algorithms for function prediction from sequence data, image analysis, analysis of mass spectrometry data, workflow management systems. While at Pasteur he developed: KNIME extensions for Next Generation Sequencing (Link) Post Alignment Visualization and Characterization of High-Throughput Sequencing Experiments (Link) Post Alignment statistics of Illumina reads (Link)


    Keywords
    AlgorithmicsChIP-seqData managementData VisualizationImage analysisMachine learningSequence analysisDatabaseGenome analysisBiostatisticsProgram developmentScientific computingData and text miningIllumina HiSeqGraphics and Image ProcessingIllumina MiSeqHigh Throughput ScreeningFlow cytometry/cell sortingPac Bio
    Organisms

    Projects (1)

    Rachel LEGENDRE

    Group : PLATEFORM - Detached : Biomics

    Rachel Legendre is a bioinformatics engineer. She completed her master degree in apprenticeship for two years at INRA in Jouy-en-Josas in the Genetic Animal department. She was involved in a project aiming at the detection and the expression analysis of micro-RNA involved in an equine disease. In 2012, she joined the Genomic, Structure and Translation Team at Paris-Sud (Paris XI) university. She worked principally on Ribosome Profiling data analysis, a new technique that allows to identify the position of the ribosome on the mRNA at the nucleotide level. Since November 2015, she joined the Bioinformatics and Biostatistics HUB at Pasteur Institute and she’s detached to the Biomics Pole in C2RT, where she is in charge of the bioinformatics analyses for transcriptomics and epigenomics projects. She’s also involved in Long Reads (PacBio and Nanopore) developments with other bioinformaticians of Biomics Pole.


    Keywords
    AlgorithmicsChIP-seqEpigenomicsNon coding RNATranscriptomicsGenome analysisProgram developmentScientific computingSofware development and engineeringIllumina HiSeqRead mappingSequencingWorkflow and pipeline developmentChromatin accessibility assaysPac BioRibosome profiling
    Organisms
    BacteriaFungiParasiteHumanInsect or arthropodOther animal
    Projects (11)

    Frédéric LEMOINE


    After a Master degree in bioinformatics and biostatistics, I did a PhD in computer science / bioinformatics at University Paris-Sud (now in University Paris-Saclay), where I worked on integration and analysis of comparative genomics data. After a postdoc in Lausanne, Switzerland where I worked on small-RNA sequencing data, I joined GenoSplice where I was responsible for the development of bioinformatics projects related to next generation sequencing. I joined Institut Pasteur in Nov. 2015, to work in the Evolutionary Bioinformatics Unit and participate in the development of new tools and algorithms that are able to tackle efficiently the ever increasing amount of sequencing data.


    Keywords
    AlgorithmicsData managementPhylogeneticsSequence analysisDatabaseGenome analysisProgram developmentScientific computingDatabases and ontologiesSequencingWorkflow and pipeline development
    Organisms

    Projects (0)

      Nicolas MAILLET

      Group : ALPS - Embedded : Structural Virology

      After a PhD in bioinformatics at Inria/IRISA, Université de Rennes 1, Rennes (France), under the supervision of Dominique Lavenier and Pierre Peterlongo, I did a postdoc in bioinformatics at Laboratory of Ecology and Evolution of Plankton in Stazione Zoologica Anton Dohrn of Naples, Italy. Both my thesis and my postdoc were about the Tara Oceans projet and the development of new software to analyze huge quantities of raw reads coming from metagenomics sample. I am currently occupying a research engineer position at the Hub as leader of ALPS group and focus on several different computing problems including metagenomics, protein assembly and several short term developments.


      Keywords
      AlgorithmicsData managementProteomicsDatabaseProgram developmentScientific computingSofware development and engineeringComparative metagenomics
      Organisms

      Projects (8)

      Christophe MALABAT

      Group : HEAD - Hub Core

      After a PhD in biochemistry of the rapeseed proteins, during which I developed my first automated scripts for handling data processing and analysis, I join Danone research facility center for developing multivariate models for the prediction of milk protein composition using infrared spectrometry.
      As I was already developing my own informatics tools, I decided to join the course of informatic for biology of the Institut Pasteur in 2007. At the end of the course I was recruited by the Institute and integrate the unit of “génétique des interactions macromoléculaires” of Alain Jacquier. Within this group, I learn to handle sequencing data and I developed processing and analysis tools using python and R. I also create a genome browser and database system for storing, retrieving and visualizing microarray data. After 8 years within the Alain Jacquier’s lab, I join the Hub of bioinformatics and biostatistics as co-head of the team.


      Keywords
      ClusteringData managementSequence analysisTranscriptomicsWeb developmentDatabaseGenome analysisProgram developmentScientific computingExploratory data analysisData and text miningIllumina HiSeqRead mappingLIMSIllumina MiSeqHigh Throughput ScreeningMultidimensional data analysisWorkflow and pipeline developmentRibosome profilingMotifs and patterns detection
      Organisms

      Projects (10)

      Fabien MAREUIL

      Group : WINTER - Hub Core

      After a Master degree in Genome Analysis and Molecular Modeling at Denis Diderot University, I did a PhD in NMR / bioinformatics at Denis Diderot University, where I worked on the development and use of a software named DaDiModO which uses SAXS data and RDC/NMR data to calculate models of structural proteins. After a postdoc aiming to adapt ARIA software to allow execution on computing grid in the Structural Bioinformatic Team at Institut Pasteur in collaboration with IBCP, I joined CIB/DSI Team where I was responsible for the development of bioinformatics projects and the deployment, maintenance and evolution of the Pasteur Galaxy server. I joined the Hub/C3BI team in 2017 as research engineer where I’m involved in several projects such as structural bioinformatics, softwares and web development. I am also in charge of the maintenance of the Galaxy Pasteur instance.


      Keywords
      Data managementStructural bioinformaticsDatabaseProgram developmentScientific computingDatabases and ontologiesGrid and cloud computing
      Organisms
      Non applicable
      Projects (9)

      Bertrand NÉRON

      Group : ALPS - Hub Core

      Activities Contact for any subject related to IFB. Help scientists to develop new tools (architecture, design, implementation). animate the Python Working Group at pasteur . O|B|F (http://www.open-bio.org/) member. Skills Strong programming experience in Python. Software architecture and design. NoSQL DataBase (MongoDB, CouchDB) XML/YAML continuous integration (github/travis-CI/readthedocs, gitlab/gitlab-CI) containers (Docker, Singularity) linux (Gentoo, Xubuntu) IFB developer Main projects on the campus Mobyle http://Mobyle.pasteur.fr Mobyle: a new full web bioinformatics framework IntegronFinder (ongoing project) MacsyFinder (ongoing project) githubaccess to my projects on github Teaching Unix (Unix-I , Unix-II) Python . Education 2002 Phd in Molecular and cellular biology. “Rôle de deux protéines QN1 et PATF impliquées dans l’arrêt de prolifération des cellules de la neurorétine aviaire au cours du developpement”. 2001 “Informatique En Biologie” course (Pasteur)


      Keywords
      Data managementDatabaseProgram developmentScientific computingDatabases and ontologies
      Organisms
      Non applicable
      Projects (11)

      Thomas OBADIA


      Thomas is a biostatistician who holds an engineering degree in Agronomy (Agrocampus Ouest, Rennes, France). He also holds a Ph.D. in biostatistics from Université Pierre et Marie Curie for his work on the spread of nosocomial pathogens on contact networks. During his Ph.D at INSERM, he investigated how high-resolution dynamical contact data could support infection-tracing conducted using more traditional approaches in healthcare settings, e.g. routine swabbing and genetic characterization of strains detected in patients or healthcare workers. He developed a new statistical framework to test the correlation between dynamic close-proximity interaction networks and biological carriage data. While at INSERM, he also developed the R0 package for R that aimed at implementing several computation methods used in estimating reproduction parameters for emerging transmissible diseases. After working as a statistical modeller for a private company in the pharmaceutical industry, he joined the Hub in 2016 as a statistician and is now involved in the projects of the Malaria: parasites and hosts unit headed by Ivo Mueller.


      Keywords
      ModelingBiostatisticsScientific computingApplication of mathematics in sciencesClinical researchEpidemiology and public health
      Organisms

      Projects (3)

      Natalia PIETROSEMOLI

      Group : SysBio - Hub Core

      Dr. Natalia Pietrosemoli is an Engineer with a M. Sc. in Modeling and Simulation of Complex Realities from the International Center for Theoretical Physics, ICTP and the International School of Advanced Studies, SISSA (Triest, Italy). During her M. Sc. internships she mostly worked in modeling, optimization, combinatorics and information theory applied to medical imaging. In 2012 she got a Ph. D in Computational Biology from the School of Bioengineering of Rice University (Houston, TX, US), where she specialized in computational structural biology and functional genomics. Her doctoral thesis “Protein functional features extracted with from primary sequences : a focus on disordered regions”, contributed to a better understanding of the functional and evolutionary role of intrinsic disorder in protein plasticity, complexity and adaptation to stress conditions. As part of her Ph. D., Natalia was a visiting scholar in two labs in Madrid: the Structural Computational Biology Group at the Spanish National Cancer Research Centre (CNIO), where she mainly worked in sequence analysis and the functional-structural relationships of proteins, and the Computational Systems Biology Group at the Spanish National Centre for Biotechnology (CNB-CSIC ), where she studied the functional implications of intrinsically disordered proteins at the genomic level for several organisms, collaborating with different experimental and theoretical groups. In 2013, she joined the Swiss Institute of Bioinformatics as a postdoctoral fellow in the Bioinformactics Core Facility. Her main project consisted in the molecular classification of a rare type of lymphoma, which involved the integration of transcriptomic, clinical and mutational data for the identification of molecular markers for classification, diagnosis and prognosis. This work was performed in collaboration with the Pathology Institute at the University Hospital of Lausanne (CHUV). In November of 2015 Natalia joined the Hub Team @ Pasteur C3BI as a Senior Bioinformatician. Natalia is especially interested in the integrative analysis of different omics data, both at large-scale and for small datasets, and loves collaborating in interdisciplinary environments and having feedback from her fellow experimental colleagues. Currently, she’s coordinating several projects performing functional and pathway analysis at the genomic level. By grouping genes, proteins and other biological molecules into the pathways they are involved in, the complexity of the analyses is significantly reduced, while the explanatory power increases with respect to having a list of differentially expressed genes or proteins.


      Keywords
      AlgorithmicsData managementGenomicsImage analysisMachine learningModelingProteomicsSequence analysisStructural bioinformaticsTranscriptomicsDatabaseGenome analysisBiostatisticsScientific computingDatabases and ontologiesApplication of mathematics in sciencesData and text miningGeneticsGraphics and Image ProcessingBiosensors and biomarkersClinical researchCell biology and developmental biologyInteractomicsBioimage analysis
      Organisms

      Projects (28)

      Rachel TORCHET

      Group : WINTER - Hub Core

      In 2012 I completed my master degree at the MicroScope Platform located at Genoscope (the French National Sequencing Center). I was involved in a project aiming at the management of evolution projects which rely on the Next Generation Sequencing (NGS) technologies to try to decipher the dynamics of genomic changes as well as the molecular bases and the mechanisms underlying adaptative evolution of micro-organisms (Remigi et al. 2014). Since November 2014, I joined the Bioinformatics and Biostatistics HUB at Institut Pasteur. I participated to the creation and updates of the C3BI website. I joined the WINTER group where I’m in charge of web and interface development projects. I have completed an UX-Design training to add extra value to my front-end development skills. I design and develop bioinformatics tools and interfaces that are users oriented.


      Keywords
      Data VisualizationWeb developmentDatabaseGenome analysisScientific computingDatabases and ontologiesSofware development and engineeringWorkflow and pipeline development
      Organisms

      Projects (5)

      Hugo VARET

      Group : PLATEFORM - Detached : Biomics

      Hugo Varet is a biostatistician engineer from the Ensai (Ecole Nationale de la Statistique et de l’Analyse de l’Information) and has been recruited by the hub of the C3BI (Center of Bioinformatics, Biostatistics and Integrative Biology) to work at the Transcriptome & Epigenome Platform. He is in charge of the statistical analyses of the RNA-Seq data produced by the platform and develops R pipelines that help in this task. One of them is named SARTools and is available on GitHub: https://github.com/PF2-pasteur-fr/SARTools.


      Keywords
      ModelingSequence analysisStatistical inferenceTranscriptomicsBiostatisticsScientific computingApplication of mathematics in sciencesExploratory data analysisHigh Throughput ScreeningClinical research
      Organisms

      Projects (17)