About
CV
Resume (english) CV (english)
Research
GRN Inference Bio-Inspired Subspace Clustering EvoMove: Musical Personal Companion EvoWave: Wi-Fi context analysis Socialist Discourse Analysis Data Mining For Chemo-Informatics
Teaching
Linear Algebra Stochastic Process Graph Theory Python Data Bases Software Development Software Deployment GRN Inference String Search Algorithms Writing Scientific Paper
Software
EvoWave_Chameleoclust package SubCMedians Knime package SubCMedians Python package GReNaDIne Python package GXN Python package
Side Projects

Data mining applied to chemo-informatics

Clustering of chemical compounds

In order to assess the Chameleoclust+ algorithm we decided to use it in a more practical application. Chameleoclust was used to clusterize a set of chemical molecules described in a high dimensional space of physical and chemical descriptors. This work has been carried out in collaboration with the Theoretical Chemistry team of the Universidad Mayor de San Andres. Readers are refered to Peignier and Cantañeta 2015 for a detailled description of this work (spanish version only).

Kernels for graph comparisons

The development of new drugs is a expensive and slow procedure. An important step in new molecules development is the “test” phase. This particularly slow and expensive step is significantly accelerated and cheapened if Structure-Activity Relationship Analysis (SAR) is applied to the tested chemical compounds. We studied here how to predict molecular activity with graph Kernels. Kernels4graphs implements several kernel based methods for classification of graphs. A basic SVM algorithm and 4 different graph Kernels implementations (Nth order walk kernel, Geometric walk kernel, Markovian random walk kernel and Subtree kernel) are provided. The different approaches have been applied to Structure-Activity Relationship Analysis (SAR) to predict the molecular activity of differente chemical compounds described as labeled graphs. This project can be found in its github repository.

Data mining applied to chemo-informatics

Clustering of chemical compounds

Kernels for graph comparisons

Secondary structure RNA prediction