Statistics & Big Data analysis

This Research Foundation, created in October 2013, aims to provide the statistical and computational knowledge to clinical & biological researchers in the analysis of large-scale data. We are also responsible for building and maintaining a computer infrastructure to implement effective solutions when analyze, visualize and interpret data.

The unit also aims to train and encourage knowledge acquisition, by offering courses and seminars, of new computational analysis techniques that allow better and deeper biological interpretation of the results achieved. The Unit comprises Biologists, Bioinformatics and computational Engineers. To ensure potential and scalable large and demanding computing capabilities, we provide the following specifications: 1) Computational infrastructure: The Computational Biology and Bioinformatics group hosts a High Performance Scientific Computing (HPSC) machine with 4 computing nodes (2 processors of 10 cores each), 4 computing nodes (2 processors of 8 cores each) and 1 storing node (2 processors of 8 cores each), and 21 terabytes of total size. The local network between the nodes is connected through Infiniband FDR, a low latency network of 56gb/sec.

The cluster has a total amount of 832 Gb of RAM memory (DDR3 1867 Mhz ECC). 2) Storage infrastructure: Features include a high efficient and secure storage system built upon 3 EMC Isilion nodes. These 3 NL400 GRID nodes represent a single volume in a single file system of a total size of 194.8TBs (redundancy N +2:1). It has 3 active controllers with 36GB distributed global memory and 24 processing cores and infiniband connectors for internal subnets. It also has a data self-balancing system with customizable auto-balance system policy and automatic failover in environments NFS.