Laboratory of Bioinformatics and Systems Biology

Head of Laboratory:Ulykbek Kairov 

Research areas:

Laboratory of Bioinformatics and Systems Biology carries out research in the following directions:

  • Bioinformatics analysis of high-throughput genomic data from the next-generation sequencing platforms (Illumina HiSeq2000/MiSeq, Roche 454) and third generation sequencing (Oxford Nanopore).
  • Bioinformatics analysis and meta-analysis of tumor transcriptomes from high-throughput genomic platforms (Affymetrix/Illumina).
  • Human whole-genome and whole-exome analysis, development of Kazakh reference database and identification of specific genomic variants. 
  • Bioinformatics analysis of cancer transcriptome and gene expression data from next-generation sequencing platform Illumina HiSeq2000 / MiSeq.
  • Development of new integrative approaches and methods for analysis of human whole transcriptomes and whole genomes.
  • Development of computational bioinformatics facility and biomedical databases based on research results



Laboratory of Bioinformatics and Computational Systems Biology was established at the Center for Life Sciences, Nazarbayev University. The mission of the Lab of Bioinformatics and Computational Systems Biology is the development of Kazakhstani bioinformatics and systems biology by performing research, improving facility, establishing collaboration with the leading research groups as well as staff training. Scientific research of Laboratory focused on the study of biological processes of living organisms, using modern methods of bioinformatics and systems biology. The main research objects:

  1. Human whole-genome/whole exome
  2. Human whole-transcriptomes and cancer transcriptomes
  3. Bacterial whole-genomes (M.tuberculosis, N.meningitidis and others)
  4. Plant whole-transcriptomes


Currently, Laboratory actively collaborates with the following research centers:

  • Institute Curie, Paris, France (study of cancer diseases).
  • Harvard Medical School, USA (study of non-coding genomic variants).
  • Genomic Medicine Institute, Korea (human genome research).
  • Institute of Physical-Chemical Medicine, Moscow, Russia (study of bacterial genomes and microbiomes, development of biological databases).
  • University of Southern California, USA (study of alternative splicing).
  • Nazarbayev University SST and SENG (bioinformatics)
  • Al-Farabi Kazakh National University (plant transcriptomes)


Main research and bioinformatics facility:

  1. High-performance bioinformatics cluster “Q-Symphony”: Hewlett-Packard Enterprise – Apollo (208 Cores x Intel Xeon, 3.26 TB RAM, 258 ТB RAID HDD, RedHat Linux) – max computing performance 7.5 TFlops: specifically designed architecture optimized for bioinformatics research and analysis of big genomics datasets (whole-genome/whole transcriptomes datasets and genomics bulk datasets more than 100 samples simultaneously). 


  1. Small bioinformatics cluster Dell PowerEdge Cluster (64 x Cores AMD Opteron, 128 Gb RAM, 20 Tb RAID)  – high-performance server capable to perform tasks on analysis of small sample size next-generation sequencing data (human whole-genome, whole-exomes, whole transcriptomes analysis, up to 10 samples simultaneously).


  1. Server Dell PowerEdge (2x Intel Xeon (12 Cores), 64 Gb RAM, 8 Tb RAID, Windows Server) – medium-performance server on Windows platform for developing Windows-based bioinformatics methods and Matlab codes application.


  1. Сервер HP Z800 (2х Intel Xeon (16 Cores), 48 Gb RAM) – medium-performance server for performing bioinformatics analysis of viral and bacterial genomes.


  1. Synology Storage System (50 Tb Raid) – archiving server for massive storage and biomedical data encryption.