bulletCover Story
Big data analysis reveals gene interactions in cancer
Big data analysis reveals gene interactions
in cancer

PolyU researchers have developed a novel big data platform for analysing gene interactions in cancer, which was not possible in previous observation. It has helped identify potential diagnostic and therapeutic target genes in leukemia.

  • Research team members (from left): Dr Cesar Wong, Prof. Benjamin Yung and Dr Lawrence Chan

    Research team members (from left): Dr Cesar Wong, Prof. Benjamin Yung and Dr Lawrence Chan

  • Researcher pipettes blood samples.

    Researcher pipettes blood samples.

 

One of the medical conditions not yet conquered by science is cancer, that group of diseases brought on by abnormal changes in genes. While cancer research is being given top priority worldwide, finding the genes responsible is a daunting task given that the human genome features more than 20,000 protein-coding genes.

Conventional method of identifying cancer gene markers focuses on changes in individual genes across the entire genome when cancer patients and healthy people are compared. The interactions among genes and their systemic changes that compose the complicated cancer mechanism are unfortunately neglected. The probable reason for this is that combinations of gene pairs increase geometrically with the number of genes of interest, so analysis could involve about 200 million possible pairs.

Moving away from the limitations of convention, a research team from the Department of Health Technology and Informatics at PolyU has developed a groundbreaking big data analysis platform that detects gene pair interactions in cancer with incomparable effectiveness. Critical to the team's success was its multidisciplinary nature, with a range of perspectives offered by its leaders. Prof. Benjamin Yung is a renowned biomedical scientist with specific expertise in cancer biology, trained and inspired by Prof. Arthur Kornberg, 1959 Nobel Laureate in Physiology or Medicine. Dr Cesar Wong is a clinical scientist specialising in cancer genomics and biomarker development related to cancers common in Asia. Dr Lawrence Chan takes a mathematical focus, with expertise in bioinformatics and artificial intelligence control.

Together, the trio brought their range of skills to bear on developing the platform, which can analyse the interactions of those 200 million gene pairs in only two days – an unprecedented scientific foundation for the analysis of cancer-specific difference in co-expression. Conventional method looking at individual genes would require 200 million experiments, which would be an almost infeasible undertaking.

Focusing on Chronic Myelogenous Leukemia (CML), a bone marrow cancer that appears most often later in life, the research team hypothesised that genes participating in the same cancer mechanism would be co-expressed. In other words, their levels of expression would be related to each other. That, in turn, would allow the team to detect gene co-expression networks and help them to decipher the mechanism underlying the cancer.

The team achieved this with help from biostatistics experts at Harvard University, who worked with them to perform a large-scale computer simulation that established a feasible method for analysing cancer-specific differences in co-expression. That again is indicative of how valuable a multidisciplinary approach can be, and of how important cross-institutional collaboration in areas such as chemistry, pathology and biostatistics is to scientific advances.