Yi-Yuan Chiu | National Chiao Tung University (original) (raw)

Papers by Yi-Yuan Chiu

Research paper thumbnail of Predicting helix-helix interactions from residue contacts in membrane proteins

Bioinformatics/computer Applications in The Biosciences, 2009

Single residue and residue pair contact propensities were estimated using a beta-binomial model. ... more Single residue and residue pair contact propensities were estimated using a beta-binomial model. Let i denote the residue type. The data then consist of values n i and N i for each i indicating that there are n i contacts out of a total of N i possible contacts.

Research paper thumbnail of TMPad: an integrated structural database for helix-packing folds in transmembrane proteins

Nucleic Acids Research, 2011

a-Helical transmembrane (TM) proteins play an important role in many critical and diverse biologi... more a-Helical transmembrane (TM) proteins play an important role in many critical and diverse biological processes, and specific associations between TM helices are important determinants for membrane protein folding, dynamics and function. In order to gain insights into the above phenomena, it is necessary to investigate different types of helix-packing modes and interactions. However, such information is difficult to obtain because of the experimental impediment and a lack of a well-annotated source of helix-packing folds in TM proteins. We have developed the TMPad (TransMembrane Protein Helix-Packing Database) which addresses the above issues by integrating experimentally observed helix-helix interactions and related structural information of membrane proteins. Specifically, the TMPad offers pre-calculated geometric descriptors at the helix-packing interface including residue backbone/side-chain contacts, interhelical distances and crossing angles, helical translational shifts and rotational angles. The TMPad also includes the corresponding sequence, topology, lipid accessibility, ligand-binding information and supports structural classification, schematic diagrams and visualization of the above structural features of TM helix-packing. Through detailed annotations and visualizations of helix-packing, this online resource can serve as an information gateway for deciphering the relationship between helix-helix interactions and higher levels of organization in TM protein structure and function. The website of the TMPad is freely accessible to the public at

Research paper thumbnail of KIDFamMap: a database of kinase-inhibitor-disease family maps for kinase inhibitor selectivity and binding mechanisms

Nucleic Acids Research, Nov 28, 2012

Kinases play central roles in signaling pathways and are promising therapeutic targets for many d... more Kinases play central roles in signaling pathways and are promising therapeutic targets for many diseases. Designing selective kinase inhibitors is an emergent and challenging task, because kinases share an evolutionary conserved ATP-binding site. KIDFamMap (http://gemdock.life.nctu.edu.tw/KIDFamMap/) is the first database to explore kinase-inhibitor families (KIFs) and kinase-inhibitor-disease (KID) relationships for kinase inhibitor selectivity and mechanisms. This database includes 1208 KIFs, 962 KIDs, 55 603 kinase-inhibitor interactions (KIIs), 35 788 kinase inhibitors, 399 human protein kinases, 339 diseases and 638 disease allelic variants. Here, a KIF can be defined as follows: (i) the kinases in the KIF with significant sequence similarity, (ii) the inhibitors in the KIF with significant topology similarity and (iii) the KIIs in the KIF with significant interaction similarity. The KIIs within a KIF are often conserved on some consensus KIDFamMap anchors, which represent conserved interactions between the kinase subsites and consensus moieties of their inhibitors. Our experimental results reveal that the members of a KIF often possess similar inhibition profiles. The KIDFamMap anchors can reflect kinase conformations types, kinase functions and kinase inhibitor selectivity. We believe that KIDFamMap provides biological insights into kinase inhibitor selectivity and binding mechanisms.

Research paper thumbnail of GEMSCORE: A New Empirical Energy Function for Protein Folding

We have developed a new energy function, termed GEMSCORE, for the protein structure prediction, w... more We have developed a new energy function, termed GEMSCORE, for the protein structure prediction, which is an emergent problem in the field of computational structural biology. The GEMSCORE combines knowledge-based and physics-based energy functions. Instead of hundreds and thousands parameters used in many physics-based energy functions, we optimized nine weights of energy terms in the GEMSCORE by using a generic evolutionary method. These nine energy terms are the electrostatic, the der Waals, the hydrogen-bonding potential, and six terms for solvation potentials. The GEMSCORE has been evaluated on six decoy sets, including 96 proteins with more 70,000 structures. The result indicates that our method is able to successfully identify 74 native proteins from these 96 proteins. Our GEMSCORE is fast and simple to discriminate between native and nonnative structures from thousands of protein structure candidates in these decoy sets. We believe that the GEMSCORE is robust and should be a useful energy function for the protein structure prediction.

Research paper thumbnail of Soft energy function and generic evolutionary method for discriminating native from nonnative protein conformations

Journal of Computational Chemistry, 2008

We have developed a soft energy function, termed GEMSCORE, for the protein structure prediction, ... more We have developed a soft energy function, termed GEMSCORE, for the protein structure prediction, which is one of emergent issues in the computational biology. The GEMSORE consists of the van der Waals, the hydrogen-bonding potential and the solvent potential with 12 parameters which are optimized by using a generic evolutionary method. The GEMSCORE is able to successfully identify 86 native proteins among 96 target proteins on six decoy sets from more 70,000 near-native structures. For these six benchmark datasets, the predictive performance of the GEMSCORE, based on native structure ranking and Z-scores, was superior to eight other energy functions. Our method is based solely on a simple and linear function and thus is considerably faster than other methods that rely on the additional complex calculations. In addition, the GEMSCORE recognized 17 and 2 native structures as the first and the second rank, respectively, among 21 targets in CASP6 (Critical Assessment of Techniques for Protein Structure Prediction). These results suggest that the GEMSCORE is fast and performs well to discriminate between native and nonnative structures from thousands of protein structure candidates. We believe that GEMSCORE is robust and should be a useful energy function for the protein structure prediction.

Research paper thumbnail of An Evolutionary Approach with Pharmacophore-Based Scoring Functions for Virtual Database Screening

We have developed a new tool for virtual database screening. This tool, referred to as the Generi... more We have developed a new tool for virtual database screening. This tool, referred to as the Generic Evolutionary Method for molecular DOCKing (GEMDOCK), combines an evolutionary approach and a new pharmacophore-based scoring function. The former integrates discrete and continuous global search strategies with local search strategies to speed up convergence. The latter simultaneously serves as the scoring function of both molecular docking and post-docking analysis to improve the number of the true positives. We accessed the accuracy of our approach on HSV-1 thymidine kinase using a ligand database on which competing tools were evaluated. The accuracies of our predictions were 0.54 for the GH score and 1.62% for the false positive rate when the true positive rate was 100%. We found that our pharmacophore-based scoring function indeed is able to reduce the number of the false positives. These results suggest that GEMDOCK is robust and can be a useful tool for virtual database screening.

Research paper thumbnail of Predicting helix-helix interactions from residue contacts in membrane proteins

Bioinformatics/computer Applications in The Biosciences, 2009

Single residue and residue pair contact propensities were estimated using a beta-binomial model. ... more Single residue and residue pair contact propensities were estimated using a beta-binomial model. Let i denote the residue type. The data then consist of values n i and N i for each i indicating that there are n i contacts out of a total of N i possible contacts.

Research paper thumbnail of TMPad: an integrated structural database for helix-packing folds in transmembrane proteins

Nucleic Acids Research, 2011

a-Helical transmembrane (TM) proteins play an important role in many critical and diverse biologi... more a-Helical transmembrane (TM) proteins play an important role in many critical and diverse biological processes, and specific associations between TM helices are important determinants for membrane protein folding, dynamics and function. In order to gain insights into the above phenomena, it is necessary to investigate different types of helix-packing modes and interactions. However, such information is difficult to obtain because of the experimental impediment and a lack of a well-annotated source of helix-packing folds in TM proteins. We have developed the TMPad (TransMembrane Protein Helix-Packing Database) which addresses the above issues by integrating experimentally observed helix-helix interactions and related structural information of membrane proteins. Specifically, the TMPad offers pre-calculated geometric descriptors at the helix-packing interface including residue backbone/side-chain contacts, interhelical distances and crossing angles, helical translational shifts and rotational angles. The TMPad also includes the corresponding sequence, topology, lipid accessibility, ligand-binding information and supports structural classification, schematic diagrams and visualization of the above structural features of TM helix-packing. Through detailed annotations and visualizations of helix-packing, this online resource can serve as an information gateway for deciphering the relationship between helix-helix interactions and higher levels of organization in TM protein structure and function. The website of the TMPad is freely accessible to the public at

Research paper thumbnail of KIDFamMap: a database of kinase-inhibitor-disease family maps for kinase inhibitor selectivity and binding mechanisms

Nucleic Acids Research, Nov 28, 2012

Kinases play central roles in signaling pathways and are promising therapeutic targets for many d... more Kinases play central roles in signaling pathways and are promising therapeutic targets for many diseases. Designing selective kinase inhibitors is an emergent and challenging task, because kinases share an evolutionary conserved ATP-binding site. KIDFamMap (http://gemdock.life.nctu.edu.tw/KIDFamMap/) is the first database to explore kinase-inhibitor families (KIFs) and kinase-inhibitor-disease (KID) relationships for kinase inhibitor selectivity and mechanisms. This database includes 1208 KIFs, 962 KIDs, 55 603 kinase-inhibitor interactions (KIIs), 35 788 kinase inhibitors, 399 human protein kinases, 339 diseases and 638 disease allelic variants. Here, a KIF can be defined as follows: (i) the kinases in the KIF with significant sequence similarity, (ii) the inhibitors in the KIF with significant topology similarity and (iii) the KIIs in the KIF with significant interaction similarity. The KIIs within a KIF are often conserved on some consensus KIDFamMap anchors, which represent conserved interactions between the kinase subsites and consensus moieties of their inhibitors. Our experimental results reveal that the members of a KIF often possess similar inhibition profiles. The KIDFamMap anchors can reflect kinase conformations types, kinase functions and kinase inhibitor selectivity. We believe that KIDFamMap provides biological insights into kinase inhibitor selectivity and binding mechanisms.

Research paper thumbnail of GEMSCORE: A New Empirical Energy Function for Protein Folding

We have developed a new energy function, termed GEMSCORE, for the protein structure prediction, w... more We have developed a new energy function, termed GEMSCORE, for the protein structure prediction, which is an emergent problem in the field of computational structural biology. The GEMSCORE combines knowledge-based and physics-based energy functions. Instead of hundreds and thousands parameters used in many physics-based energy functions, we optimized nine weights of energy terms in the GEMSCORE by using a generic evolutionary method. These nine energy terms are the electrostatic, the der Waals, the hydrogen-bonding potential, and six terms for solvation potentials. The GEMSCORE has been evaluated on six decoy sets, including 96 proteins with more 70,000 structures. The result indicates that our method is able to successfully identify 74 native proteins from these 96 proteins. Our GEMSCORE is fast and simple to discriminate between native and nonnative structures from thousands of protein structure candidates in these decoy sets. We believe that the GEMSCORE is robust and should be a useful energy function for the protein structure prediction.

Research paper thumbnail of Soft energy function and generic evolutionary method for discriminating native from nonnative protein conformations

Journal of Computational Chemistry, 2008

We have developed a soft energy function, termed GEMSCORE, for the protein structure prediction, ... more We have developed a soft energy function, termed GEMSCORE, for the protein structure prediction, which is one of emergent issues in the computational biology. The GEMSORE consists of the van der Waals, the hydrogen-bonding potential and the solvent potential with 12 parameters which are optimized by using a generic evolutionary method. The GEMSCORE is able to successfully identify 86 native proteins among 96 target proteins on six decoy sets from more 70,000 near-native structures. For these six benchmark datasets, the predictive performance of the GEMSCORE, based on native structure ranking and Z-scores, was superior to eight other energy functions. Our method is based solely on a simple and linear function and thus is considerably faster than other methods that rely on the additional complex calculations. In addition, the GEMSCORE recognized 17 and 2 native structures as the first and the second rank, respectively, among 21 targets in CASP6 (Critical Assessment of Techniques for Protein Structure Prediction). These results suggest that the GEMSCORE is fast and performs well to discriminate between native and nonnative structures from thousands of protein structure candidates. We believe that GEMSCORE is robust and should be a useful energy function for the protein structure prediction.

Research paper thumbnail of An Evolutionary Approach with Pharmacophore-Based Scoring Functions for Virtual Database Screening

We have developed a new tool for virtual database screening. This tool, referred to as the Generi... more We have developed a new tool for virtual database screening. This tool, referred to as the Generic Evolutionary Method for molecular DOCKing (GEMDOCK), combines an evolutionary approach and a new pharmacophore-based scoring function. The former integrates discrete and continuous global search strategies with local search strategies to speed up convergence. The latter simultaneously serves as the scoring function of both molecular docking and post-docking analysis to improve the number of the true positives. We accessed the accuracy of our approach on HSV-1 thymidine kinase using a ligand database on which competing tools were evaluated. The accuracies of our predictions were 0.54 for the GH score and 1.62% for the false positive rate when the true positive rate was 100%. We found that our pharmacophore-based scoring function indeed is able to reduce the number of the false positives. These results suggest that GEMDOCK is robust and can be a useful tool for virtual database screening.