Develop computational methods to segment multi-domain proteins

It can be hard to tell with certainty which single entity carries the function or if it��s some sort of combination thereof. Inter-domain similarities regions of high similarity not functionally related due to incomplete annotation, especially in the 3.5 to 5.0 bit score range, could be part of the reason for the high degree of variability in the relationship between bit score and RIC. In the future we intend to refine and develop computational methods to segment multi-domain proteins into their functional components in a reliable way. This may enable us to create much larger goldstandard data sets, with lower variability, with which to further develop, improve, and evaluate our model for protein functional annotation. Proteins annotated with GO terms were identified in the RefSeq and Uniprot databases. Of these, only annotations attributed by GO evidence code ����IDA����, indicating experimental evidence, were selected. The proteins from RefSeq and Uniprot were kept in two Byakangelicin separate data sets which made up our training and test data sets respectively. This technique is known as split-sample model validation and is a robust method of model selection and validation. To ensure independence between our data sets, proteins from the test set determined to be identical or subsequences of proteins in the training set were removed from the test set. The hepatitis C virus is a human pathogen responsible for acute and Garcinone-C chronic liver disease, infecting an estimated 130�C170 million individuals worldwide. Its variants are classified into six genotypes. In addition to cirrhosis and hepatocellular carcinoma, HCV induces several complications, including steatosis, dyslipidemia and insulin resistance. Accordingly, It is now considered to cause metabolic alterations instead of being simply a viral infection. In particular, much of the HCV life cycle including the entry into na? ��ve cells, infectivity, RNA replication, viral assembly and viral secretion closely associated with lipid metabolism. The combination of pegylated interferon -a plus ribavirin has provided a ��cure�� for a considerable proportion of patients with chronic hepatitis C, particularly when most patients have the favorable interleukin-28B genotype.