Skip to main content
Chapter 10 of 12
NCERT Solutions

Protein Informatics and Cheminformatics

CBSE · Class 11 · Biotechnology

NCERT Solutions for Protein Informatics and Cheminformatics — CBSE Class 11 Biotechnology.

Interactive on Super Tutor

Studying Protein Informatics and Cheminformatics? Get the full interactive chapter.

Quizzes, flashcards, AI doubt-solver and a step-by-step study plan — built for ncert solutions and more.

1,000+ Class 11 students started this chapter today

6 Questions Solved · 1 Section

EXERCISES — Protein Informatics and Cheminformatics

1What is the role of information technology in determination of protein properties?Show solution
Given/Concept: Information technology (IT) provides computational tools and databases that allow scientists to analyse protein sequences and structures without performing every experiment in the laboratory.

Answer:

Information technology plays a crucial role in the determination of protein properties in the following ways:

1. Sequence Analysis: IT tools allow rapid analysis of amino acid sequences to determine physicochemical properties such as molecular weight, isoelectric point (pI), instability index, aliphatic index, and GRAVY (Grand Average of Hydropathy) value. Servers like ProtParam (ExPASy) perform these calculations automatically from raw sequence data.

2. Secondary Structure Prediction: Tools such as APSSP, CPSSP, SOPMA, and GOR predict the secondary structural elements (α-helices, β-sheets, coils) of a protein from its primary sequence.

3. Domain and Motif Prediction: IT-based tools (e.g., PROSITE, Pfam, InterPro) identify functional domains and conserved motifs within a protein sequence.

4. 3D Structure Prediction: Computational methods like homology modelling, fold prediction, and de novo prediction use IT to build three-dimensional models of proteins whose structures have not been experimentally determined.

5. Database Management: Large biological databases (UniProt, PDB, NCBI) store and organise vast amounts of protein data, making it accessible for analysis worldwide.

Conclusion: Thus, information technology accelerates protein characterisation, reduces experimental cost and time, and enables large-scale proteomics studies.
2What type of protein raw data is used for computationally extracting information about the protein?Show solution
Given/Concept: Computational analysis of proteins begins with raw data that is fed into bioinformatics tools and servers.

Answer:

The primary types of protein raw data used for computationally extracting information are:

1. Amino Acid Sequence (Primary Sequence Data): The most fundamental raw data is the linear sequence of amino acids (in single-letter or three-letter code) obtained from gene sequencing or direct protein sequencing. This is used to calculate physicochemical properties, predict secondary structure, identify domains, and perform homology modelling.

2. Nucleotide Sequence Data: The coding DNA/mRNA sequence can be translated in silico to obtain the protein sequence, which is then used for further analysis.

3. 3D Coordinate Data (PDB files): Experimentally determined three-dimensional atomic coordinates from X-ray crystallography, NMR spectroscopy, or cryo-EM, stored in Protein Data Bank (PDB) format, serve as raw data for structural analysis and comparison.

4. Mass Spectrometry Data: Peptide mass fingerprinting data from mass spectrometry is used to identify proteins and post-translational modifications.

Conclusion: Among all these, the amino acid sequence is the most commonly used raw data for computational extraction of protein information, as it is the starting point for most bioinformatics analyses.
3Name any two common tools for domain prediction.Show solution
Given/Concept: Domain prediction tools identify conserved functional or structural regions within a protein sequence.

Answer:

Two common tools used for domain prediction are:

1. PROSITE — A database and tool that identifies protein domains, families, and functional sites using patterns and profiles derived from known protein sequences.

2. Pfam — A large database of protein families and domains. It uses Hidden Markov Models (HMMs) to detect conserved domains in a query protein sequence.

Additional examples (for reference): InterPro, SMART, CDD (Conserved Domain Database).

Conclusion: PROSITE and Pfam are two widely used bioinformatics tools for predicting and identifying functional domains in proteins.
4What is the significance of cheminformatics?Show solution
Given/Concept: Cheminformatics combines computational and informational techniques to solve chemistry-related problems, especially in drug discovery and chemical research.

Answer:

The significance of cheminformatics is as follows:

1. Drug Discovery and Design: Cheminformatics helps in identifying, designing, and optimising potential drug molecules by analysing chemical structures, properties, and biological activities. It enables virtual screening of large chemical libraries.

2. Pharmacophore Modelling: It provides a description of the molecular features (pharmacophore) required for a ligand to interact with a biological target, aiding in the design of new drugs.

3. Prediction of Molecular Properties: Physical and chemical properties such as solubility, lipophilicity (logP\log P), molecular weight, and hydrogen bonding capacity can be predicted computationally, saving time and resources.

4. Application of Lipinski's Rule of Five (RO5): Cheminformatics tools apply rules like RO5 to filter and select drug-like compounds with good oral bioavailability from large chemical databases.

5. Management of Chemical Data: It organises and manages large amounts of chemical information including 3D molecular crystal structures, reaction pathways, and spectral data in searchable databases.

6. Reduction of Experimental Cost: By predicting the behaviour of molecules computationally, cheminformatics reduces the need for expensive and time-consuming laboratory experiments.

Conclusion: Cheminformatics is significant because it accelerates the drug discovery process, improves the efficiency of chemical research, and bridges the gap between chemistry and information technology.
5Which of the following is not a rule in Lipinski's rule of five (RO5)?
(a) No more than 10 hydrogen bond receptors
(b) Partition coefficient log P of less than 5
(c) Not more than 5 hydrogen bond donors
(d) Molecular weight above 500 g/mol
Show solution
Correct Answer: (d) Molecular weight above 500 g/mol

Justification:

Lipinski's Rule of Five (RO5) states the following criteria for a drug-like compound with good oral bioavailability:

| Rule | Criterion |
|------|-----------|
| 1 | Molecular weight not more than (≤) 500 g/mol |
| 2 | Partition coefficient logP\log P less than 5 |
| 3 | Not more than 5 hydrogen bond donors |
| 4 | Not more than 10 hydrogen bond acceptors (receptors) |

Option (d) states "Molecular weight above 500 g/mol," which is the opposite of the actual rule. According to RO5, the molecular weight should be 500 g/mol or less, not above 500 g/mol. Therefore, option (d) is not a rule in RO5.

Options (a), (b), and (c) are all correctly stated rules of RO5.

Answer: (d) Molecular weight above 500 g/mol\boxed{\text{Answer: (d) Molecular weight above 500 g/mol}}
6Which of the following properties of protein is not included in primary structure prediction?
(a) Aliphatic index
(b) Fold prediction
(c) Instability index
(d) Isoelectric point
Show solution
Correct Answer: (b) Fold prediction

Justification:

Primary structure prediction (also called physicochemical characterisation) involves calculating properties directly from the amino acid sequence of a protein. These include:

- Aliphatic index — a measure of the relative volume occupied by aliphatic side chains; calculated from sequence. ✓
- Instability index — predicts the stability of a protein in a test tube; calculated from sequence. ✓
- Isoelectric point (pI) — the pH at which the protein carries no net charge; calculated from sequence. ✓

Fold prediction, on the other hand, is a method used to predict the three-dimensional (3D) structure of a protein. It belongs to tertiary structure prediction, not primary structure prediction. Fold prediction uses computational methods to identify which known structural fold a protein sequence is likely to adopt.

Therefore, fold prediction is not included in primary structure prediction.

Answer: (b) Fold prediction\boxed{\text{Answer: (b) Fold prediction}}

Stuck on a step?

Ask Super Tutor AI to explain any solution on this page in a simpler way — free, 24x7.

Ask a Doubt Free

Frequently Asked Questions

What are the important topics in Protein Informatics and Cheminformatics for CBSE Class 11 Biotechnology?
Protein Informatics and Cheminformatics covers several key topics that are frequently asked in CBSE Class 11 board exams. Focus on the core concepts listed on this page and practise related questions to build confidence.
How to score full marks in Protein Informatics and Cheminformatics — CBSE Class 11 Biotechnology?
Start by understanding all key concepts. Practise previous year questions from this chapter. Revise formulas and definitions regularly. Use flashcards for quick revision before the exam.
Where can I get free NCERT Solutions for Protein Informatics and Cheminformatics Class 11 Biotechnology?
This page has free step-by-step NCERT Solutions for every exercise question in Protein Informatics and Cheminformatics (CBSE Class 11 Biotechnology) — written the way examiners award marks: given, formula, working, answer.

Sources & Official References

Content is aligned to the official syllabus. Refer to the board website for the latest curriculum.

For serious students

Get the full Protein Informatics and Cheminformatics chapter — for free.

Quizzes, flashcards, AI doubt-solver and a step-by-step study plan for CBSE Class 11 Biotechnology.