Free Shipping on Orders $200+
Research Article

Amino Acid Properties Reference: Molecular Weights, Codes, and Physicochemical Data

Complete reference table of the 20 standard amino acids with three-letter codes, single-letter codes, molecular weights, side chain pKa values, hydrophobicity indices, and physicochemical properties for peptide research applications.

Published: March 6, 202615 min read

Amino Acid Nomenclature and Coding Systems

The twenty standard amino acids that constitute the building blocks of peptides and proteins are each designated by two standardized abbreviation systems: a three-letter code and a single-letter code. These nomenclature systems were established by the IUPAC-IUB Joint Commission on Biochemical Nomenclature (JCBN) and are universally used in biochemistry, molecular biology, and peptide chemistry. Familiarity with both coding systems is essential for interpreting peptide sequences on Certificates of Analysis, research publications, and structural databases.

The three-letter codes (e.g., Ala for alanine, Gly for glycine, Phe for phenylalanine) are derived from the first three letters of the amino acid name in most cases, with exceptions for asparagine (Asn, to distinguish from aspartate, Asp), glutamine (Gln, to distinguish from glutamate, Glu), isoleucine (Ile), and tryptophan (Trp). Three-letter codes are the standard format for written descriptions of peptide sequences in COA documentation, product catalogs, and detailed experimental protocols.

Single-letter codes provide a compact representation for longer sequences and are the standard format for bioinformatics databases (UniProt, PDB, NCBI), sequence alignment tools, and mass spectrometry analysis software. The assignment of single-letter codes is partially mnemonic but becomes arbitrary where multiple amino acids share the same first letter. The original assignments, as documented by the IUPAC-IUB Commission (European Journal of Biochemistry, 1984, DOI: 10.1111/j.1432-1033.1984.tb07877.x), use intuitive assignments where possible: A for alanine, G for glycine, L for leucine, V for valine. Others use phonetic or associative mnemonics: F for phenylalanine (sounds like F), W for tryptophan (the indole ring resembles a W), Y for tyrosine (sounds like Y).

In addition to the 20 standard codes, several special symbols are used in sequence notation. The letter B represents Asx (asparagine or aspartate, when the identity is ambiguous), Z represents Glx (glutamine or glutamate), J represents Xle (leucine or isoleucine, which are isobaric and indistinguishable by mass spectrometry), and X represents any amino acid. These ambiguity codes appear primarily in sequence analysis contexts and should not appear on COAs, where the sequence should be fully determined. For a comprehensive database of peptide structures and sequences, see our structural peptide database.

Complete Amino Acid Properties Table

The following table presents the physicochemical properties of all 20 standard amino acids. Molecular weights are given as average isotopic masses (used for gravimetric calculations) and monoisotopic masses (used for high-resolution mass spectrometry data interpretation). Residue masses represent the molecular weight contribution of each amino acid within a peptide chain (amino acid MW minus 18.015 Da for the water molecule lost during peptide bond formation). Hydrophobicity values follow the Kyte-Doolittle scale (Journal of Molecular Biology, 1982, DOI: 10.1016/0022-2836(82)90515-0), where positive values indicate hydrophobic and negative values indicate hydrophilic character. Side chain pKa values are from Pace, Grimsley, and Scholtz (Journal of Biological Chemistry, 2009, DOI: 10.1074/jbc.R800080200).

Amino Acid3-Letter1-LetterAvg MW (Da)Mono MW (Da)Residue MW (Da)pKa (Side)Hydro (KD)Classification
AlanineAlaA89.09489.04871.0371.8Non-polar, aliphatic
ArginineArgR174.203174.112156.10112.48−4.5Positively charged
AsparagineAsnN132.119132.053114.043−3.5Polar, uncharged
AspartateAspD133.104133.038115.0273.65−3.5Negatively charged
CysteineCysC121.159121.020103.0098.182.5Polar, uncharged
GlutamateGluE147.130147.053129.0434.25−3.5Negatively charged
GlutamineGlnQ146.146146.069128.058−3.5Polar, uncharged
GlycineGlyG75.03275.03257.021−0.4Non-polar, aliphatic
HistidineHisH155.156155.069137.0596.00−3.2Positively charged
IsoleucineIleI131.175131.095113.0844.5Non-polar, aliphatic
LeucineLeuL131.175131.095113.0843.8Non-polar, aliphatic
LysineLysK146.189146.106128.09510.53−3.9Positively charged
MethionineMetM149.208149.051131.0401.9Non-polar, aliphatic
PhenylalaninePheF165.192165.079147.0682.8Non-polar, aromatic
ProlineProP115.132115.06397.053−1.6Non-polar, aliphatic
SerineSerS105.093105.04387.032−0.8Polar, uncharged
ThreonineThrT119.119119.058101.048−0.7Polar, uncharged
TryptophanTrpW204.229204.090186.079−0.9Non-polar, aromatic
TyrosineTyrY181.191181.074163.06310.07−1.3Polar, aromatic
ValineValV117.148117.07999.0684.2Non-polar, aliphatic

KD = Kyte-Doolittle hydrophobicity index. Positive values indicate hydrophobic residues; negative values indicate hydrophilic residues. pKa values shown are for isolated amino acids; actual pKa values within a peptide may shift by 1-2 units depending on the local microenvironment.

Molecular Weight Calculations for Peptides

Calculating the theoretical molecular weight of a peptide from its amino acid sequence is a fundamental operation in peptide chemistry. The calculation uses the residue molecular weights from the table above and accounts for the water molecules lost during peptide bond formation through condensation. For a peptide containing n amino acid residues, the molecular weight is calculated as the sum of all n residue masses plus 18.015 Da for the terminal water molecule (representing the H at the N-terminus and the OH at the C-terminus).

Equivalently, the peptide molecular weight can be calculated by summing the full amino acid molecular weights and subtracting (n-1) × 18.015 Da, accounting for the (n-1) water molecules released during the formation of (n-1) peptide bonds. Both approaches yield identical results. For example, the tripeptide Ala-Gly-Val has a monoisotopic molecular weight of: 71.037 (Ala residue) + 57.021 (Gly residue) + 99.068 (Val residue) + 18.015 (terminal water) = 245.141 Da.

Post-translational and chemical modifications alter the calculated molecular weight and must be accounted for when comparing theoretical and observed masses on a COA. Common modifications include: N-terminal acetylation (+42.011 Da), C-terminal amidation (-0.984 Da, replacing OH with NH2), methionine oxidation to sulfoxide (+15.995 Da), disulfide bond formation (-2.016 Da per bond, loss of two hydrogen atoms), and asparagine deamidation (+0.984 Da). The mass of counter-ions (TFA: 113.993 Da; acetate: 59.013 Da) is not included in the peptide molecular weight but contributes to the gross weight of the lyophilized product.

The choice between monoisotopic and average molecular weight depends on the mass spectrometry instrument and the size of the peptide. For peptides below approximately 2,000 Da analyzed on high-resolution instruments (Orbitrap, Q-TOF), the isotope peaks are resolved and the monoisotopic mass is compared against the theoretical monoisotopic value. For larger peptides where individual isotope peaks are not resolved, the centroid of the isotope envelope yields the average molecular weight, which is compared against the theoretical average value. COA mass spectrometry data should specify which value is reported. For additional context on mass spectrometric methods, see our mass spectrometry for peptide analysis article.

Hydrophobicity and Chromatographic Behavior

Amino acid hydrophobicity directly determines the reversed-phase HPLC retention behavior of peptides, making it one of the most practically relevant physicochemical properties for analytical characterization. The Kyte-Doolittle hydrophobicity scale, published in 1982 (Journal of Molecular Biology, DOI: 10.1016/0022-2836(82)90515-0), assigns each amino acid a numerical value reflecting its relative hydrophobicity based on experimentally determined water-to-vapor transfer free energies and the interior/exterior distribution of residues in known protein structures.

On the Kyte-Doolittle scale, the most hydrophobic residues are isoleucine (+4.5), valine (+4.2), and leucine (+3.8), while the most hydrophilic residues are arginine (-4.5), lysine (-3.9), and asparagine/aspartate (-3.5). In reversed-phase HPLC, peptides with higher average hydrophobicity interact more strongly with the C18 stationary phase, requiring higher concentrations of organic solvent (typically acetonitrile) to elute. This means that more hydrophobic peptides have longer retention times and elute later in the gradient.

The relationship between peptide sequence and HPLC retention time has been modeled quantitatively. Mant et al. (Methods in Molecular Biology, 2007) demonstrated that retention time can be predicted with reasonable accuracy by summing the retention coefficients of individual amino acid residues, with corrections for chain length and nearest-neighbor effects. While detailed retention prediction is beyond the scope of this reference, the general principle is useful for evaluating COA chromatograms: a peptide with a highly hydrophobic sequence should elute at a relatively high percentage of organic solvent, and deviations from this expectation may indicate modifications or impurities.

Hydrophobicity also influences peptide solubility and aggregation propensity. Peptides with average Kyte-Doolittle values above +1.0 (calculated as the mean of all residue values in the sequence) tend to have limited aqueous solubility and may require organic co-solvents (DMSO, acetonitrile, or dilute acetic acid) for reconstitution. Such peptides are also more prone to aggregation through hydrophobic self-association. These practical considerations are relevant for peptide storage and handling. For information on how HPLC separates peptides based on these hydrophobic interactions, see our guide on HPLC purity testing.

Ionizable Side Chains and pKa Values

Seven of the twenty standard amino acids have side chains with ionizable functional groups: aspartate (carboxyl, pKa 3.65), glutamate (carboxyl, pKa 4.25), histidine (imidazole, pKa 6.00), cysteine (thiol, pKa 8.18), tyrosine (phenol, pKa 10.07), lysine (amino, pKa 10.53), and arginine (guanidinium, pKa 12.48). In addition, the N-terminal alpha-amino group (pKa approximately 8.0) and C-terminal alpha-carboxyl group (pKa approximately 3.1) are ionizable. These pKa values collectively determine the charge state of a peptide at any given pH.

The theoretical isoelectric point (pI) of a peptide is the pH at which the net charge is zero. It can be calculated from the pKa values of all ionizable groups in the sequence using the Henderson-Hasselbalch equation. At pH values below the pI, the peptide carries a net positive charge; above the pI, it carries a net negative charge. The pI is relevant for predicting solubility (minimum solubility occurs near the pI), electrophoretic mobility, and ion-exchange chromatographic behavior.

In the context of peptide degradation, the pKa of cysteine (8.18) is particularly significant because the thiolate anion (RS-) is the reactive species in disulfide exchange reactions. At pH 7.0, approximately 6% of cysteine side chains are deprotonated (thiolate form), while at pH 8.0, this increases to approximately 40%. This pH dependence explains why disulfide scrambling accelerates dramatically above pH 7 and why acidic storage conditions are recommended for disulfide-containing peptides, as detailed in our article on peptide degradation mechanisms.

Histidine is unique among the ionizable amino acids because its imidazole side chain has a pKa (6.00) near physiological pH, meaning that its protonation state changes significantly over the pH range commonly used in biological buffers. This property makes histidine-containing peptides sensitive to pH-dependent changes in charge, solubility, and chromatographic retention. The protonation of histidine also affects metal coordination, as the unprotonated imidazole nitrogen serves as a ligand for transition metal ions (Cu2+, Zn2+, Ni2+), which is relevant for metal-binding peptides in the research catalog such as copper-binding tripeptides.

Amino Acid Classification and Research Relevance

The twenty standard amino acids are classified based on the physicochemical properties of their side chains into several categories: non-polar aliphatic (Gly, Ala, Val, Leu, Ile, Pro, Met), aromatic (Phe, Trp, Tyr), polar uncharged (Ser, Thr, Cys, Asn, Gln), positively charged at pH 7 (Lys, Arg, His), and negatively charged at pH 7 (Asp, Glu). This classification reflects the chemical behavior of each residue and has direct implications for peptide properties including solubility, stability, chromatographic behavior, and reactivity.

Non-polar aliphatic residues contribute to the hydrophobic core of folded peptides and are the primary determinants of reversed-phase HPLC retention. Proline is unique within this group because its side chain forms a covalent bond with the backbone nitrogen, creating a pyrrolidine ring that constrains the backbone dihedral angles and introduces kinks or turns in the peptide chain. This structural rigidity makes proline a common residue at turn positions in biologically active peptide sequences.

Aromatic residues are significant for UV detection in analytical chemistry. Tryptophan absorbs strongly at both 214 nm (peptide bond) and 280 nm (indole ring, molar absorptivity approximately 5,500 M-1cm-1), while tyrosine absorbs at 280 nm (phenol ring, approximately 1,490 M-1cm-1). Phenylalanine has weak absorption at 280 nm (approximately 1 M-1cm-1). The presence or absence of aromatic residues determines whether a peptide can be detected and quantified at 280 nm. The molar extinction coefficient values were established by Pace et al. (Protein Science, 1995, DOI: 10.1002/pro.5560040423) and are widely used for spectrophotometric concentration determination.

From a stability perspective, certain residues serve as hotspots for specific degradation pathways. Methionine and tryptophan are primary targets for oxidation. Asparagine (especially in Asn-Gly motifs) and glutamine are susceptible to deamidation. Cysteine is involved in disulfide scrambling and air oxidation. Aspartate participates in acid-catalyzed peptide bond cleavage, particularly at Asp-Pro bonds. Knowledge of these susceptibilities, combined with the sequence of a specific research peptide, allows researchers to predict the most likely degradation products and evaluate COA data accordingly.

For additional resources on peptide structure and properties, explore our introduction to peptides, structural peptide database, and research glossary. Browse our full product catalog to view research peptides with complete sequence and molecular weight data documented on batch-specific Certificates of Analysis.

Related Compounds

Research Peptides with Full Sequence Data

All products include complete amino acid sequence documentation, molecular weight data, and batch-specific Certificates of Analysis.

BPC-157 5mg
BEST SELLER
99.1%
Out of Stock
peptides

BPC-157 5mg

C62H98N16O22

Body Protection Compound-157 (BPC-157) is a pentadecapeptide composed of 15 amino acids. It is a partial sequence of body protection compound derived from human gastric juice.

Out of Stock
$44.99
TB-500 2mg
BEST SELLER
98.7%
Out of Stock
peptides

TB-500 2mg

C212H350N56O78S

Thymosin Beta-4 (TB-500) is a naturally occurring peptide present in virtually all human and animal cells. It plays a critical role in tissue repair and regeneration.

Out of Stock
$32.99
PT-141 10mg
99.3%
Out of Stock
peptides

PT-141 10mg

C50H68N14O10

PT-141 (Bremelanotide) is a synthetic peptide analog of alpha-melanocyte stimulating hormone. It was developed from Melanotan II research.

Out of Stock
$38.99
GHK-Cu 100mg
BEST SELLER
98.8%
peptides

GHK-Cu 100mg

C14H23CuN6O4

GHK-Cu (Copper Peptide) is a naturally occurring copper complex of the tripeptide glycyl-L-histidyl-L-lysine.

In Stock
$59.99
IGF-1 LR3 1mg
BEST SELLER
98.2%
Out of Stock
proteins

IGF-1 LR3 1mg

C400H625N111O115S9

IGF-1 LR3 (Insulin-like Growth Factor-1 Long R3) is a modified version of IGF-1 with an extended half-life and increased potency.

Out of Stock
$89.99
FAQ

Frequently Asked Questions

The standard amino acid nomenclature uses both three-letter abbreviations (e.g., Ala, Gly, Val) and single-letter codes (e.g., A, G, V) to represent the 20 genetically encoded amino acids. The three-letter codes were established by the IUPAC-IUB Joint Commission on Biochemical Nomenclature and are typically used in written descriptions and COA documentation. Single-letter codes, also standardized by IUPAC-IUB, are used in compact sequence representations, database entries, and bioinformatics analysis. Some codes are intuitive (G for Glycine, L for Leucine), while others are assigned because the first letter was already taken (W for Tryptophan, Y for Tyrosine, K for Lysine).

Amino acid molecular weights are used to calculate the theoretical molecular weight of a peptide from its amino acid sequence. The calculation sums the residue masses (monoisotopic or average molecular weight of each amino acid minus 18.015 Da for water lost during peptide bond formation) for all residues and adds 18.015 Da for the terminal water molecule. The resulting theoretical molecular weight is compared against the observed molecular weight from mass spectrometry on the Certificate of Analysis to confirm peptide identity. Accurate molecular weight calculation is essential for verifying synthesis fidelity and detecting modifications such as oxidation (+16 Da) or deamidation (+1 Da).

The hydrophobicity index quantifies the relative hydrophobicity of each amino acid side chain, typically measured as the free energy of transfer from water to a non-polar environment. The Kyte-Doolittle scale, published in 1982, is the most widely used hydrophobicity scale in peptide science. Hydrophobicity matters for peptide research because it directly influences HPLC retention time (more hydrophobic peptides elute later in reversed-phase chromatography), solubility (peptides with high average hydrophobicity may require organic co-solvents for reconstitution), and aggregation propensity (hydrophobic stretches promote self-association).

The pKa value of an amino acid side chain represents the pH at which 50% of that group exists in its protonated form and 50% in its deprotonated form. Only seven of the twenty standard amino acids have ionizable side chains: aspartate (pKa 3.65), glutamate (pKa 4.25), histidine (pKa 6.00), cysteine (pKa 8.18), tyrosine (pKa 10.07), lysine (pKa 10.53), and arginine (pKa 12.48). These pKa values determine the charge state of a peptide at any given pH, which in turn affects solubility, chromatographic behavior, electrostatic interactions, and the rate of pH-dependent degradation reactions such as deamidation and disulfide exchange.

To calculate the theoretical molecular weight of a peptide: (1) look up the residue molecular weight (amino acid MW minus 18.015 Da for water) for each amino acid in the sequence; (2) sum all residue weights; (3) add 18.015 Da for the terminal water molecule (H at the N-terminus and OH at the C-terminus). For example, a dipeptide Ala-Gly would be: Ala residue (71.037 Da) + Gly residue (57.021 Da) + H2O (18.015 Da) = 146.073 Da (monoisotopic). Alternatively, sum the full amino acid molecular weights and subtract (n-1) times 18.015 Da, where n is the number of residues, which yields the same result. Post-translational or chemical modifications (acetylation, amidation, oxidation) must be accounted for by adding or subtracting the appropriate mass differences.

Essential amino acids are those that cannot be synthesized de novo by the organism under study and must be obtained from external sources. In mammalian biochemistry, the nine essential amino acids are histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine. Non-essential amino acids (alanine, asparagine, aspartate, glutamate, serine) can be synthesized endogenously. Conditionally essential amino acids (arginine, cysteine, glutamine, glycine, proline, tyrosine) are normally synthesized in adequate quantities but may become essential under specific physiological conditions. This classification is relevant to cell culture media formulation for in-vitro research applications.

Glycine is unique because its side chain is a single hydrogen atom, making it the smallest amino acid (molecular weight 75.032 Da) and the only achiral amino acid (all other standard amino acids have an L-stereocenter at the alpha carbon). The absence of a side chain gives glycine maximal conformational flexibility, allowing it to adopt backbone dihedral angles that are energetically unfavorable for other residues. This flexibility is exploited in peptide structures where tight turns or flexible linker regions are required. In the context of peptide degradation, glycine at the position immediately C-terminal to asparagine (Asn-Gly) creates the most susceptible site for deamidation due to the minimal steric hindrance of the glycine side chain.

Research Use Disclaimer

This article is provided for educational and informational purposes only and is intended for qualified researchers and laboratory professionals. The content discusses amino acid properties and peptide chemistry strictly within the context of research compound characterization for in-vitro laboratory applications. All compounds referenced herein are intended for research use only (RUO) and are not intended for human consumption, diagnostic, or any clinical application. CrestBioLabs makes no claims regarding the suitability of any compound for purposes beyond scientific research. Always consult relevant institutional guidelines, safety data sheets, and applicable regulations before handling research compounds.