I have a text file that contains protein sequences. I would like to get the total number of residues in each sequence. How can I do this with awk?
>1GS9
PYCPAAVIAPVV
>1LE2
DFEFAKWKN
>1NFN
ADAPPDS
Desired output
1GS9 - 12
1LE2 - 9
1NFN - 7
Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
Read every odd line automatically with
{ ... }and proteins in even lines withgetline:Output: