Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 1081355
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 16, 20262026-05-16T22:09:31+00:00 2026-05-16T22:09:31+00:00

I am trying to parse a line in a mmCIF Protein file into separate

  • 0

I am trying to parse a line in a mmCIF Protein file into separate tokens using Excel 2000/2003. Worst case it COULD look something like this:

token1 token2 "token's 1a',1b'" 'token4"5"' 12 23.2 ? . 'token' tok'en to"ken

Which should become the following tokens:

token1  
token2  
token's 1a',1b' (note: the double quotes have disappeared)  
token4"5" (note: the single quotes have disappeared)  
12  
23.2  
?  
.  
token (note: the single quotes have disappeared)  
to'ken  
to"ken  

I am looking to see if a RegEx is even possible to split this kind of line into tokens?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-16T22:09:31+00:00Added an answer on May 16, 2026 at 10:09 pm

    Nice puzzle. Thanks.

    This pattern (aPatt below) gets the tokens separated, but I can’t figure how to remove the outer quotes.

    tallpaul() produces:

     token1
     token2
     "token's 1a',1b'"
     'token4"5"'
     12
     23.2
     ?
     .
     'token'
     tok'en
     to"ken
    

    If you can figure out how to lose the outer quotes, please let us know.
    This needs a reference to “Microsoft VBScript Regular Expressions” to work.

    Option Explicit
    ''returns a list of matches
    Function RegExpTest(patrn, strng)
       Dim regEx   ' Create variable.
       Set regEx = New RegExp   ' Create a regular expression.
       regEx.Pattern = patrn   ' Set pattern.
       regEx.IgnoreCase = True   ' Set case insensitivity.
       regEx.Global = True   ' Set global applicability.
       Set RegExpTest = regEx.Execute(strng)   ' Execute search.
    End Function
    
    Function tallpaul() As Boolean
        Dim aString As String
        Dim aPatt As String
        Dim aMatch, aMatches
    
        '' need to pad the string with leading and trailing spaces.
        aString = " token1 token2 ""token's 1a',1b'"" 'token4""5""' 12 23.2 ? . 'token' tok'en to""ken "
        aPatt = "(\s'[^']+'(?=\s))|(\s""[^""]+""(?=\s))|(\s[\w\?\.]+(?=\s))|(\s\S+(?=\s))"
        Set aMatches = RegExpTest(aPatt, aString)
    
        For Each aMatch In aMatches
              Debug.Print aMatch.Value
        Next
        tallpaul = True
    End Function
    
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm trying to parse a 6,000 line 500 KB file into an array so
I'm trying to parse an INI file using C++. Any tips on what is
I am trying to parse line items from mtl file and use the values
I'm trying to parse command line arguments in an F# application. I'm using pattern
I'm trying to parse C-style multi-line comments in my flex (.l) file: %s ML_COMMENT
I'm trying to parse an xhml file using SAXParser in Java, but gets an
I am trying to parse JSON in an Adobe Flex app, using http://www.mikechambers.com/blog/2006/03/28/tutorial-using-json-with-flex-2-and-actionscript-3/'>This Tutorial
I'm trying to parse an array of JSON objects into an array of strings
I'm trying to parse some JSON using the JSon.Net library. The documentation seems a
I am trying to parse a multi line string and get the rest of

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.