Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 120687
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 11, 20262026-05-11T03:51:03+00:00 2026-05-11T03:51:03+00:00

I am looking for a way to extract / scrape data from Word files

  • 0

I am looking for a way to extract / scrape data from Word files into a database. Our corporate procedures have Minutes of Meetings with clients documented in MS Word files, mostly due to history and inertia.

I want to be able to pull the action items from these meeting minutes into a database so that we can access them from a web-interface, turn them into tasks and update them as they are completed.

Which is the best way to do this:

  1. VBA macro from inside Word to create CSV and then upload to the DB?
  2. VBA macro in Word with connection to DB (how does one connect to MySQL from VBA?)
  3. Python script via win32com then upload to DB?

The last one is attractive to me as the web-interface is being built with Django, but I’ve never used win32com or tried scripting Word from python.

EDIT: I’ve started extracting the text with VBA because it makes it a little easier to deal with the Word Object Model. I am having a problem though – all the text is in Tables, and when I pull the strings out of the CELLS I want, I get a strange little box character at the end of each string. My code looks like:

sFile = 'D:\temp\output.txt' fnum = FreeFile Open sFile For Output As #fnum  num_rows = Application.ActiveDocument.Tables(2).Rows.Count  For n = 1 To num_rows     Descr = Application.ActiveDocument.Tables(2).Cell(n, 2).Range.Text     Assign = Application.ActiveDocument.Tables(2).Cell(n, 3).Range.Text     Target = Application.ActiveDocument.Tables(2).Cell(n, 4).Range.Text     If Target = '' Then         ExportText = ''     Else         ExportText = Descr & Chr(44) & Assign & Chr(44) & _             Target & Chr(13) & Chr(10)         Print #fnum, ExportText     End If Next n  Close #fnum 

What’s up with the little control character box? Is some kind of character code coming across from Word?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. 2026-05-11T03:51:03+00:00Added an answer on May 11, 2026 at 3:51 am

    Word has a little marker thingy that it puts at the end of every cell of text in a table.

    It is used just like an end-of-paragraph marker in paragraphs: to store the formatting for the entire paragraph.

    Just use the Left() function to strip it out, i.e.

     Left(Target, Len(Target)-1)) 

    By the way, instead of

     num_rows = Application.ActiveDocument.Tables(2).Rows.Count  For n = 1 To num_rows       Descr = Application.ActiveDocument.Tables(2).Cell(n, 2).Range.Text 

    Try this:

     For Each row in Application.ActiveDocument.Tables(2).Rows       Descr = row.Cells(2).Range.Text 
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I am looking for a way to extract an icon from a .exe file
I'm looking for a way to extract the audio part of a FLV file.
I'm looking for way to PHP to detect if a script was run from
I'm looking for some way to effectively hide inherited members. I have a library
While dealing with JSON returned from YQL, I found myself looking for a way
I'm looking for a way to extract the essence of a signature in Java.
I am looking for a way to extract a nested inner class so that
I'm looking for a way that I can extract the first letter of each
Looking for a way of getting the path from an URL in PHP: I
I am using the jquery-ui-dialog plugin I am looking for way to refresh the

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.