The Archive Base

Editorial Team
Asked: May 14, 2026

I need some help in solving this problem.

We have a large number of documents from one specific domain. These documents come from different sources, so their structure can vary widely. On the other hand, I have a table with a fixed set of fields, and some of its figures have to be filled in with values extracted from the documents.

For example:

Company x had a business volume of
$20mio in 2010. $1,000,000 was the exchange of
company y this year.

The result should look something like this:

|| Company | Year | Volume
||  X      | 2010 | 20,000,000
||  Y      | 2010 |  1,000,000

Can you please point me to some links or topics where I can find further information on how to solve such a problem?

I know that there is no out-of-the-box solution for this, but where should I start looking?

Thanks in advance.

1 Answer

  1. Editorial Team
    Added an answer on May 14, 2026 at 12:07 am

    OK. There are entire computer science labs devoted to that sort of stuff!
    Maybe start by looking at a tool called RapidMiner.

    Also, here are a couple of research papers I have as PDFs (sadly, I don't have links for them anymore):

    1. Automated Understanding of Financial Statements Using Neural Networks and Semantic Grammars

    James Markovitch
    Dun & Bradstreet, Search Technologies
    April 1995
    Email: jsmarkovitch@yahoo.com
    Copyright © 1995 James Markovitch

    2. An Integrated Approach for Automatic Semantic Structure Extraction in Document Images

    Margherita Berardi, Michele Lapi, and Donato Malerba
    Dipartimento di Informatica – Università degli Studi di Bari
    via Orabona 4 – 70126 Bari
    {berardi,lapi,malerba}@di.uniba.it

    I think the first one would be of greatest interest for what you are after, though I am not quite sure how much value it will have 🙂
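
To make the task in the question concrete, here is a minimal sketch that pulls (company, year, volume) rows out of the two example sentences using only regular expressions. It is not a general solution: the names `Row`, `MULTIPLIERS` and `extract_rows` are hypothetical, the patterns assume that a company is always introduced by the word "company", that amounts are written with a leading `$`, and that a sentence without an explicit year (such as "this year") inherits the most recent year seen. Real documents from mixed sources would likely need proper named-entity recognition instead of hand-written patterns.

```python
import re
from typing import List, NamedTuple, Optional


class Row(NamedTuple):
    company: str
    year: Optional[int]
    volume: int


# Assumed magnitude suffixes; "mio" is the abbreviation used in the question.
MULTIPLIERS = {"k": 1_000, "m": 1_000_000, "mio": 1_000_000,
               "million": 1_000_000, "bn": 1_000_000_000}

COMPANY_RE = re.compile(r"\bcompany\s+(\w+)", re.IGNORECASE)
AMOUNT_RE = re.compile(
    r"\$\s*(\d[\d,]*(?:\.\d+)?)\s*(mio|million|bn|m|k)?\b", re.IGNORECASE)
YEAR_RE = re.compile(r"\b((?:19|20)\d{2})\b")


def extract_rows(text: str) -> List[Row]:
    """Return one Row per sentence that names a company and a dollar amount."""
    rows = []
    last_year = None  # carried forward for phrases like "this year"
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        company = COMPANY_RE.search(sentence)
        amount = AMOUNT_RE.search(sentence)
        if not (company and amount):
            continue
        # Search for the year outside the amount so digits are not confused.
        remainder = sentence[:amount.start()] + " " + sentence[amount.end():]
        year = YEAR_RE.search(remainder)
        if year:
            last_year = int(year.group(1))
        number = float(amount.group(1).replace(",", ""))
        suffix = (amount.group(2) or "").lower()
        volume = int(round(number * MULTIPLIERS.get(suffix, 1)))
        rows.append(Row(company.group(1).upper(), last_year, volume))
    return rows


TEXT = ("Company x had a business volume of $20mio in 2010. "
        "$1,000,000 was the exchange of company y this year.")

for row in extract_rows(TEXT):
    print(row)
```

On the example text this reads `$20mio` as 20,000,000 and assigns 2010 to company Y because the second sentence has no year of its own. Patterns like these break quickly once wording varies, which is where the tools and papers mentioned above come in.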

© 2021 The Archive Base. All Rights Reserved