The Archive Base


Editorial Team
Asked: June 6, 2026


I have a program that creates multiple text files of RDF triples. I need to compare the triples and do it fast. What is the best way to do this? I thought of putting the triples into an array and comparing them, but there could be hundreds of thousands of triples per file, and that would take forever. I need it to be as close to real time as possible, since triples will be generated constantly among the files. Any help would be great. The files are also in AllegroGraph repositories, if it's easier to compare them there somehow.

A thought: if I stored the triples in Excel (one triple per row, one sheet per repository):

A: How could I find the duplicates among the sheets?
B: Would it be fast?
C: How could I automate that from Java?


1 Answer

  1. Editorial Team
    Added an answer on June 6, 2026 at 1:35 pm

    Build a master index that stores each triple along with the number of files it appears in, the exact file names, and the triple's location within each file. You can then search the master index to answer your queries in real time.

    Whenever you update, delete, or create RDF files, update the master index as well.

    Store the master index in a form that can be both updated and searched efficiently.
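
As a rough illustration of the bookkeeping described above, here is a minimal in-memory sketch in Java. It is only a stand-in to show the shape of the index (triple, files, position in each file), not a recommendation to keep everything in memory. The class names `MasterIndex` and `TripleLocation`, the method names, and the use of a line number as the "location" are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical value type: where one occurrence of a triple lives.
class TripleLocation {
    final String fileName;
    final int lineNumber; // 1-based line within the file (an assumption)

    TripleLocation(String fileName, int lineNumber) {
        this.fileName = fileName;
        this.lineNumber = lineNumber;
    }
}

// Hypothetical in-memory master index: triple text -> every place it occurs.
class MasterIndex {
    private final Map<String, List<TripleLocation>> locationsByTriple = new HashMap<>();

    // Index (or re-index) a file; triples are given one per line, in file order.
    void addFile(String fileName, List<String> triples) {
        removeFile(fileName); // drop stale entries if the file was indexed before
        for (int i = 0; i < triples.size(); i++) {
            locationsByTriple
                .computeIfAbsent(triples.get(i), k -> new ArrayList<>())
                .add(new TripleLocation(fileName, i + 1));
        }
    }

    // Remove every entry that came from the given file.
    void removeFile(String fileName) {
        Iterator<List<TripleLocation>> it = locationsByTriple.values().iterator();
        while (it.hasNext()) {
            List<TripleLocation> locations = it.next();
            locations.removeIf(loc -> loc.fileName.equals(fileName));
            if (locations.isEmpty()) {
                it.remove(); // triple no longer appears anywhere
            }
        }
    }

    // All recorded occurrences of a triple (empty list if unknown).
    List<TripleLocation> locations(String triple) {
        List<TripleLocation> found = locationsByTriple.get(triple);
        return found == null ? new ArrayList<>() : new ArrayList<>(found);
    }

    // Number of distinct files containing the triple.
    int fileCount(String triple) {
        Set<String> files = new HashSet<>();
        for (TripleLocation loc : locations(triple)) {
            files.add(loc.fileName);
        }
        return files.size();
    }

    // Triples that appear in at least minFiles distinct files, sorted.
    Set<String> triplesInAtLeast(int minFiles) {
        Set<String> result = new TreeSet<>();
        for (String triple : locationsByTriple.keySet()) {
            if (fileCount(triple) >= minFiles) {
                result.add(triple);
            }
        }
        return result;
    }
}
```

Calling `triplesInAtLeast(2)` after indexing two files returns the triples the files share. A persistent store would replace the `HashMap` while keeping the same three operations: add a file, remove a file, query.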

    A simple choice is to store the master index in a relational database (such as MySQL). It can answer queries like finding the triples common to several files with a simple SELECT statement, for example: select * from rdfindex where triplecount > 1.

    EDIT: Keeping hundreds of thousands of triples per file in memory in a HashMap or similar data structure does not scale well as the files keep growing. That is why I suggested a database, which can store the data and respond to your queries efficiently. You can look at an embedded database such as SQLite to store the data.

    Read up on these topics:

    How to create an SQLite database, create tables, and access them. Create a simple table to store triple, triplecount, and filenames.

    Convert all your Excel files to CSV files. You can use opencsv to parse the files in Java (check out the samples that come with opencsv).

    Parse the CSV files and load the data into SQLite. If a triple is already in the database, just update its count; if not, insert it.
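
The table and the update-or-insert step above could be sketched with plain JDBC roughly as follows. This is an illustration under assumptions, not a definitive implementation: the table name `rdfindex` and the columns `triple`, `triplecount`, and `filenames` follow the answer, while the class name `RdfIndexDb`, the column types, the comma-separated `filenames` format, and the rule that each triple is recorded at most once per file are assumptions made for this example.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper around the rdfindex table described in the answer.
class RdfIndexDb {
    // One row per distinct triple; column types are an assumption.
    static final String CREATE_TABLE =
        "CREATE TABLE IF NOT EXISTS rdfindex ("
        + "triple TEXT PRIMARY KEY, "
        + "triplecount INTEGER NOT NULL, "
        + "filenames TEXT NOT NULL)";

    // Bump the count and append the file name for a triple already indexed.
    static final String UPDATE_EXISTING =
        "UPDATE rdfindex SET triplecount = triplecount + 1, "
        + "filenames = filenames || ',' || ? WHERE triple = ?";

    static final String INSERT_NEW =
        "INSERT INTO rdfindex (triple, triplecount, filenames) VALUES (?, 1, ?)";

    static final String SELECT_COMMON =
        "SELECT triple FROM rdfindex WHERE triplecount >= ?";

    static void createTable(Connection conn) throws SQLException {
        try (Statement st = conn.createStatement()) {
            st.executeUpdate(CREATE_TABLE);
        }
    }

    // "If the triple is already in the database, update the count; if not, insert it."
    // Assumes the caller records each triple at most once per file.
    static void recordTriple(Connection conn, String triple, String fileName)
            throws SQLException {
        try (PreparedStatement update = conn.prepareStatement(UPDATE_EXISTING)) {
            update.setString(1, fileName);
            update.setString(2, triple);
            if (update.executeUpdate() > 0) {
                return; // an existing row was updated
            }
        }
        try (PreparedStatement insert = conn.prepareStatement(INSERT_NEW)) {
            insert.setString(1, triple);
            insert.setString(2, fileName);
            insert.executeUpdate();
        }
    }

    // Triples recorded for at least minFiles files.
    static List<String> commonTriples(Connection conn, int minFiles) throws SQLException {
        List<String> result = new ArrayList<>();
        try (PreparedStatement ps = conn.prepareStatement(SELECT_COMMON)) {
            ps.setInt(1, minFiles);
            try (ResultSet rs = ps.executeQuery()) {
                while (rs.next()) {
                    result.add(rs.getString(1));
                }
            }
        }
        return result;
    }
}
```

To run this against SQLite you would need a JDBC driver on the classpath. Assuming the Xerial sqlite-jdbc driver, a connection can be opened with `DriverManager.getConnection("jdbc:sqlite:rdfindex.db")`, where the file name is a placeholder. Wrapping a whole file's worth of `recordTriple` calls in one transaction (`conn.setAutoCommit(false)` followed by `conn.commit()`) is usually much faster than committing row by row.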


© 2021 The Archive Base. All Rights Reserved