Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 468513
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 12, 20262026-05-12T23:41:22+00:00 2026-05-12T23:41:22+00:00

I am connecting to a sockets API that is very inflexible. It will return

  • 0

I am connecting to a sockets API that is very inflexible. It will return rows such as:

NAME, CITY, STATE, JOB, MONTH

But will have duplicates because it does not do any aggregation. I need to count the duplicate rows (which would be very easy in SQL, but not, as far as I know, in Java).

Example source data:

NAME,     CITY, STATE, JOB,         MONTH
John Doe, Denver, CO, INSTALLATION, 090301
John Doe, Denver, CO, INSTALLATION, 090301
John Doe, Denver, CO, INSTALLATION, 090301
Jane Doe, Phoenix, AZ, SUPPORT, 090301

Intended:

    NAME,    CITY, STATE,          JOB,  MONTH, COUNT
John Doe,  Denver,    CO, INSTALLATION, 090301,   3
Jane Doe, Phoenix,    AZ,      SUPPORT, 090301,   1

I can easily do this for approximately 100,000 return rows, but I am dealing with about 60 million in a month. Any ideas?

Edit: Unfortunately, the rows are not returned sorted… nor is there an option through the API to sort them. I get this giant mess of stuff that needs to be aggregated. Right now I use an ArrayList and do indexOf(new row) to find if the item already exists, but it gets slower the more rows that there are.

Edit: For clarification, this would only need to be run once a month, at the end of the month. Thank you for all of the responses

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-12T23:41:22+00:00Added an answer on May 12, 2026 at 11:41 pm

    You could use a HashSet to store the previous row with the same contents. (assuming your Row objects have proper .hashValue() and .equals() methods implemented.

    Something like this perhaps:

    Set<Row> previousRows = new HashSet<Row>();
    List<Row> rowsInOrder = new LinkedList<Row>();
    

    Then in use (assuming further that you have an incrementCount() method to the Row class):

    Row newRow = getNextRow();
    if(!previousRows.contains(newRow)){
        previousRows.put(newRow);
        rowsInOrder.add(newRow);
    } 
    previousRows.get(newRow).incrementCount();
    

    If you don’t care about the order in which the rows came in, you can get rid of the List and just use the Set.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Ask A Question

Stats

  • Questions 234k
  • Answers 234k
  • Best Answers 0
  • User 1
  • Popular
  • Answers
  • Editorial Team

    How to approach applying for a job at a company ...

    • 7 Answers
  • Editorial Team

    How to handle personal stress caused by utterly incompetent and ...

    • 5 Answers
  • Editorial Team

    What is a programmer’s life like?

    • 5 Answers
  • Editorial Team
    Editorial Team added an answer There are lots of instructions on how to use JQuery.… May 13, 2026 at 5:59 am
  • Editorial Team
    Editorial Team added an answer it's not pretty, but you can try this in your… May 13, 2026 at 5:59 am
  • Editorial Team
    Editorial Team added an answer Write code Read books, http://www.coderholic.com/free-python-programming-books/ Read code Read tutorials, http://www.dabeaz.com/talks.html,… May 13, 2026 at 5:59 am

Related Questions

I have a C# module responsible for acquiring the list of network adapters that
Most of the applications I've seen that use TCP, do roughly the following to
I am looking to optimize a process that runs continually and makes frequent calls
I have a Win32 application that uses boost::asio and openssl library but it seems

Trending Tags

analytics british company computer developers django employee employer english facebook french google interview javascript language life php programmer programs salary

Top Members

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.