Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 3235548
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 17, 20262026-05-17T17:31:09+00:00 2026-05-17T17:31:09+00:00

I’m looking at accepting a project that would require me to clean up an

  • 0

I’m looking at accepting a project that would require me to clean up an existing e-commerce website. Its been relatively successful and has over 100,000 individual products – loaded both by the client and its publishers.

The site wasn’t originally designed for this many products and has become fairly disorganized.

SO, the client has asked I look at a more robust search option – filterable and so forth. I completely agree it needs to be improved, but after looking at the database, I can tell that there are dozens and dozens of categories and not everything is labeled correctly etc.

Is there any database management software that could help me clean up 100,000 entries quickly? Make categories consistent – fix uppercase/lowercase problems etc.

Are there any companies out there that I can source just this particular part of the project to?

Its a massive amount of data-entry. If I spent 2 minutes per product, it would take me 6 months full time to just to complete the database cleanup. I either need to get it down to a matter of seconds per product or find a company that specializes in this type of work.

I don’t even know what to search for on Google.

Thanks guys!

—

Thanks everyone for your ideas! I have a lot of options now so I feel a lot more comfortable heading in to this project. Right now I think the direction we will go is to build a tool that allows the client to hire data entry people that can update it as necessary. Then I will work as a consultant, taking care of any UPDATE-WHERE type functions as necessary.

Thanks again!

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-17T17:31:10+00:00Added an answer on May 17, 2026 at 5:31 pm

    In any case you’ll probability need to do more than “clean the data”, which means you’ll need to build new normalized tables. So start there, build a new database that is fully normalized, import the data “as is”, with all the duplicate categories, etc.

    for example, new tables:

    Items
    ItemID     int identity/auto number
    ItemName   string
    CategoryID int
    ....
    
    Categories
    CategoryID    int identity/auto number
    CategoryName  string
    ....
    

    import the bad data into the new system:

    Items
    ItemID  ItemName CategoryID
    1       thing A  1
    2       thing B  2
    3       thing C  3
    4       thing D  1
    
    Categories
    CategoryID CategoryName
    1          Game
    2          food
    3          games
    

    now, you can consolidate the data using the PKs

    UPDATE Items
        SET CategoryID=1
        WHERE CategoryID=3
    
    DELETE Categories
        WHERE CategoryID=3
    

    You might just write an application where the customer can do the consolidation. Let them select the duplicates on a screen and merge to a selected parent category. you have this application do the merge sql from above.

    If there are issues of needing to have a clean cut over date, create an application that generates a series of “Map” tables, where you store the CategoryNameOld=”games” and the CategoryNameNew=”Game” and use these when you do the conversion/load of the bad data into the new system’s tables.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

No related questions found

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.