Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6647925
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 26, 20262026-05-26T00:35:43+00:00 2026-05-26T00:35:43+00:00

Let’s say I have a database full of Tag objects. Each Tag has an

  • 0

Let’s say I have a database full of Tag objects. Each Tag has an id and a name. In the beginning of making the database I allowed for case sensitive Tags, however, I later realized I didn’t need/want this capability, so I started forcing each name to be lowercase before storing the Tag.

Now I have all these remnants of different names which would now be stored under the same Tag but previously weren’t. For example,

Trendy, trendy
NotHalfBad, Nothalfbad, nothalfbad
SQL, sql, Sql

I am using Python and SQLAlchemy. I have created a function to clean up this mess that looks something like this:

todelete = []

for t1 in Session.query(Tag):
    if t1 not in todelete: # If we haven't already encountered this tag
        for t2 in Session.query(Tag).filter_by(name_insensitive=t1.name):
            if t1.id != t2.id:
                merge(t1,t2) # Calls a function I made that merges the two tags
                todelete.append(tag)
                Session.commit()

# Mark everything for deletion
for tag in todelete:
    Session.delete(tag)

# Now commit the deletes
Session.commit()

This is horribly inefficient. Is there a better way?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-26T00:35:44+00:00Added an answer on May 26, 2026 at 12:35 am

    If it’s a tool for one-time use, do you really have to care about efficiency? Just let it run for a minute (or several), rather than spending even more time optimizing it.

    That being said, queries are more expensive than Python loops, so loading all the Tags into a list first, and then looping over that list both times, should speed things up:

    for t1 in tags:
        if t1 not in todelete:
            for t2 in tags:
                if t2.name_insensitive == t1.name:
                    merge(t1,t2)
                    todelete.append(tag)
    

    Also, remove the commit call in the loop. Not only is it expensive, but if some other process changes the DB, the list of tags you’re looping over could get out of sync.

    Of course, the a proper way to make things more efficient is profiling first, and then concentrating on specific problems. You should do that if you’re serious about performance.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Let's say I have window.open (without name parameter), scattered in my project and I
Let's say I have two assemblies: BusinessLogic and Web. BusinessLogic has an application setting
Let's say I have the following models class Photo(models.Model): tags = models.ManyToManyField(Tag) class Tag(models.Model):
Let's say I'm building a data access layer for an application. Typically I have
Let's say you have a class called Customer, which contains the following fields: UserName
Let's say we have a simple function defined in a pseudo language. List<Numbers> SortNumbers(List<Numbers>
Let's say I have a drive such as C:\ , and I want to
Let's say that we have an ARGB color: Color argb = Color.FromARGB(127, 69, 12,
Let's say I have two tables orgs and states orgs is (o_ID, state_abbr) and
Let's say on a page I have alot of this repeated: <div class=entry> <h4>Magic:</h4>

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.