The Archive Base

Editorial Team
Asked: June 18, 2026

I have two projects each with a massive code base. I’d like to run

I have two projects, each with a massive code base. I’d like to run a tool that goes through all the files in both projects and shows me which files across the projects contain similar code. I’m not even sure if anything like this exists, but I remember that when I was in school, teachers ran a tool on the code submitted by multiple students to measure how similar it was (to catch cheaters).

1 Answer

  1. Editorial Team
    Added an answer on June 18, 2026 at 9:35 pm

    What you want is a clone detection tool. These tools find code that is duplicated across any set of files. For your task, you’d take the files from both projects and run clone detection across that combined set.

    [EDIT 2019 based on real experience doing exactly what OP wants to do].

    If a clone found in a file from one project corresponds to a clone found in a file from the other project, you’ve found what the two projects have in common.

    A drawback of running straight clone detection across all files from both projects is that you will also find many clones that lie entirely within one project. Those aren’t interesting for your question; in effect, they are false positives.
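
    As a rough illustration of keeping only cross-project matches, here is a minimal Python sketch. It is not how any particular tool works: it simply hashes sliding windows of whitespace-stripped lines and keeps the hashes that occur in both projects. The names `cross_project_clones`, `load_project`, and `WINDOW` are hypothetical, and the window size of 5 lines is an arbitrary assumption.

```python
import hashlib
from collections import defaultdict
from pathlib import Path

WINDOW = 5  # hypothetical minimum clone size, counted in non-blank lines


def normalized_lines(text):
    """Strip surrounding whitespace and drop blank lines so layout is ignored."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def window_hashes(text, window=WINDOW):
    """Yield (start, digest) for every run of `window` normalized lines.

    `start` is an index into the normalized lines, not an original line number.
    """
    lines = normalized_lines(text)
    for i in range(len(lines) - window + 1):
        chunk = "\n".join(lines[i:i + window])
        yield i, hashlib.sha1(chunk.encode("utf-8")).hexdigest()


def cross_project_clones(project_a, project_b, window=WINDOW):
    """Return {digest: (locations_in_a, locations_in_b)} for shared windows.

    project_a and project_b each map a file name to its source text.
    """
    seen = {"a": defaultdict(list), "b": defaultdict(list)}
    for label, files in (("a", project_a), ("b", project_b)):
        for name, text in files.items():
            for start, digest in window_hashes(text, window):
                seen[label][digest].append((name, start))
    # Keep only digests present in BOTH projects; clones that live entirely
    # inside one project are dropped at this point.
    shared = seen["a"].keys() & seen["b"].keys()
    return {d: (seen["a"][d], seen["b"][d]) for d in shared}


def load_project(root, pattern="*.py"):
    """Read every file under `root` matching `pattern` into a dict."""
    return {str(p): p.read_text(errors="ignore") for p in Path(root).rglob(pattern)}
```

    Calling `cross_project_clones(load_project(dir_a), load_project(dir_b))` on two directory paths would then list only the fragments the two code bases share. Exact line hashing like this misses clones with renamed variables or edited lines, which is where a dedicated detector earns its keep.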

    My company provides a commercial clone detector called CloneDR. It is (IMHO) an extremely good detector and will find clones that other detectors cannot (e.g. it isn’t fooled by changes to comments, code layout, or number radixes, by variable renames, or even by the insertion or deletion of code fragments). It also has one other very nice property: an option to detect clones only across two project code bases, so you won’t get the false positives you’d get by treating the two projects as one.
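
    To give a feel for what “not fooled by comments, layout, radixes, or renames” can mean in practice, here is a small Python sketch that normalizes source text before comparison. It says nothing about CloneDR’s internals; it is an assumption-laden illustration that uses Python’s standard `tokenize` module and only handles Python source. The function name `normalize_tokens` is hypothetical.

```python
import io
import keyword
import tokenize


def normalize_tokens(source):
    """Return a token list that ignores comments, spacing, radix, and names."""
    names = {}  # original identifier -> placeholder, in order of first use
    out = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in (tokenize.COMMENT, tokenize.NL, tokenize.ENDMARKER):
            continue  # comments and blank lines carry no code
        if tok.type == tokenize.INDENT:
            out.append("INDENT")  # keep block structure, drop the exact width
        elif tok.type == tokenize.DEDENT:
            out.append("DEDENT")
        elif tok.type == tokenize.NEWLINE:
            out.append("NEWLINE")
        elif tok.type == tokenize.NAME and not keyword.iskeyword(tok.string):
            # Consistent renaming: the same identifier always maps to the same ID.
            out.append(names.setdefault(tok.string, f"ID{len(names)}"))
        elif tok.type == tokenize.NUMBER:
            try:
                out.append(str(int(tok.string, 0)))  # 0x10 and 16 become "16"
            except ValueError:
                out.append(tok.string)  # floats and complex literals stay as written
        else:
            out.append(tok.string)
    return out
```

    Two fragments whose normalized token lists are equal differ only in comments, indentation width, integer radix, or a consistent renaming of identifiers. Tolerating inserted or deleted statements needs a fuzzier comparison than plain equality, which this sketch does not attempt.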

© 2021 The Archive Base. All Rights Reserved