Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6538395
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 25, 20262026-05-25T10:43:50+00:00 2026-05-25T10:43:50+00:00

I have a dual socket 8 core processor, that is, each processor has 4-cores

  • 0

I have a dual socket 8 core processor, that is, each processor has 4-cores in it. I haven’t seen its specification completely, but I think that a separate memory bank is attached to each processor in a ccNUMA fashion and therefore accessing from memory bank of another processor is relatively slow. Also they have different L3 caches I suppose.

Now my question is what is the fastest way to share data between the two processors. Simple shared memory will have the problem due to ccNUMA and cache coherency. Is there any way which is very fast?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-25T10:43:51+00:00Added an answer on May 25, 2026 at 10:43 am

    That would depend highly on the nature of what you’re trying to implement. From what I’ve seen, it’s usually possible to do better with a very tightly managed shared memory approach than to resort to MPI. (because it’s possible to do a lot more with shared-memory)

    However, it’s harder to go completely wrong with MPI since there’s a lot less guess work to why X performs well or not.

    Here are some common approaches using shared memory:

    Read-Only data: If the data is small enough, it might be best to duplicate it across all the nodes.

    If your memory access has extremely high spatial locality that doesn’t “migrate” around, organize your data such that each “group” of spatial locality access is on the same node.

    If your memory access pattern exhibits high temporal locality, but not enough spatial locality to fit into cache, then consider copying the data into a local buffer. Once the work is done, copy it back. This lets you keep the same structure of the program.

    EDIT: Consider adding the “NUMA” tag to your question.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

We have an application that uses a dual monitor setup - User A will
Even though I have a robust and fast computer (Pentium Dual Core 2.0 with
I have a simple screensaver that I've written, which has been deployed to all
My openMP version did not give any speed boost. I have a dual core
Currently we have developed application using Java 6 based on windows 32bit(Dual core &
Have you ever seen any of there error messages? -- SQL Server 2000 Could
On Unix one can have single socket used for both IPv6 and IPv4 (
I have a couple of stored procedures on SQL Server 2005 that I've noticed
I have put together a small ASP.NET MVC 2 site that does some very
I have a process that feeds a piece of hardware (data transmission device) with

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.