Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 9232529
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 18, 20262026-06-18T06:20:10+00:00 2026-06-18T06:20:10+00:00

Hi I am profiling with VTUNE (an intel visual studio extension) my 2D numerical

  • 0

Hi I am profiling with VTUNE (an intel visual studio extension) my 2D numerical model I wrote for my research, in order to speed it up a little. I already sped up my 1D model this way (i.e. identifying the “hotspot” of my model). This time though, after running the profiler I see that the most time consuming part is not a fortran subroutine I wrote (as it occured for my 1D model) but it is a dll called Acxtrnal.dll. I googled the name of this dll but I could not find better information. Does anybody know why this dll is taking so much and what it is needed for?
thanks
A.

EDIT: So I was able to add download the symbols for the DLL from Microsoft website so now when debugging it shows that the CPU time is lost here.
NS_FaultTolerantHeap::APIHook_RtlFreeHeap. If I expand it shows (uppercase subroutines are mine):

free<-for__free_vm
for_write_int_fmt_xmit<-for_write_int_fmt<-LIMITERSUBR<-RECMUSCL<-MAIN__<-main<-_tmainCRTStartup<-BaseThreadInitThunk<-RtlUserThreadStart<-RtlUserThreadStart
for
_release_lun<-for_write_int_fmt_xmit<-for_write_int_fmt<-LIMITERSUBR<-RECMUSCL<-MAIN
<-main<-tmainCRTStartup<-BaseThreadInitThunk<-_RtlUserThreadStart<-_RtlUserThreadStart

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-18T06:20:11+00:00Added an answer on June 18, 2026 at 6:20 am

    Good, you took a couple stack samples, shown here. Your RECMUSCL is calling LIMITERSUBR, which is calling for_write_int_fmt, which is doing a lot of stuff.

    free
    for__free_vm
    for_write_int_fmt_xmit
    for_write_int_fmt
    LIMITERSUBR   <------ Look at the line in LIMITERSUBR that prints integers
    RECMUSCL              because it appears on both stack samples
    MAIN__
    main
    _tmainCRTStartup
    BaseThreadInitThunk
    __RtlUserThreadStart
    _RtlUserThreadStart  
    
    for__release_lun
    for_write_int_fmt_xmit
    for_write_int_fmt
    LIMITERSUBR
    RECMUSCL
    MAIN__
    main
    _tmainCRTStartup
    BaseThreadInitThunk
    __RtlUserThreadStart
    _RtlUserThreadStart
    

    You could look on the stack sample at the line of code in LIMITERSUBR where you are writing integers, and see if you need to be doing that.

    (You see, you didn’t really need the symbols in the system dll 🙂

    It’s good that you took two stack samples, so you could see the problem twice.
    Seeing a problem once is not enough unless you know in advance that you have a really serious slowdown.
    Seeing it twice in so few samples means it is responsible for a large fraction of the time, like more than 50 percent and possibly close to 100, so it’s worthwhile trying to fix.
    (Actually it’s a Beta distribution whose most likely value is 2/2 = 100%.)

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm profiling my application in Visual Studio and the function with the most exclusive
I am profiling a C++ application with Intel VTune Amplifier. Most of the time
Have you used any profiling tool like Intel Vtune analyzer? What are your recommendations
I've used two profiling tools (VTune on windows and dbx (within sunstudio) on Solaris)
I'm profiling some multi-threaded CPython code. In order to measure the time it takes
Running through all the questions on profiling tools, I was surprised to discover VTune
Profiling shows this is the slowest segment of my code for a little word
while profiling a java application that calculates hierarchical clustering of thousands of elements I
I'm profiling some code, and cProfile reports that almost all the time is spent
I am currently profiling my node.js application.I found this blog: http://blog.nodejs.org/2012/04/25/profiling-node-js/ that suggests I

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.