Loop 8

Loop statistics

Accesses 4.58e+09
Thread Id Accesses
Thread Total 4.58e+09
23408 1.52e+08
23409 2.53e+08
23410 1.95e+08
23411 1.37e+08
23412 2.24e+08
23413 1.01e+08
23414 2.02e+08
23415 1.88e+08
23416 1.73e+08
23417 1.59e+08
23418 2.17e+08
23419 1.59e+08
23420 2.39e+08
23421 1.95e+08
23422 1.88e+08
23423 2.46e+08
23424 2.46e+08
23425 1.59e+08
23426 2.53e+08
23427 1.88e+08
23428 1.59e+08
23429 1.81e+08
23430 2.39e+08
23431 1.30e+08
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 1.9%
Thread Id % of misses
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
% of bandwidth 2.6%
Thread Id % of bandwidth
Thread Total 2.6%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.3%
23413 0.3%
23414 0.3%
23415 0.3%
23416 0.0%
23417 0.0%
23418 0.3%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.3%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.3%
% of fetches 1.9%
Thread Id % of fetches
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
% of write-backs 4.1%
Thread Id % of write-backs
Thread Total 4.1%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.5%
23413 0.5%
23414 0.5%
23415 0.5%
23416 0.0%
23417 0.0%
23418 0.5%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.5%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.5%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.5%
% of upgrades 15.2%
Thread Id % of upgrades
Thread Total 15.2%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 2.5%
23414 2.5%
23415 2.5%
23416 0.0%
23417 0.0%
23418 2.5%
23419 0.0%
23420 0.0%
23421 0.0%
23422 2.5%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 2.5%
Miss ratio 1.3%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 1.3% 0.0% 0.3% 0.9% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 3.1% 0.0% 3.1% 0.0% 0.0%
23413 7.2% 0.0% 0.0% 7.1% 0.0%
23414 3.6% 0.0% 0.0% 3.6% 0.0%
23415 3.9% 0.0% 0.0% 3.8% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.3% 0.0% 0.0% 3.3% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 3.9% 0.0% 0.1% 3.8% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 2.9% 0.0% 2.9% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 5.6% 0.0% 0.0% 5.6% 0.0%
Fetch ratio 1.3%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 1.3% 0.0% 0.3% 0.9% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 3.1% 0.0% 3.1% 0.0% 0.0%
23413 7.2% 0.0% 0.0% 7.1% 0.0%
23414 3.6% 0.0% 0.0% 3.6% 0.0%
23415 3.9% 0.0% 0.0% 3.8% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.3% 0.0% 0.0% 3.3% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 3.9% 0.0% 0.1% 3.8% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 2.9% 0.0% 2.9% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 5.6% 0.0% 0.0% 5.6% 0.0%
Write-back ratio 1.3%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 1.3% 0.0% 0.3% 0.9% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 3.1% 0.0% 3.1% 0.0% 0.0%
23413 7.1% 0.0% 0.0% 7.1% 0.0%
23414 3.6% 0.0% 0.0% 3.6% 0.0%
23415 3.8% 0.0% 0.0% 3.8% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.3% 0.0% 0.0% 3.3% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 3.9% 0.0% 0.1% 3.8% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 2.9% 0.0% 2.9% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 5.6% 0.0% 0.0% 5.6% 0.0%
Upgrade ratio 0.9%
Thread Id Upgrade ratio
Thread Average 0.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 7.1%
23414 3.6%
23415 3.8%
23416 0.0%
23417 0.0%
23418 3.3%
23419 0.0%
23420 0.0%
23421 0.0%
23422 3.8%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 5.6%
Communication ratio 1.9%
Thread Id Comm. ratio
Thread Average 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 14.3%
23414 7.1%
23415 7.7%
23416 0.0%
23417 0.0%
23418 6.7%
23419 0.0%
23420 0.0%
23421 0.0%
23422 7.7%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 11.1%
Fetch utilization 51.3%
Thread Id Fetch utilization
Thread Average 51.3%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 27.1%
23413 13.2%
23414 0.3%
23415 25.9%
23416 100.0%
23417 100.0%
23418 26.6%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.1%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.7%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.2%
Write-back utilization 50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%
Communication utilization 35.7%
Thread Id Comm. utilization
Thread Average 35.7%
23408 100.0%
23409 12.5%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 12.5%
23418 100.0%
23419 12.5%
23420 100.0%
23421 16.6%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 12.5%
23430 37.5%
23431 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low

Loop instructions

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 23.0% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 23.0% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 23.0% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 23.0% ]
                "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 23.4% ]
                   "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 23.4% ]
                      "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 23.4% ]
                         "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 23.4% ]
                            "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 35.0% ]
                               "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 35.2% ]
                                  "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 38.5% ]
                                     "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 54.2% ]
                                        "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 54.2% ]
                                           "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 84.9% ]
                                              "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 84.9% ]
                                                 "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88 [ 84.9% ]
                                                    "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196 [ 84.9% ]
                                                       "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36 [ 84.9% ]
                                                          "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196 [ 84.9% ]
                                                             "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36 [ 84.9% ]
                                                                "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782 [ 84.9% ]
                                                                   "octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x3e (0x9e3e4e), invoke.hpp:36
"octotiger"!grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x690 (0x9dcf70) [R], grid_fmm.cpp:753 1.9%
Thread Id % of misses
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
1.9%
Thread Id % of fetches
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
3.9%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 3.9% 0.0% 1.0% 2.9% 0.0%
23408 0.2% 0.0% 0.2% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 8.8% 0.0% 8.8% 0.0% 0.0%
23413 25.0% 0.0% 0.0% 25.0% 0.0%
23414 10.0% 0.0% 0.0% 10.0% 0.0%
23415 14.3% 0.0% 0.0% 14.3% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 9.1% 0.0% 0.0% 9.1% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.3% 0.0% 0.3% 0.0% 0.0%
23422 10.2% 0.0% 0.2% 10.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.2% 0.0% 0.2% 0.0% 0.0%
23426 8.3% 0.0% 8.3% 0.0% 0.0%
23427 0.2% 0.0% 0.2% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 12.6% 0.0% 0.1% 12.5% 0.0%
47.8%
Thread Id Fetch utilization
Thread Average 47.8%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.8%
23413 13.0%
23414 0.1%
23415 25.8%
23416 100.0%
23417 100.0%
23418 26.4%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.6%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.1%
50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<node_server::exchange_flux_corrections()::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > >&&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x80c (0xa24acc), futures_factory.hpp:78 [ 24.5% ]
       "octotiger"!void hpx::lcos::detail::future_data<void>::set_value<hpx::util::unused_type>(hpx::util::unused_type&&, hpx::error_code&)+0x18f (0x887a1f), future_data.hpp:430 [ 24.5% ]
          "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196 [ 24.5% ]
             "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<void> > const&)+0x11c (0x959c4c), packaged_continuation.hpp:210 [ 24.5% ]
                "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x5d (0x95973d), node_server_actions_3.cpp:475 [ 24.5% ]
                   "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 24.5% ]
                      "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 24.5% ]
                         "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 24.7% ]
                            "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 24.7% ]
                               "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 24.7% ]
                                  "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 24.7% ]
                                     "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 33.6% ]
                                        "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 33.6% ]
                                           "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 37.8% ]
                                              "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 53.8% ]
                                                 "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 53.8% ]
                                                    "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 82.7% ]
                                                       "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 82.7% ]
                                                          "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88 [ 82.7% ]
                                                             "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196 [ 82.7% ]
                                                                "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36 [ 82.7% ]
                                                                   "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196 [ 82.7% ]
                                                                      "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36 [ 82.7% ]
                                                                         "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782 [ 82.7% ]
                                                                            "octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x3e (0x9e3e4e), invoke.hpp:36
"octotiger"!grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x695 (0x9dcf75) [R], grid_fmm.cpp:753 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.1% 0.0% 0.1% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.1% 0.0% 0.1% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.1% 0.0% 0.1% 0.0% 0.0%
100.0%
Thread Id Fetch utilization
Thread Average 100.0%
23408 100.0%
23409 75.1%
23410 100.0%
23411 100.0%
23412 100.0%
23413 98.4%
23414 48.4%
23415 39.2%
23416 100.0%
23417 70.2%
23418 100.0%
23419 41.3%
23420 62.6%
23421 100.0%
23422 88.2%
23423 78.1%
23424 78.8%
23425 59.3%
23426 77.4%
23427 97.1%
23428 62.3%
23429 21.2%
23430 96.1%
23431 100.0%
100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 22.7% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 22.7% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 22.7% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 22.7% ]
                "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 23.1% ]
                   "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 23.1% ]
                      "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 23.1% ]
                         "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 23.1% ]
                            "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 34.4% ]
                               "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 34.6% ]
                                  "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 38.0% ]
                                     "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 53.8% ]
                                        "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 53.8% ]
                                           "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 84.8% ]
                                              "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 84.8% ]
                                                 "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88 [ 84.8% ]
                                                    "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196 [ 84.8% ]
                                                       "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36 [ 84.8% ]
                                                          "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196 [ 84.8% ]
                                                             "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36 [ 84.8% ]
                                                                "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782 [ 84.8% ]
                                                                   "octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x3e (0x9e3e4e), invoke.hpp:36
"octotiger"!grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x69a (0x9dcf7a) [W], grid_fmm.cpp:753 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
47.8%
Thread Id Fetch utilization
Thread Average 47.8%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.8%
23413 13.0%
23414 0.1%
23415 25.8%
23416 100.0%
23417 100.0%
23418 26.4%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.6%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.1%
50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%

Bandwidth issues related to this this loop

# Issue type % of bandwidth % of fetches % of write-backs Fetch utilization Write-back utilization
37 Inefficient loop nesting2.6%
Thread Id% of bandwidth
Thread Total2.6%
234080.0%
234090.0%
234100.0%
234110.0%
234120.3%
234130.3%
234140.3%
234150.3%
234160.0%
234170.0%
234180.3%
234190.0%
234200.0%
234210.0%
234220.3%
234230.0%
234240.0%
234250.0%
234260.3%
234270.0%
234280.0%
234290.0%
234300.0%
234310.3%
1.9%
Thread Id% of fetches
Thread Total1.9%
234080.0%
234090.0%
234100.0%
234110.0%
234120.2%
234130.2%
234140.2%
234150.2%
234160.0%
234170.0%
234180.2%
234190.0%
234200.0%
234210.0%
234220.2%
234230.0%
234240.0%
234250.0%
234260.2%
234270.0%
234280.0%
234290.0%
234300.0%
234310.2%
4.1%
Thread Id% of write-backs
Thread Total4.1%
234080.0%
234090.0%
234100.0%
234110.0%
234120.5%
234130.5%
234140.5%
234150.5%
234160.0%
234170.0%
234180.5%
234190.0%
234200.0%
234210.0%
234220.5%
234230.0%
234240.0%
234250.0%
234260.5%
234270.0%
234280.0%
234290.0%
234300.0%
234310.5%
47.8%
Thread IdFetch utilization
Thread Average47.8%
23408100.0%
23409100.0%
23410100.0%
23411100.0%
2341226.8%
2341313.0%
234140.1%
2341525.8%
23416100.0%
23417100.0%
2341826.4%
23419100.0%
23420100.0%
23421100.0%
234221.0%
23423100.0%
23424100.0%
23425100.0%
2342613.6%
23427100.0%
23428100.0%
23429100.0%
23430100.0%
2343138.1%
50.2%
Thread IdWrite-back utilization
Thread Average50.2%
23408100.0%
23409100.0%
23410100.0%
23411100.0%
2341226.2%
2341325.0%
2341412.7%
2341512.7%
23416100.0%
23417100.0%
2341836.5%
23419100.0%
23420100.0%
2342190.5%
2342238.2%
23423100.0%
23424100.0%
23425100.0%
2342650.8%
2342713.2%
23428100.0%
23429100.0%
23430100.0%
2343113.1%
54 Temporal blocking2.6%
Thread Id% of bandwidth
Thread Total2.6%
234080.0%
234090.0%
234100.0%
234110.0%
234120.3%
234130.3%
234140.3%
234150.3%
234160.0%
234170.0%
234180.3%
234190.0%
234200.0%
234210.0%
234220.3%
234230.0%
234240.0%
234250.0%
234260.3%
234270.0%
234280.0%
234290.0%
234300.0%
234310.3%
1.9%
Thread Id% of fetches
Thread Total1.9%
234080.0%
234090.0%
234100.0%
234110.0%
234120.2%
234130.2%
234140.2%
234150.2%
234160.0%
234170.0%
234180.2%
234190.0%
234200.0%
234210.0%
234220.2%
234230.0%
234240.0%
234250.0%
234260.2%
234270.0%
234280.0%
234290.0%
234300.0%
234310.2%
4.1%
Thread Id% of write-backs
Thread Total4.1%
234080.0%
234090.0%
234100.0%
234110.0%
234120.5%
234130.5%
234140.5%
234150.5%
234160.0%
234170.0%
234180.5%
234190.0%
234200.0%
234210.0%
234220.5%
234230.0%
234240.0%
234250.0%
234260.5%
234270.0%
234280.0%
234290.0%
234300.0%
234310.5%
47.8%
Thread IdFetch utilization
Thread Average47.8%
23408100.0%
23409100.0%
23410100.0%
23411100.0%
2341226.8%
2341313.0%
234140.1%
2341525.8%
23416100.0%
23417100.0%
2341826.4%
23419100.0%
23420100.0%
23421100.0%
234221.0%
23423100.0%
23424100.0%
23425100.0%
2342613.6%
23427100.0%
23428100.0%
23429100.0%
23430100.0%
2343138.1%
50.2%
Thread IdWrite-back utilization
Thread Average50.2%
23408100.0%
23409100.0%
23410100.0%
23411100.0%
2341226.2%
2341325.0%
2341412.7%
2341512.7%
23416100.0%
23417100.0%
2341836.5%
23419100.0%
23420100.0%
2342190.5%
2342238.2%
23423100.0%
23424100.0%
23425100.0%
2342650.8%
2342713.2%
23428100.0%
23429100.0%
23430100.0%
2343113.1%

Latency issues related to this this loop

# Issue type % of misses HW-Prefetch Randomness Fetch utilization
37 Inefficient loop nesting1.9%
Thread Id% of misses
Thread Total1.9%
234080.0%
234090.0%
234100.0%
234110.0%
234120.2%
234130.2%
234140.2%
234150.2%
234160.0%
234170.0%
234180.2%
234190.0%
234200.0%
234210.0%
234220.2%
234230.0%
234240.0%
234250.0%
234260.2%
234270.0%
234280.0%
234290.0%
234300.0%
234310.2%
0.0%
Thread IdHW prefetch probability
Thread Average0.0%
234080.0%
234090.0%
234100.0%
234110.0%
234120.0%
234130.0%
234140.0%
234150.0%
234160.0%
234170.0%
234180.0%
234190.0%
234200.0%
234210.0%
234220.0%
234230.0%
234240.0%
234250.0%
234260.0%
234270.0%
234280.0%
234290.0%
234300.0%
234310.0%
Low
Thread IdAccess randomness
Thread AverageLow
23408Low
23409Low
23410Low
23411Low
23412Low
23413Low
23414Low
23415Low
23416Low
23417Low
23418Low
23419Low
23420Low
23421Low
23422Low
23423Low
23424Low
23425Low
23426Low
23427Low
23428Low
23429Low
23430Low
23431Low
47.8%
Thread IdFetch utilization
Thread Average47.8%
23408100.0%
23409100.0%
23410100.0%
23411100.0%
2341226.8%
2341313.0%
234140.1%
2341525.8%
23416100.0%
23417100.0%
2341826.4%
23419100.0%
23420100.0%
23421100.0%
234221.0%
23423100.0%
23424100.0%
23425100.0%
2342613.6%
23427100.0%
23428100.0%
23429100.0%
23430100.0%
2343138.1%
54 Temporal blocking1.9%
Thread Id% of misses
Thread Total1.9%
234080.0%
234090.0%
234100.0%
234110.0%
234120.2%
234130.2%
234140.2%
234150.2%
234160.0%
234170.0%
234180.2%
234190.0%
234200.0%
234210.0%
234220.2%
234230.0%
234240.0%
234250.0%
234260.2%
234270.0%
234280.0%
234290.0%
234300.0%
234310.2%
0.0%
Thread IdHW prefetch probability
Thread Average0.0%
234080.0%
234090.0%
234100.0%
234110.0%
234120.0%
234130.0%
234140.0%
234150.0%
234160.0%
234170.0%
234180.0%
234190.0%
234200.0%
234210.0%
234220.0%
234230.0%
234240.0%
234250.0%
234260.0%
234270.0%
234280.0%
234290.0%
234300.0%
234310.0%
Low
Thread IdAccess randomness
Thread AverageLow
23408Low
23409Low
23410Low
23411Low
23412Low
23413Low
23414Low
23415Low
23416Low
23417Low
23418Low
23419Low
23420Low
23421Low
23422Low
23423Low
23424Low
23425Low
23426Low
23427Low
23428Low
23429Low
23430Low
23431Low
47.8%
Thread IdFetch utilization
Thread Average47.8%
23408100.0%
23409100.0%
23410100.0%
23411100.0%
2341226.8%
2341313.0%
234140.1%
2341525.8%
23416100.0%
23417100.0%
2341826.4%
23419100.0%
23420100.0%
23421100.0%
234221.0%
23423100.0%
23424100.0%
23425100.0%
2342613.6%
23427100.0%
23428100.0%
23429100.0%
23430100.0%
2343138.1%

MT issues related to this this loop

# Issue type % of communication Communication utilization False sharing
73 Communication utilization9.9%
Thread Id% of comm
Thread Total9.9%
234080.0%
234091.6%
234100.0%
234110.0%
234120.0%
234130.0%
234140.0%
234150.0%
234160.0%
234171.6%
234180.0%
234191.6%
234200.0%
234211.6%
234220.0%
234230.0%
234240.0%
234250.0%
234260.0%
234270.0%
234280.0%
234291.6%
234301.6%
234310.0%
35.7%
Thread IdComm. utilization
Thread Average35.7%
23408100.0%
2340912.5%
23410100.0%
23411100.0%
23412100.0%
23413100.0%
23414100.0%
23415100.0%
23416100.0%
2341712.5%
23418100.0%
2341912.5%
23420100.0%
2342116.6%
23422100.0%
23423100.0%
23424100.0%
23425100.0%
23426100.0%
23427100.0%
23428100.0%
2342912.5%
2343037.5%
23431100.0%
0.0%
Thread IdF-S. ratio
Thread Average0.0%
234080.0%
234090.0%
234100.0%
234110.0%
234120.0%
234130.0%
234140.0%
234150.0%
234160.0%
234170.0%
234180.0%
234190.0%
234200.0%
234210.0%
234220.0%
234230.0%
234240.0%
234250.0%
234260.0%
234270.0%
234280.0%
234290.0%
234300.0%
234310.0%

Instruction groups in this loop

Group % of misses % of fetches Fetch utilization Write-back utilization HW prefetch probability Randomness Issues
1 1.9%
Thread Id % of misses
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
1.9%
Thread Id % of fetches
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
47.8%
Thread Id Fetch utilization
Thread Average 47.8%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.8%
23413 13.0%
23414 0.1%
23415 25.8%
23416 100.0%
23417 100.0%
23418 26.4%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.6%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.1%
50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%
0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low

Instruction group 1

Accesses 2.98e+09
Thread Id Accesses
Thread Total 2.98e+09
23408 8.67e+07
23409 1.45e+08
23410 1.16e+08
23411 1.01e+08
23412 1.59e+08
23413 5.78e+07
23414 1.45e+08
23415 1.01e+08
23416 8.67e+07
23417 1.30e+08
23418 1.59e+08
23419 5.78e+07
23420 1.59e+08
23421 1.59e+08
23422 1.45e+08
23423 1.73e+08
23424 1.30e+08
23425 8.67e+07
23426 1.73e+08
23427 1.16e+08
23428 1.16e+08
23429 1.16e+08
23430 1.45e+08
23431 1.16e+08
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 1.9%
Thread Id % of misses
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
% of bandwidth 2.6%
Thread Id % of bandwidth
Thread Total 2.6%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.3%
23413 0.3%
23414 0.3%
23415 0.3%
23416 0.0%
23417 0.0%
23418 0.3%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.3%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.3%
% of fetches 1.9%
Thread Id % of fetches
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
% of write-backs 4.1%
Thread Id % of write-backs
Thread Total 4.1%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.5%
23413 0.5%
23414 0.5%
23415 0.5%
23416 0.0%
23417 0.0%
23418 0.5%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.5%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.5%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.5%
% of upgrades 15.2%
Thread Id % of upgrades
Thread Total 15.2%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 2.5%
23414 2.5%
23415 2.5%
23416 0.0%
23417 0.0%
23418 2.5%
23419 0.0%
23420 0.0%
23421 0.0%
23422 2.5%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 2.5%
Miss ratio 2.0%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 2.0% 0.0% 0.5% 1.5% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 4.4% 0.0% 4.4% 0.0% 0.0%
23413 12.5% 0.0% 0.0% 12.5% 0.0%
23414 5.0% 0.0% 0.0% 5.0% 0.0%
23415 7.1% 0.0% 0.0% 7.1% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 4.5% 0.0% 0.0% 4.5% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 5.1% 0.0% 0.1% 5.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 4.2% 0.0% 4.2% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 6.3% 0.0% 0.0% 6.2% 0.0%
Fetch ratio 2.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 2.0% 0.0% 0.5% 1.5% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 4.4% 0.0% 4.4% 0.0% 0.0%
23413 12.5% 0.0% 0.0% 12.5% 0.0%
23414 5.0% 0.0% 0.0% 5.0% 0.0%
23415 7.1% 0.0% 0.0% 7.1% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 4.5% 0.0% 0.0% 4.5% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 5.1% 0.0% 0.1% 5.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 4.2% 0.0% 4.2% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 6.3% 0.0% 0.0% 6.2% 0.0%
Write-back ratio 2.0%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 2.0% 0.0% 0.5% 1.5% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 4.4% 0.0% 4.4% 0.0% 0.0%
23413 12.5% 0.0% 0.0% 12.5% 0.0%
23414 5.0% 0.0% 0.0% 5.0% 0.0%
23415 7.1% 0.0% 0.0% 7.1% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 4.5% 0.0% 0.0% 4.5% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 5.1% 0.0% 0.1% 5.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 4.2% 0.0% 4.2% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 6.3% 0.0% 0.0% 6.2% 0.0%
Upgrade ratio 1.5%
Thread Id Upgrade ratio
Thread Average 1.5%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 12.5%
23414 5.0%
23415 7.1%
23416 0.0%
23417 0.0%
23418 4.5%
23419 0.0%
23420 0.0%
23421 0.0%
23422 5.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 6.2%
Communication ratio 2.9%
Thread Id Comm. ratio
Thread Average 2.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 25.0%
23414 10.0%
23415 14.3%
23416 0.0%
23417 0.0%
23418 9.1%
23419 0.0%
23420 0.0%
23421 0.0%
23422 10.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 12.5%
Fetch utilization 47.8%
Thread Id Fetch utilization
Thread Average 47.8%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.8%
23413 13.0%
23414 0.1%
23415 25.8%
23416 100.0%
23417 100.0%
23418 26.4%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.6%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.1%
Write-back utilization 50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%
Communication utilization 35.7%
Thread Id Comm. utilization
Thread Average 35.7%
23408 100.0%
23409 12.5%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 12.5%
23418 100.0%
23419 12.5%
23420 100.0%
23421 16.6%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 12.5%
23430 37.5%
23431 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low
Worst instruction "octotiger"!grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x690 (0x9dcf70) [R], grid_fmm.cpp:753

The following issues are detected for this instruction group:

  • Inefficient loop nesting, issue: 37
  • Temporal blocking, issue: 54
  • Communication utilization, issue: 73

Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x690 (0x9dcf70) [R], grid_fmm.cpp:753 1.9%
Thread Id % of misses
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
1.9%
Thread Id % of fetches
Thread Total 1.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.2%
23413 0.2%
23414 0.2%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.2%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.2%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.2%
3.9%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 3.9% 0.0% 1.0% 2.9% 0.0%
23408 0.2% 0.0% 0.2% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 8.8% 0.0% 8.8% 0.0% 0.0%
23413 25.0% 0.0% 0.0% 25.0% 0.0%
23414 10.0% 0.0% 0.0% 10.0% 0.0%
23415 14.3% 0.0% 0.0% 14.3% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 9.1% 0.0% 0.0% 9.1% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.3% 0.0% 0.3% 0.0% 0.0%
23422 10.2% 0.0% 0.2% 10.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.2% 0.0% 0.2% 0.0% 0.0%
23426 8.3% 0.0% 8.3% 0.0% 0.0%
23427 0.2% 0.0% 0.2% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 12.6% 0.0% 0.1% 12.5% 0.0%
47.8%
Thread Id Fetch utilization
Thread Average 47.8%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.8%
23413 13.0%
23414 0.1%
23415 25.8%
23416 100.0%
23417 100.0%
23418 26.4%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.6%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.1%
50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%
"octotiger"!grid::compute_boundary_interactions_monopole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x69a (0x9dcf7a) [W], grid_fmm.cpp:753 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
47.8%
Thread Id Fetch utilization
Thread Average 47.8%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.8%
23413 13.0%
23414 0.1%
23415 25.8%
23416 100.0%
23417 100.0%
23418 26.4%
23419 100.0%
23420 100.0%
23421 100.0%
23422 1.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 13.6%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 38.1%
50.2%
Thread Id Write-back utilization
Thread Average 50.2%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 26.2%
23413 25.0%
23414 12.7%
23415 12.7%
23416 100.0%
23417 100.0%
23418 36.5%
23419 100.0%
23420 100.0%
23421 90.5%
23422 38.2%
23423 100.0%
23424 100.0%
23425 100.0%
23426 50.8%
23427 13.2%
23428 100.0%
23429 100.0%
23430 100.0%
23431 13.1%

Copyright (c) 2006-2012 Rogue Wave Software, Inc. All Rights Reserved.
Patents pending.