Loop 7

Loop statistics

Accesses 4.85e+09
Thread Id Accesses
Thread Total 4.85e+09
23408 1.81e+08
23409 2.02e+08
23410 2.10e+08
23411 2.02e+08
23412 2.17e+08
23413 1.59e+08
23414 2.31e+08
23415 2.31e+08
23416 3.04e+08
23417 2.24e+08
23418 1.88e+08
23419 2.31e+08
23420 2.17e+08
23421 1.88e+08
23422 2.02e+08
23423 1.52e+08
23424 2.17e+08
23425 2.24e+08
23426 1.81e+08
23427 1.81e+08
23428 1.95e+08
23429 1.52e+08
23430 2.10e+08
23431 1.52e+08
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.5%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of bandwidth 1.0%
Thread Id % of bandwidth
Thread Total 1.0%
23408 0.3%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of fetches 1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of write-backs 0.0%
Thread Id % of write-backs
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of upgrades 0.0%
Thread Id % of upgrades
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Miss ratio 0.9%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 0.9% 0.0% 0.9% 0.0% 0.0%
23408 7.6% 0.0% 7.6% 0.0% 0.0%
23409 0.1% 0.0% 0.1% 0.0% 0.0%
23410 0.1% 0.0% 0.1% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.1% 0.0% 0.1% 0.0% 0.0%
23414 0.2% 0.0% 0.2% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.9% 0.0% 3.9% 0.0% 0.0%
23419 0.1% 0.0% 0.1% 0.0% 0.0%
23420 3.3% 0.0% 3.3% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 3.3% 0.0% 3.3% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 3.5% 0.0% 3.5% 0.0% 0.0%
23431 0.1% 0.0% 0.1% 0.0% 0.0%
Fetch ratio 0.9%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.9% 0.0% 0.9% 0.0% 0.0%
23408 7.6% 0.0% 7.6% 0.0% 0.0%
23409 0.1% 0.0% 0.1% 0.0% 0.0%
23410 0.1% 0.0% 0.1% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.1% 0.0% 0.1% 0.0% 0.0%
23414 0.2% 0.0% 0.2% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.9% 0.0% 3.9% 0.0% 0.0%
23419 0.1% 0.0% 0.1% 0.0% 0.0%
23420 3.3% 0.0% 3.3% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 3.3% 0.0% 3.3% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 3.5% 0.0% 3.5% 0.0% 0.0%
23431 0.1% 0.0% 0.1% 0.0% 0.0%
Write-back ratio 0.0%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.1% 0.0% 0.1% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Upgrade ratio 0.0%
Thread Id Upgrade ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Communication ratio 0.0%
Thread Id Comm. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch utilization 27.0%
Thread Id Fetch utilization
Thread Average 27.0%
23408 12.7%
23409 100.0%
23410 50.5%
23411 100.0%
23412 100.0%
23413 23.6%
23414 15.3%
23415 100.0%
23416 100.0%
23417 65.0%
23418 22.3%
23419 91.1%
23420 13.3%
23421 57.4%
23422 24.8%
23423 100.0%
23424 13.9%
23425 51.1%
23426 100.0%
23427 100.0%
23428 19.7%
23429 100.0%
23430 25.2%
23431 21.1%
Write-back utilization 66.0%
Thread Id Write-back utilization
Thread Average 66.0%
23408 100.0%
23409 100.0%
23410 35.6%
23411 41.9%
23412 88.9%
23413 100.0%
23414 91.5%
23415 97.7%
23416 42.8%
23417 74.7%
23418 25.5%
23419 100.0%
23420 62.1%
23421 88.5%
23422 91.7%
23423 100.0%
23424 100.0%
23425 47.9%
23426 100.0%
23427 94.1%
23428 40.9%
23429 69.9%
23430 28.7%
23431 63.1%
Communication utilization 100.0%
Thread Id Comm. utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low

Loop instructions

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 26.1% ]
       "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 26.1% ]
          "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 26.1% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 26.4% ]
                "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 26.4% ]
                   "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 26.4% ]
                      "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 26.4% ]
                         "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 43.5% ]
                            "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 43.9% ]
                               "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 48.4% ]
                                  "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 62.2% ]
                                     "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 62.2% ]
                                        "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 96.8% ]
                                           "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 96.8% ]
                                              "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88 [ 96.8% ]
                                                 "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196 [ 96.8% ]
                                                    "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36 [ 96.8% ]
                                                       "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196 [ 96.8% ]
                                                          "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36 [ 96.8% ]
                                                             "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782 [ 96.8% ]
                                                                "octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x3e (0x9e3d4e), invoke.hpp:36
"octotiger"!grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x298 (0x9dd918) [R], simdarray.h:952 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of fetches 1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
Fetch ratio 2.6%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 2.6% 0.0% 2.6% 0.0% 0.0%
23408 18.9% 0.0% 18.9% 0.0% 0.0%
23409 0.2% 0.0% 0.2% 0.0% 0.0%
23410 0.3% 0.0% 0.3% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.6% 0.0% 0.6% 0.0% 0.0%
23414 0.4% 0.0% 0.4% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 12.5% 0.0% 12.5% 0.0% 0.0%
23419 0.1% 0.0% 0.1% 0.0% 0.0%
23420 9.9% 0.0% 9.9% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 9.0% 0.0% 9.0% 0.0% 0.0%
23425 0.3% 0.0% 0.3% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.6% 0.0% 0.6% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 7.2% 0.0% 7.2% 0.0% 0.0%
23431 0.3% 0.0% 0.3% 0.0% 0.0%
Fetch utilization 26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
"octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 26.0% ]
       "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 26.0% ]
          "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 26.2% ]
             "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 26.2% ]
                "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 26.2% ]
                   "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 26.2% ]
                      "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 41.3% ]
                         "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 42.0% ]
                            "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 45.6% ]
                               "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 64.6% ]
                                  "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 64.6% ]
                                     "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 97.7% ]
                                        "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 97.7% ]
                                           "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88 [ 97.7% ]
                                              "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196 [ 97.7% ]
                                                 "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36 [ 97.7% ]
                                                    "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196 [ 97.7% ]
                                                       "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36 [ 97.7% ]
                                                          "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782 [ 97.7% ]
                                                             "octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x3e (0x9e3d4e), invoke.hpp:36
"octotiger"!grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x29e (0x9dd91e) [R], simdarray.h:952 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.1% 0.0% 0.1% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.1% 0.0% 0.1% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.1% 0.0% 0.1% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 49.5%
Thread Id Fetch utilization
Thread Average 49.5%
23408 0.0%
23409 100.0%
23410 100.0%
23411 0.0%
23412 0.0%
23413 51.7%
23414 78.5%
23415 38.5%
23416 50.6%
23417 71.9%
23418 51.4%
23419 0.0%
23420 100.0%
23421 9.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 51.1%
23426 100.0%
23427 0.1%
23428 0.1%
23429 0.0%
23430 62.4%
23431 0.0%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
"octotiger"!grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x2a8 (0x9dd928) [W], simdarray.h:952 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.1% 0.0% 0.1% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Write-back utilization 66.0%
Thread Id Write-back utilization
Thread Average 66.0%
23408 100.0%
23409 100.0%
23410 35.6%
23411 41.9%
23412 88.9%
23413 100.0%
23414 91.5%
23415 97.7%
23416 42.8%
23417 74.7%
23418 25.5%
23419 100.0%
23420 62.1%
23421 88.5%
23422 91.7%
23423 100.0%
23424 100.0%
23425 47.9%
23426 100.0%
23427 94.1%
23428 40.9%
23429 69.9%
23430 28.7%
23431 63.1%

Bandwidth issues related to this loop

# Issue type % of bandwidth % of fetches % of write-backs Fetch utilization Write-back utilization
32 Inefficient loop nesting 1.0%
Thread Id % of bandwidth
Thread Total 1.0%
23408 0.3%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of fetches 1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of write-backs 0.0%
Thread Id % of write-backs
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch utilization 26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
50 Spat/temp blocking 1.0%
Thread Id % of bandwidth
Thread Total 1.0%
23408 0.3%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of fetches 1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of write-backs 0.0%
Thread Id % of write-backs
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%

Latency issues related to this loop

# Issue type % of misses HW-Prefetch Randomness Fetch utilization
32 Inefficient loop nesting 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low
26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
50 Spat/temp blocking 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low
26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%

Instruction groups in this loop

Group % of misses % of fetches Fetch utilization Write-back utilization HW prefetch probability Randomness Issues
1 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low

Instruction group 1

Accesses 1.68e+09
Thread Id Accesses
Thread Total 1.68e+09
23408 7.23e+07
23409 7.95e+07
23410 6.51e+07
23411 5.06e+07
23412 5.06e+07
23413 2.89e+07
23414 9.40e+07
23415 5.06e+07
23416 1.30e+08
23417 7.95e+07
23418 5.78e+07
23419 1.08e+08
23420 7.23e+07
23421 5.06e+07
23422 7.23e+07
23423 5.06e+07
23424 7.95e+07
23425 5.78e+07
23426 5.78e+07
23427 8.67e+07
23428 6.51e+07
23429 6.51e+07
23430 1.01e+08
23431 5.78e+07
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of bandwidth 1.0%
Thread Id % of bandwidth
Thread Total 1.0%
23408 0.3%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of fetches 1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
% of write-backs 0.0%
Thread Id % of write-backs
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of upgrades 0.0%
Thread Id % of upgrades
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Miss ratio 2.6%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 2.6% 0.0% 2.6% 0.0% 0.0%
23408 18.9% 0.0% 18.9% 0.0% 0.0%
23409 0.2% 0.0% 0.2% 0.0% 0.0%
23410 0.3% 0.0% 0.3% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.6% 0.0% 0.6% 0.0% 0.0%
23414 0.4% 0.0% 0.4% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 12.5% 0.0% 12.5% 0.0% 0.0%
23419 0.1% 0.0% 0.1% 0.0% 0.0%
23420 9.9% 0.0% 9.9% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 9.0% 0.0% 9.0% 0.0% 0.0%
23425 0.3% 0.0% 0.3% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.6% 0.0% 0.6% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 7.2% 0.0% 7.2% 0.0% 0.0%
23431 0.3% 0.0% 0.3% 0.0% 0.0%
Fetch ratio 2.6%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 2.6% 0.0% 2.6% 0.0% 0.0%
23408 18.9% 0.0% 18.9% 0.0% 0.0%
23409 0.2% 0.0% 0.2% 0.0% 0.0%
23410 0.3% 0.0% 0.3% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.6% 0.0% 0.6% 0.0% 0.0%
23414 0.4% 0.0% 0.4% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 12.5% 0.0% 12.5% 0.0% 0.0%
23419 0.1% 0.0% 0.1% 0.0% 0.0%
23420 9.9% 0.0% 9.9% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 9.0% 0.0% 9.0% 0.0% 0.0%
23425 0.3% 0.0% 0.3% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.6% 0.0% 0.6% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 7.2% 0.0% 7.2% 0.0% 0.0%
23431 0.3% 0.0% 0.3% 0.0% 0.0%
Write-back ratio 0.0%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Upgrade ratio 0.0%
Thread Id Upgrade ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Communication ratio 0.0%
Thread Id Comm. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch utilization 26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
Communication utilization 100.0%
Thread Id Comm. utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low
Worst instruction "octotiger"!grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x298 (0x9dd918) [R], simdarray.h:952

The following issues are detected for this instruction group:

  • Inefficient loop nesting, issue: 32
  • Spat/temp blocking, issue: 50

Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!grid::compute_boundary_interactions_multipole_multipole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x298 (0x9dd918) [R], simdarray.h:952 1.5%
Thread Id % of misses
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
1.5%
Thread Id % of fetches
Thread Total 1.5%
23408 0.4%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.2%
23419 0.0%
23420 0.2%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.2%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.2%
23431 0.0%
2.6%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 2.6% 0.0% 2.6% 0.0% 0.0%
23408 18.9% 0.0% 18.9% 0.0% 0.0%
23409 0.2% 0.0% 0.2% 0.0% 0.0%
23410 0.3% 0.0% 0.3% 0.0% 0.0%
23411 0.1% 0.0% 0.1% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.6% 0.0% 0.6% 0.0% 0.0%
23414 0.4% 0.0% 0.4% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 12.5% 0.0% 12.5% 0.0% 0.0%
23419 0.1% 0.0% 0.1% 0.0% 0.0%
23420 9.9% 0.0% 9.9% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 9.0% 0.0% 9.0% 0.0% 0.0%
23425 0.3% 0.0% 0.3% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.6% 0.0% 0.6% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 7.2% 0.0% 7.2% 0.0% 0.0%
23431 0.3% 0.0% 0.3% 0.0% 0.0%
26.9%
Thread Id Fetch utilization
Thread Average 26.9%
23408 12.8%
23409 100.0%
23410 41.7%
23411 100.0%
23412 100.0%
23413 22.5%
23414 12.5%
23415 100.0%
23416 100.0%
23417 100.0%
23418 22.3%
23419 100.0%
23420 12.9%
23421 100.0%
23422 100.0%
23423 100.0%
23424 14.0%
23425 53.7%
23426 100.0%
23427 100.0%
23428 21.1%
23429 100.0%
23430 25.1%
23431 25.8%
100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%

Copyright (c) 2006-2012 Rogue Wave Software, Inc. All Rights Reserved.
Patents pending.