Issue #44: Loop fusion

It may be possible to merge the bodies of loop #3 (the first loop) and loop #1 (the second loop) by moving the first loop down to the second.
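As a generic illustration of the transformation suggested here, the sketch below shows two loops over the same range merged into one. It is not taken from grid_fmm.cpp; the function names and the arithmetic are hypothetical and only show the shape of the change. Fusion is only valid when the second loop does not depend on results that the first loop produces in later iterations.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Unfused: two separate passes, so `in` is read from memory twice.
// (Hypothetical example; not the Octotiger loops named in this report.)
void scale_and_shift_unfused(const std::vector<double>& in,
                             std::vector<double>& scaled,
                             std::vector<double>& shifted) {
    assert(in.size() == scaled.size() && in.size() == shifted.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        scaled[i] = 2.0 * in[i];   // first loop body
    }
    for (std::size_t i = 0; i < in.size(); ++i) {
        shifted[i] = in[i] + 1.0;  // second loop body
    }
}

// Fused: both bodies run in one pass, so each element of `in` is
// loaded once and reused while it is still in cache (or a register).
void scale_and_shift_fused(const std::vector<double>& in,
                           std::vector<double>& scaled,
                           std::vector<double>& shifted) {
    assert(in.size() == scaled.size() && in.size() == shifted.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        const double x = in[i];
        scaled[i] = 2.0 * x;
        shifted[i] = x + 1.0;
    }
}
```

Both versions produce the same output arrays; the fused one simply touches the input once instead of twice, which is the kind of reuse this issue is pointing at.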

Statistics for fusible instruction group, second loop #1

Accesses 1.57e+10
Thread Id Accesses
Thread Total 1.57e+10
23408 7.66e+08
23409 5.93e+08
23410 5.35e+08
23411 6.94e+08
23412 8.53e+08
23413 5.20e+08
23414 6.36e+08
23415 7.95e+08
23416 6.65e+08
23417 6.94e+08
23418 5.78e+08
23419 8.24e+08
23420 8.82e+08
23421 5.78e+08
23422 5.78e+08
23423 6.07e+08
23424 6.07e+08
23425 6.36e+08
23426 7.08e+08
23427 4.63e+08
23428 7.52e+08
23429 4.63e+08
23430 6.80e+08
23431 5.93e+08
% of misses 6.4%
Thread Id % of misses
Thread Total 6.4%
23408 0.2%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.6%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
% of bandwidth 8.7%
Thread Id % of bandwidth
Thread Total 8.7%
23408 0.3%
23409 1.0%
23410 0.0%
23411 0.4%
23412 0.0%
23413 0.0%
23414 0.6%
23415 0.8%
23416 0.0%
23417 0.0%
23418 1.0%
23419 0.7%
23420 0.0%
23421 1.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.7%
23426 0.3%
23427 0.7%
23428 0.7%
23429 0.3%
23430 0.0%
23431 0.3%
% of fetches 6.4%
Thread Id % of fetches
Thread Total 6.4%
23408 0.2%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.6%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
% of write-backs 13.7%
Thread Id % of write-backs
Thread Total 13.7%
23408 0.5%
23409 1.6%
23410 0.0%
23411 0.6%
23412 0.0%
23413 0.0%
23414 0.9%
23415 1.2%
23416 0.0%
23417 0.0%
23418 1.6%
23419 1.0%
23420 0.0%
23421 1.5%
23422 0.0%
23423 0.0%
23424 0.0%
23425 1.0%
23426 0.5%
23427 1.0%
23428 1.0%
23429 0.5%
23430 0.0%
23431 0.5%
% of upgrades 20.8%
Thread Id % of upgrades
Thread Total 20.8%
23408 0.3%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 2.5%
23416 0.0%
23417 0.0%
23418 5.1%
23419 2.5%
23420 0.0%
23421 3.5%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.2%
23426 0.1%
23427 2.5%
23428 1.3%
23429 2.5%
23430 0.0%
23431 0.2%
Miss ratio 1.2%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 1.2% 0.0% 0.9% 0.4% 0.0%
23408 1.0% 0.0% 0.9% 0.1% 0.0%
23409 3.8% 0.0% 3.8% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 1.2% 0.0% 1.2% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 2.0% 0.0% 2.0% 0.0% 0.0%
23415 2.2% 0.0% 1.2% 0.9% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.9% 0.0% 1.4% 2.5% 0.0%
23419 1.8% 0.0% 0.9% 0.9% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 3.8% 0.0% 2.0% 1.7% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 2.3% 0.0% 2.2% 0.1% 0.0%
23426 1.1% 0.0% 1.0% 0.0% 0.0%
23427 3.1% 0.0% 1.6% 1.6% 0.0%
23428 2.0% 0.0% 1.5% 0.5% 0.0%
23429 1.6% 0.0% 0.0% 1.6% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 1.2% 0.0% 1.1% 0.1% 0.0%
Fetch ratio 1.2%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 1.2% 0.0% 0.9% 0.4% 0.0%
23408 1.0% 0.0% 0.9% 0.1% 0.0%
23409 3.8% 0.0% 3.8% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 1.2% 0.0% 1.2% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 2.0% 0.0% 2.0% 0.0% 0.0%
23415 2.2% 0.0% 1.2% 0.9% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.9% 0.0% 1.4% 2.5% 0.0%
23419 1.8% 0.0% 0.9% 0.9% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 3.8% 0.0% 2.0% 1.7% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 2.3% 0.0% 2.2% 0.1% 0.0%
23426 1.1% 0.0% 1.0% 0.0% 0.0%
23427 3.1% 0.0% 1.6% 1.6% 0.0%
23428 2.0% 0.0% 1.5% 0.5% 0.0%
23429 1.6% 0.0% 0.0% 1.6% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 1.2% 0.0% 1.1% 0.1% 0.0%
Write-back ratio 1.2%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 1.2% 0.0% 0.9% 0.4% 0.0%
23408 1.0% 0.0% 0.9% 0.1% 0.0%
23409 3.8% 0.0% 3.8% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 1.2% 0.0% 1.2% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 2.0% 0.0% 2.0% 0.0% 0.0%
23415 2.2% 0.0% 1.2% 0.9% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.9% 0.0% 1.4% 2.5% 0.0%
23419 1.8% 0.0% 0.9% 0.9% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 3.8% 0.0% 2.0% 1.7% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 2.3% 0.0% 2.2% 0.1% 0.0%
23426 1.1% 0.0% 1.0% 0.0% 0.0%
23427 3.1% 0.0% 1.6% 1.6% 0.0%
23428 2.0% 0.0% 1.5% 0.5% 0.0%
23429 1.6% 0.0% 0.0% 1.6% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 1.2% 0.0% 1.1% 0.1% 0.0%
Upgrade ratio 0.4%
Thread Id Upgrade ratio
Thread Average 0.4%
23408 0.1%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.9%
23416 0.0%
23417 0.0%
23418 2.5%
23419 0.9%
23420 0.0%
23421 1.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.1%
23426 0.0%
23427 1.6%
23428 0.5%
23429 1.6%
23430 0.0%
23431 0.1%
Communication ratio 0.8%
Thread Id Comm. ratio
Thread Average 0.8%
23408 0.2%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 1.8%
23416 0.0%
23417 0.0%
23418 5.0%
23419 1.8%
23420 0.0%
23421 3.4%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.2%
23426 0.1%
23427 3.1%
23428 1.0%
23429 3.1%
23430 0.0%
23431 0.2%
Fetch utilization 56.3%
Thread Id Fetch utilization
Thread Average 56.3%
23408 25.1%
23409 16.5%
23410 100.0%
23411 42.2%
23412 100.0%
23413 100.0%
23414 27.9%
23415 32.3%
23416 100.0%
23417 100.0%
23418 22.7%
23419 13.1%
23420 100.0%
23421 21.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 37.8%
23426 28.3%
23427 19.1%
23428 37.1%
23429 58.4%
23430 100.0%
23431 100.0%
Write-back utilization 46.9%
Thread Id Write-back utilization
Thread Average 46.9%
23408 73.7%
23409 32.4%
23410 100.0%
23411 34.6%
23412 100.0%
23413 100.0%
23414 67.1%
23415 38.5%
23416 100.0%
23417 100.0%
23418 12.7%
23419 16.2%
23420 100.0%
23421 29.3%
23422 100.0%
23423 100.0%
23424 100.0%
23425 19.6%
23426 17.6%
23427 12.7%
23428 35.3%
23429 37.7%
23430 100.0%
23431 62.3%
Communication utilization 44.4%
Thread Id Comm. utilization
Thread Average 44.4%
23408 100.0%
23409 100.0%
23410 19.1%
23411 12.5%
23412 100.0%
23413 12.5%
23414 100.0%
23415 100.0%
23416 12.7%
23417 100.0%
23418 100.0%
23419 37.5%
23420 100.0%
23421 18.6%
23422 18.7%
23423 49.0%
23424 48.7%
23425 100.0%
23426 35.5%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 35.7%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low
Worst instruction "octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1a1 (0x9e40b1) [R], grid_fmm.cpp:809

Fusible instruction group, first loop #3

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 20.7% ]
       "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 20.7% ]
          "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 20.7% ]
             "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 20.7% ]
                "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 20.7% ]
                   "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 20.7% ]
                      "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 20.7% ]
                         "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 40.3% ]
                            "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0xf5e (0xa25f3e), node_server.cpp:433
"octotiger"!grid::compute_interactions(gsolve_type)+0x829 (0x9e7499) [R], grid_fmm.cpp:370 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.1% 0.0% 0.1% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 100.0%
Thread Id Fetch utilization
Thread Average 100.0%
23408 35.5%
23409 42.6%
23410 99.5%
23411 8.5%
23412 100.0%
23413 100.0%
23414 12.0%
23415 30.0%
23416 36.1%
23417 18.8%
23418 11.3%
23419 5.9%
23420 41.5%
23421 9.4%
23422 35.1%
23423 100.0%
23424 12.0%
23425 80.3%
23426 100.0%
23427 23.9%
23428 88.6%
23429 73.1%
23430 44.3%
23431 41.3%
Write-back utilization 30.0%
Thread Id Write-back utilization
Thread Average 30.0%
23408 47.6%
23409 80.2%
23410 36.5%
23411 33.8%
23412 63.4%
23413 37.4%
23414 21.7%
23415 40.9%
23416 31.5%
23417 36.5%
23418 29.2%
23419 24.9%
23420 37.6%
23421 20.4%
23422 30.1%
23423 31.6%
23424 25.8%
23425 28.1%
23426 24.4%
23427 34.3%
23428 72.3%
23429 37.7%
23430 25.9%
23431 27.7%
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 29.5% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 39.1% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0xf5e (0xa25f3e), node_server.cpp:433
"octotiger"!grid::compute_interactions(gsolve_type)+0x834 (0x9e74a4) [W], grid_fmm.cpp:370 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 100.0%
Thread Id Fetch utilization
Thread Average 100.0%
23408 35.5%
23409 42.6%
23410 99.5%
23411 8.5%
23412 100.0%
23413 100.0%
23414 12.0%
23415 30.0%
23416 36.1%
23417 18.8%
23418 11.3%
23419 5.9%
23420 41.5%
23421 9.4%
23422 35.1%
23423 100.0%
23424 12.0%
23425 80.3%
23426 100.0%
23427 23.9%
23428 88.6%
23429 73.1%
23430 44.3%
23431 41.3%
Write-back utilization 30.0%
Thread Id Write-back utilization
Thread Average 30.0%
23408 47.6%
23409 80.2%
23410 36.5%
23411 33.8%
23412 63.4%
23413 37.4%
23414 21.7%
23415 40.9%
23416 31.5%
23417 36.5%
23418 29.2%
23419 24.9%
23420 37.6%
23421 20.4%
23422 30.1%
23423 31.6%
23424 25.8%
23425 28.1%
23426 24.4%
23427 34.3%
23428 72.3%
23429 37.7%
23430 25.9%
23431 27.7%

Fusible instruction group, second loop #1

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 21.9% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 21.9% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 23.8% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 23.8% ]
                "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 24.5% ]
                   "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 24.5% ]
                      "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 24.5% ]
                         "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 24.5% ]
                            "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 30.6% ]
                               "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 31.0% ]
                                  "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 32.6% ]
                                     "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 50.8% ]
                                        "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 50.8% ]
                                           "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329
                                              "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374
                                                 "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88
                                                    "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196
                                                       "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36
                                                          "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196
                                                             "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36
                                                                "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1a1 (0x9e40b1) [R], grid_fmm.cpp:809 6.4%
Thread Id % of misses
Thread Total 6.4%
23408 0.2%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.6%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
% of fetches 6.4%
Thread Id % of fetches
Thread Total 6.4%
23408 0.2%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.6%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
Fetch ratio 2.5%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 2.5% 0.0% 1.7% 0.8% 0.0%
23408 2.0% 0.0% 1.8% 0.2% 0.0%
23409 7.5% 0.0% 7.5% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 2.4% 0.0% 2.4% 0.0% 0.0%
23412 0.1% 0.0% 0.1% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 3.9% 0.0% 3.9% 0.0% 0.0%
23415 4.3% 0.0% 2.5% 1.8% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 7.8% 0.0% 2.8% 5.0% 0.0%
23419 3.5% 0.0% 1.8% 1.8% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 7.5% 0.0% 4.1% 3.4% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.1% 0.0% 0.1% 0.0% 0.0%
23424 0.2% 0.0% 0.2% 0.0% 0.0%
23425 4.6% 0.0% 4.4% 0.2% 0.0%
23426 2.1% 0.0% 2.0% 0.1% 0.0%
23427 6.3% 0.0% 3.2% 3.1% 0.0%
23428 3.9% 0.0% 3.0% 1.0% 0.0%
23429 3.1% 0.0% 0.0% 3.1% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 2.5% 0.0% 2.2% 0.2% 0.0%
Fetch utilization 56.3%
Thread Id Fetch utilization
Thread Average 56.3%
23408 25.1%
23409 16.5%
23410 100.0%
23411 42.2%
23412 100.0%
23413 100.0%
23414 27.9%
23415 32.3%
23416 100.0%
23417 100.0%
23418 22.7%
23419 13.1%
23420 100.0%
23421 21.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 37.8%
23426 28.3%
23427 19.1%
23428 37.1%
23429 58.4%
23430 100.0%
23431 100.0%
Write-back utilization 46.9%
Thread Id Write-back utilization
Thread Average 46.9%
23408 73.7%
23409 32.4%
23410 100.0%
23411 34.6%
23412 100.0%
23413 100.0%
23414 67.1%
23415 38.5%
23416 100.0%
23417 100.0%
23418 12.7%
23419 16.2%
23420 100.0%
23421 29.3%
23422 100.0%
23423 100.0%
23424 100.0%
23425 19.6%
23426 17.6%
23427 12.7%
23428 35.3%
23429 37.7%
23430 100.0%
23431 62.3%
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1ac (0x9e40bc) [W], grid_fmm.cpp:809 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 56.3%
Thread Id Fetch utilization
Thread Average 56.3%
23408 25.1%
23409 16.5%
23410 100.0%
23411 42.2%
23412 100.0%
23413 100.0%
23414 27.9%
23415 32.3%
23416 100.0%
23417 100.0%
23418 22.7%
23419 13.1%
23420 100.0%
23421 21.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 37.8%
23426 28.3%
23427 19.1%
23428 37.1%
23429 58.4%
23430 100.0%
23431 100.0%
Write-back utilization 46.9%
Thread Id Write-back utilization
Thread Average 46.9%
23408 73.7%
23409 32.4%
23410 100.0%
23411 34.6%
23412 100.0%
23413 100.0%
23414 67.1%
23415 38.5%
23416 100.0%
23417 100.0%
23418 12.7%
23419 16.2%
23420 100.0%
23421 29.3%
23422 100.0%
23423 100.0%
23424 100.0%
23425 19.6%
23426 17.6%
23427 12.7%
23428 35.3%
23429 37.7%
23430 100.0%
23431 62.3%

Potential barriers for moving the body of the second loop #1

Code motion barrier Loop instruction
"octotiger"!hpx::lcos::local::spinlock::lock()+0x61 (0x853011), thread_helpers.hpp:499
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1ac (0x9e40bc) [W], grid_fmm.cpp:809
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1ac (0x9e40bc) [W], grid_fmm.cpp:809
"octotiger"!grid::compute_boundary_interactions_multipole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}::operator()(unsigned long) const+0x7de (0x9de7ee), thread_helpers.hpp:499
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1ac (0x9e40bc) [W], grid_fmm.cpp:809
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1ac (0x9e40bc) [W], grid_fmm.cpp:809

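As a rough illustration of what fusing two loops means and why the barriers listed above matter, the following is a minimal sketch. The function names, the arrays `phi` and `rho`, and the arithmetic are hypothetical stand-ins and are not taken from grid_fmm.cpp; the sketch assumes `rho` has at least as many elements as `phi`.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-ins for the two loops; not the octotiger code.
// Unfused: two passes over phi, so each element is brought into cache twice
// when phi does not fit in cache between the passes.
void two_passes(std::vector<double>& phi, const std::vector<double>& rho) {
    for (std::size_t i = 0; i < phi.size(); ++i) {
        phi[i] += rho[i];   // first loop
    }
    // Code placed here that the second loop depends on (for example a lock
    // or a write to shared data) would act as a barrier to moving the loop.
    for (std::size_t i = 0; i < phi.size(); ++i) {
        phi[i] *= 0.5;      // second loop
    }
}

// Fused: one pass. This is only valid because iteration i of the second
// loop depends on nothing except iteration i of the first loop.
void one_pass(std::vector<double>& phi, const std::vector<double>& rho) {
    for (std::size_t i = 0; i < phi.size(); ++i) {
        phi[i] += rho[i];
        phi[i] *= 0.5;
    }
}
```

Both functions produce the same values in `phi`; the fused form touches each element once per call instead of twice. Whether the real loops can be merged this way depends on the barrier instructions in the table above.
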
Loop statistics, first loop #3

Accesses 2.04e+10
Thread Id Accesses
Thread Total 2.04e+10
23408 8.82e+08
23409 9.04e+08
23410 6.80e+08
23411 8.60e+08
23412 8.60e+08
23413 7.23e+08
23414 8.10e+08
23415 9.98e+08
23416 1.05e+09
23417 9.61e+08
23418 8.39e+08
23419 7.81e+08
23420 8.96e+08
23421 7.45e+08
23422 8.60e+08
23423 7.16e+08
23424 6.51e+08
23425 9.40e+08
23426 7.23e+08
23427 9.25e+08
23428 8.17e+08
23429 1.03e+09
23430 8.02e+08
23431 9.11e+08
% of misses 0.5%
Thread Id % of misses
Thread Total 0.5%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.1%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of bandwidth 0.6%
Thread Id % of bandwidth
Thread Total 0.6%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.1%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.5%
Thread Id % of fetches
Thread Total 0.5%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.1%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of write-backs 0.9%
Thread Id % of write-backs
Thread Total 0.9%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.1%
23413 0.0%
23414 0.0%
23415 0.1%
23416 0.1%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.1%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.1%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.1%
23430 0.0%
23431 0.0%
% of upgrades 0.0%
Thread Id % of upgrades
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Miss ratio 0.1%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 0.1% 0.0% 0.1% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.2% 0.0% 0.2% 0.0% 0.0%
23413 0.1% 0.0% 0.1% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.1% 0.0% 0.1% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.1% 0.0% 0.1% 0.0% 0.0%
23418 0.1% 0.0% 0.1% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.1% 0.0% 0.1% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 0.1% 0.0% 0.1% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.2% 0.0% 0.2% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 0.1% 0.0% 0.1% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.1% 0.0% 0.1% 0.0% 0.0%
23429 0.1% 0.0% 0.1% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch ratio 0.1%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.1% 0.0% 0.1% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.2% 0.0% 0.2% 0.0% 0.0%
23413 0.1% 0.0% 0.1% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.1% 0.0% 0.1% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.1% 0.0% 0.1% 0.0% 0.0%
23418 0.1% 0.0% 0.1% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.1% 0.0% 0.1% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 0.1% 0.0% 0.1% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.2% 0.0% 0.2% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 0.1% 0.0% 0.1% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.1% 0.0% 0.1% 0.0% 0.0%
23429 0.1% 0.0% 0.1% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Write-back ratio 0.1%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 0.1% 0.0% 0.1% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.2% 0.0% 0.2% 0.0% 0.0%
23413 0.1% 0.0% 0.1% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.1% 0.0% 0.1% 0.0% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.1% 0.0% 0.1% 0.0% 0.0%
23418 0.1% 0.0% 0.1% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.1% 0.0% 0.1% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.1% 0.0% 0.1% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 0.1% 0.0% 0.1% 0.0% 0.0%
23426 0.1% 0.0% 0.1% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.1% 0.0% 0.1% 0.0% 0.0%
23429 0.1% 0.0% 0.1% 0.0% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Upgrade ratio 0.0%
Thread Id Upgrade ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Communication ratio 0.0%
Thread Id Comm. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch utilization 44.7%
Thread Id Fetch utilization
Thread Average 44.7%
23408 64.0%
23409 100.0%
23410 83.0%
23411 34.4%
23412 15.0%
23413 55.7%
23414 71.2%
23415 10.4%
23416 16.0%
23417 26.9%
23418 31.1%
23419 100.0%
23420 35.2%
23421 100.0%
23422 37.7%
23423 100.0%
23424 22.7%
23425 17.7%
23426 100.0%
23427 60.0%
23428 28.4%
23429 20.0%
23430 15.4%
23431 100.0%
Write-back utilization 38.2%
Thread Id Write-back utilization
Thread Average 38.2%
23408 92.2%
23409 69.9%
23410 76.4%
23411 45.7%
23412 18.7%
23413 16.7%
23414 39.6%
23415 26.4%
23416 26.7%
23417 26.4%
23418 29.0%
23419 100.0%
23420 30.3%
23421 25.7%
23422 44.8%
23423 100.0%
23424 18.2%
23425 25.8%
23426 37.3%
23427 52.8%
23428 35.6%
23429 19.6%
23430 25.0%
23431 69.9%
Communication utilization 100.0%
Thread Id Comm. utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low

Loop instructions, first loop #3

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 20.7% ]
       "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 20.7% ]
          "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 20.7% ]
             "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 20.7% ]
                "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 20.7% ]
                   "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 20.7% ]
                      "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 20.7% ]
                         "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 40.3% ]
                            "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0xf5e (0xa25f3e), node_server.cpp:433
"octotiger"!grid::compute_interactions(gsolve_type)+0x829 (0x9e7499) [R], grid_fmm.cpp:370 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.1% 0.0% 0.1% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.1% 0.0% 0.1% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 100.0%
Thread Id Fetch utilization
Thread Average 100.0%
23408 35.5%
23409 42.6%
23410 99.5%
23411 8.5%
23412 100.0%
23413 100.0%
23414 12.0%
23415 30.0%
23416 36.1%
23417 18.8%
23418 11.3%
23419 5.9%
23420 41.5%
23421 9.4%
23422 35.1%
23423 100.0%
23424 12.0%
23425 80.3%
23426 100.0%
23427 23.9%
23428 88.6%
23429 73.1%
23430 44.3%
23431 41.3%
Write-back utilization 30.0%
Thread Id Write-back utilization
Thread Average 30.0%
23408 47.6%
23409 80.2%
23410 36.5%
23411 33.8%
23412 63.4%
23413 37.4%
23414 21.7%
23415 40.9%
23416 31.5%
23417 36.5%
23418 29.2%
23419 24.9%
23420 37.6%
23421 20.4%
23422 30.1%
23423 31.6%
23424 25.8%
23425 28.1%
23426 24.4%
23427 34.3%
23428 72.3%
23429 37.7%
23430 25.9%
23431 27.7%
"octotiger"!grid::compute_interactions(gsolve_type)+0x82e (0x9e749e) [R], grid_fmm.cpp:370 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.1% 0.0% 0.1% 0.0% 0.0%
Fetch utilization 68.2%
Thread Id Fetch utilization
Thread Average 68.2%
23408 86.5%
23409 43.3%
23410 78.1%
23411 82.5%
23412 87.3%
23413 90.5%
23414 35.7%
23415 100.0%
23416 42.0%
23417 100.0%
23418 100.0%
23419 48.0%
23420 46.6%
23421 87.6%
23422 66.7%
23423 100.0%
23424 68.1%
23425 100.0%
23426 47.4%
23427 41.7%
23428 100.0%
23429 100.0%
23430 100.0%
23431 39.6%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 29.5% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 39.1% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0xf5e (0xa25f3e), node_server.cpp:433
"octotiger"!grid::compute_interactions(gsolve_type)+0x834 (0x9e74a4) [W], grid_fmm.cpp:370 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 100.0%
Thread Id Fetch utilization
Thread Average 100.0%
23408 35.5%
23409 42.6%
23410 99.5%
23411 8.5%
23412 100.0%
23413 100.0%
23414 12.0%
23415 30.0%
23416 36.1%
23417 18.8%
23418 11.3%
23419 5.9%
23420 41.5%
23421 9.4%
23422 35.1%
23423 100.0%
23424 12.0%
23425 80.3%
23426 100.0%
23427 23.9%
23428 88.6%
23429 73.1%
23430 44.3%
23431 41.3%
Write-back utilization 30.0%
Thread Id Write-back utilization
Thread Average 30.0%
23408 47.6%
23409 80.2%
23410 36.5%
23411 33.8%
23412 63.4%
23413 37.4%
23414 21.7%
23415 40.9%
23416 31.5%
23417 36.5%
23418 29.2%
23419 24.9%
23420 37.6%
23421 20.4%
23422 30.1%
23423 31.6%
23424 25.8%
23425 28.1%
23426 24.4%
23427 34.3%
23428 72.3%
23429 37.7%
23430 25.9%
23431 27.7%
"octotiger"!grid::compute_interactions(gsolve_type)+0x839 (0x9e74a9) [R], grid_fmm.cpp:371 0.4%
Thread Id % of misses
Thread Total 0.4%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.1%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.4%
Thread Id % of fetches
Thread Total 0.4%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.1%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.3%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.3% 0.0% 0.3% 0.0% 0.0%
23408 0.1% 0.0% 0.1% 0.0% 0.0%
23409 0.1% 0.0% 0.1% 0.0% 0.0%
23410 0.1% 0.0% 0.1% 0.0% 0.0%
23411 0.2% 0.0% 0.2% 0.0% 0.0%
23412 0.8% 0.0% 0.8% 0.0% 0.0%
23413 0.4% 0.0% 0.4% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.7% 0.0% 0.7% 0.0% 0.0%
23416 0.3% 0.0% 0.3% 0.0% 0.0%
23417 0.3% 0.0% 0.3% 0.0% 0.0%
23418 0.4% 0.0% 0.4% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.7% 0.0% 0.7% 0.0% 0.0%
23421 0.3% 0.0% 0.3% 0.0% 0.0%
23422 0.3% 0.0% 0.3% 0.0% 0.0%
23423 0.1% 0.0% 0.1% 0.0% 0.0%
23424 0.8% 0.0% 0.8% 0.0% 0.0%
23425 0.3% 0.0% 0.3% 0.0% 0.0%
23426 0.4% 0.0% 0.4% 0.0% 0.0%
23427 0.1% 0.0% 0.1% 0.0% 0.0%
23428 0.5% 0.0% 0.5% 0.0% 0.0%
23429 0.6% 0.0% 0.6% 0.0% 0.0%
23430 0.5% 0.0% 0.5% 0.0% 0.0%
23431 0.2% 0.0% 0.2% 0.0% 0.0%
Fetch utilization 34.9%
Thread Id Fetch utilization
Thread Average 34.9%
23408 60.1%
23409 100.0%
23410 83.4%
23411 34.6%
23412 9.9%
23413 49.9%
23414 100.0%
23415 5.9%
23416 10.7%
23417 20.6%
23418 26.3%
23419 100.0%
23420 30.5%
23421 100.0%
23422 26.7%
23423 100.0%
23424 19.7%
23425 7.9%
23426 35.6%
23427 71.9%
23428 21.7%
23429 14.0%
23430 10.6%
23431 100.0%
Write-back utilization 39.1%
Thread Id Write-back utilization
Thread Average 39.1%
23408 96.1%
23409 68.6%
23410 82.5%
23411 48.7%
23412 18.5%
23413 16.2%
23414 100.0%
23415 25.9%
23416 26.1%
23417 25.3%
23418 29.0%
23419 100.0%
23420 29.5%
23421 27.2%
23422 47.1%
23423 100.0%
23424 17.1%
23425 25.6%
23426 38.3%
23427 54.6%
23428 34.9%
23429 19.2%
23430 24.9%
23431 85.5%
"octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 29.3% ]
       "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 29.3% ]
          "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 29.3% ]
             "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 29.3% ]
                "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 29.3% ]
                   "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 29.3% ]
                      "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 29.3% ]
                         "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 38.2% ]
                            "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0xf5e (0xa25f3e), node_server.cpp:433
"octotiger"!grid::compute_interactions(gsolve_type)+0x83e (0x9e74ae) [R], grid_fmm.cpp:371 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.1% 0.0% 0.1% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.1% 0.0% 0.1% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.1% 0.0% 0.1% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.1% 0.0% 0.1% 0.0% 0.0%
Fetch utilization 68.2%
Thread Id Fetch utilization
Thread Average 68.2%
23408 86.5%
23409 43.3%
23410 78.1%
23411 82.5%
23412 87.3%
23413 90.5%
23414 35.7%
23415 100.0%
23416 42.0%
23417 100.0%
23418 100.0%
23419 48.0%
23420 46.6%
23421 87.6%
23422 66.7%
23423 100.0%
23424 68.1%
23425 100.0%
23426 47.4%
23427 41.7%
23428 100.0%
23429 100.0%
23430 100.0%
23431 39.6%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 38.3% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 38.3% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0xf5e (0xa25f3e), node_server.cpp:433
"octotiger"!grid::compute_interactions(gsolve_type)+0x845 (0x9e74b5) [W], grid_fmm.cpp:371 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 34.9%
Thread Id Fetch utilization
Thread Average 34.9%
23408 60.1%
23409 100.0%
23410 83.4%
23411 34.6%
23412 9.9%
23413 49.9%
23414 100.0%
23415 5.9%
23416 10.7%
23417 20.6%
23418 26.3%
23419 100.0%
23420 30.5%
23421 100.0%
23422 26.7%
23423 100.0%
23424 19.7%
23425 7.9%
23426 35.6%
23427 71.9%
23428 21.7%
23429 14.0%
23430 10.6%
23431 100.0%
Write-back utilization 39.1%
Thread Id Write-back utilization
Thread Average 39.1%
23408 96.1%
23409 68.6%
23410 82.5%
23411 48.7%
23412 18.5%
23413 16.2%
23414 100.0%
23415 25.9%
23416 26.1%
23417 25.3%
23418 29.0%
23419 100.0%
23420 29.5%
23421 27.2%
23422 47.1%
23423 100.0%
23424 17.1%
23425 25.6%
23426 38.3%
23427 54.6%
23428 34.9%
23429 19.2%
23430 24.9%
23431 85.5%
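
The issue text suggests that the two loops could be fused. As a generic illustration of what loop fusion means (not the octotiger code; the function names and the arithmetic below are hypothetical), the sketch shows two passes over the same array merged into a single pass, so that each element of `a` is brought into the cache once instead of twice. Fusion is only valid when the second loop's iteration `i` does not depend on results that the first loop produces in later iterations.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Unfused version: two separate loops walk the same index range.
// For large n, `a` and `b` are streamed through the cache twice.
inline void update_unfused(const std::vector<double>& a,
                           std::vector<double>& b,
                           std::vector<double>& c) {
    assert(b.size() == a.size() && c.size() == a.size());
    const std::size_t n = a.size();
    for (std::size_t i = 0; i < n; ++i) {
        b[i] = 2.0 * a[i];           // first loop body
    }
    for (std::size_t i = 0; i < n; ++i) {
        c[i] += a[i] + b[i];         // second loop body
    }
}

// Fused version: both bodies run in one loop, so a[i] and b[i]
// are reused while they are still in cache (or in registers).
inline void update_fused(const std::vector<double>& a,
                         std::vector<double>& b,
                         std::vector<double>& c) {
    assert(b.size() == a.size() && c.size() == a.size());
    const std::size_t n = a.size();
    for (std::size_t i = 0; i < n; ++i) {
        b[i] = 2.0 * a[i];
        c[i] += a[i] + b[i];
    }
}
```

Both functions produce the same `b` and `c`; only the order of memory accesses differs. Whether the loops reported here can be merged this way depends on their actual data dependences, which this report does not show.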

Loop statistics, second loop #1

Accesses 2.34e+10
Thread Id Accesses
Thread Total 2.34e+10
23408 1.07e+09
23409 9.61e+08
23410 8.02e+08
23411 1.11e+09
23412 1.13e+09
23413 8.17e+08
23414 9.98e+08
23415 1.16e+09
23416 9.47e+08
23417 1.05e+09
23418 9.47e+08
23419 1.10e+09
23420 1.22e+09
23421 8.82e+08
23422 8.67e+08
23423 9.33e+08
23424 9.90e+08
23425 8.75e+08
23426 9.54e+08
23427 8.17e+08
23428 1.03e+09
23429 8.67e+08
23430 1.01e+09
23431 8.96e+08
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 6.7%
Thread Id % of misses
Thread Total 6.7%
23408 0.3%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.8%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
% of bandwidth 8.9%
Thread Id % of bandwidth
Thread Total 8.9%
23408 0.3%
23409 1.0%
23410 0.0%
23411 0.4%
23412 0.0%
23413 0.0%
23414 0.6%
23415 0.9%
23416 0.0%
23417 0.0%
23418 1.0%
23419 0.7%
23420 0.0%
23421 1.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.7%
23426 0.3%
23427 0.7%
23428 0.7%
23429 0.3%
23430 0.0%
23431 0.3%
% of fetches 6.7%
Thread Id % of fetches
Thread Total 6.7%
23408 0.3%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.8%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
% of write-backs 13.7%
Thread Id % of write-backs
Thread Total 13.7%
23408 0.5%
23409 1.6%
23410 0.0%
23411 0.6%
23412 0.0%
23413 0.0%
23414 0.9%
23415 1.2%
23416 0.0%
23417 0.0%
23418 1.6%
23419 1.0%
23420 0.0%
23421 1.5%
23422 0.0%
23423 0.0%
23424 0.0%
23425 1.0%
23426 0.5%
23427 1.0%
23428 1.0%
23429 0.5%
23430 0.0%
23431 0.5%
% of upgrades 20.8%
Thread Id % of upgrades
Thread Total 20.8%
23408 0.3%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 2.5%
23416 0.0%
23417 0.0%
23418 5.1%
23419 2.5%
23420 0.0%
23421 3.5%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.2%
23426 0.1%
23427 2.5%
23428 1.3%
23429 2.5%
23430 0.0%
23431 0.2%
Miss ratio 0.9%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 0.9% 0.0% 0.6% 0.3% 0.0%
23408 0.7% 0.0% 0.6% 0.1% 0.0%
23409 2.3% 0.0% 2.3% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.8% 0.0% 0.8% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 1.3% 0.0% 1.3% 0.0% 0.0%
23415 2.1% 0.6% 0.9% 0.6% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 2.4% 0.0% 0.9% 1.5% 0.0%
23419 1.3% 0.0% 0.7% 0.7% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 2.5% 0.0% 1.3% 1.1% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 1.7% 0.0% 1.6% 0.1% 0.0%
23426 0.8% 0.0% 0.7% 0.0% 0.0%
23427 1.8% 0.0% 0.9% 0.9% 0.0%
23428 1.4% 0.0% 1.1% 0.3% 0.0%
23429 0.8% 0.0% 0.0% 0.8% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.8% 0.0% 0.8% 0.1% 0.0%
Fetch ratio 0.9%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.9% 0.0% 0.6% 0.3% 0.0%
23408 0.7% 0.0% 0.6% 0.1% 0.0%
23409 2.3% 0.0% 2.3% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.8% 0.0% 0.8% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 1.3% 0.0% 1.3% 0.0% 0.0%
23415 2.1% 0.6% 0.9% 0.6% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 2.4% 0.0% 0.9% 1.5% 0.0%
23419 1.3% 0.0% 0.7% 0.7% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 2.5% 0.0% 1.3% 1.1% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 1.7% 0.0% 1.6% 0.1% 0.0%
23426 0.8% 0.0% 0.7% 0.0% 0.0%
23427 1.8% 0.0% 0.9% 0.9% 0.0%
23428 1.4% 0.0% 1.1% 0.3% 0.0%
23429 0.8% 0.0% 0.0% 0.8% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.8% 0.0% 0.8% 0.1% 0.0%
Write-back ratio 0.8%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 0.8% 0.0% 0.6% 0.3% 0.0%
23408 0.7% 0.0% 0.6% 0.1% 0.0%
23409 2.3% 0.0% 2.3% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.8% 0.0% 0.8% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 1.3% 0.0% 1.3% 0.0% 0.0%
23415 1.5% 0.0% 0.9% 0.6% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 2.4% 0.0% 0.8% 1.5% 0.0%
23419 1.3% 0.0% 0.7% 0.7% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 2.5% 0.0% 1.3% 1.1% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.1% 0.0% 0.1% 0.0% 0.0%
23425 1.7% 0.0% 1.6% 0.1% 0.0%
23426 0.8% 0.0% 0.7% 0.0% 0.0%
23427 1.8% 0.0% 0.9% 0.9% 0.0%
23428 1.4% 0.0% 1.1% 0.3% 0.0%
23429 0.8% 0.0% 0.0% 0.8% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.8% 0.0% 0.7% 0.1% 0.0%
Upgrade ratio 0.3%
Thread Id Upgrade ratio
Thread Average 0.3%
23408 0.1%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.6%
23416 0.0%
23417 0.0%
23418 1.5%
23419 0.7%
23420 0.0%
23421 1.1%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.1%
23426 0.0%
23427 0.9%
23428 0.3%
23429 0.8%
23430 0.0%
23431 0.1%
Communication ratio 0.5%
Thread Id Comm. ratio
Thread Average 0.5%
23408 0.1%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 1.2%
23416 0.0%
23417 0.0%
23418 3.1%
23419 1.3%
23420 0.0%
23421 2.2%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.1%
23426 0.1%
23427 1.8%
23428 0.7%
23429 1.7%
23430 0.0%
23431 0.2%
Fetch utilization 54.5%
Thread Id Fetch utilization
Thread Average 54.5%
23408 25.2%
23409 18.6%
23410 100.0%
23411 42.1%
23412 100.0%
23413 100.0%
23414 27.9%
23415 22.7%
23416 100.0%
23417 100.0%
23418 22.7%
23419 13.1%
23420 100.0%
23421 21.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 37.7%
23426 28.3%
23427 19.2%
23428 37.1%
23429 58.3%
23430 100.0%
23431 100.0%
Write-back utilization 46.9%
Thread Id Write-back utilization
Thread Average 46.9%
23408 73.7%
23409 32.4%
23410 100.0%
23411 34.6%
23412 100.0%
23413 100.0%
23414 67.1%
23415 38.5%
23416 100.0%
23417 100.0%
23418 12.7%
23419 16.2%
23420 100.0%
23421 29.3%
23422 100.0%
23423 100.0%
23424 100.0%
23425 19.6%
23426 17.6%
23427 12.7%
23428 35.3%
23429 37.7%
23430 100.0%
23431 62.3%
Communication utilization 44.4%
Thread Id Comm. utilization
Thread Average 44.4%
23408 100.0%
23409 100.0%
23410 19.1%
23411 12.5%
23412 100.0%
23413 12.5%
23414 100.0%
23415 100.0%
23416 12.7%
23417 100.0%
23418 100.0%
23419 37.5%
23420 100.0%
23421 18.6%
23422 18.7%
23423 49.0%
23424 48.7%
23425 100.0%
23426 35.5%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 35.7%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23409 Low
23410 Low
23411 Low
23412 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
23431 Low
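
The ratios above are percentages of memory accesses, so they can be turned into rough absolute counts. The sketch below assumes that the fetch ratio is the fraction of memory accesses that cause a cache line fetch; the helper name `estimated_fetches` is hypothetical and not part of any profiler API. With the loop totals reported above (2.34e+10 accesses, fetch ratio 0.9%), the estimate is about 2.1e+08 cache line fetches. Because the ratio is rounded to one decimal place, this is an order-of-magnitude figure only.

```cpp
#include <cassert>

// Estimate the number of cache line fetches from an access count and a
// fetch ratio given in percent (assumed meaning: fetches per 100 accesses).
inline double estimated_fetches(double accesses, double fetch_ratio_percent) {
    assert(accesses >= 0.0);
    assert(fetch_ratio_percent >= 0.0 && fetch_ratio_percent <= 100.0);
    return accesses * fetch_ratio_percent / 100.0;
}
```

For example, `estimated_fetches(2.34e10, 0.9)` evaluates the loop totals from this section.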

Loop instructions, second loop #1

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 21.9% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 21.9% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 23.8% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 23.8% ]
                "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 24.5% ]
                   "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 24.5% ]
                      "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 24.5% ]
                         "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 24.5% ]
                            "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 30.6% ]
                               "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 31.0% ]
                                  "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 32.6% ]
                                     "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 50.8% ]
                                        "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 50.8% ]
                                           "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329
                                              "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374
                                                 "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88
                                                    "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196
                                                       "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36
                                                          "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196
                                                             "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36
                                                                "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1a1 (0x9e40b1) [R], grid_fmm.cpp:809 6.4%
Thread Id % of misses
Thread Total 6.4%
23408 0.2%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.6%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
% of fetches 6.4%
Thread Id % of fetches
Thread Total 6.4%
23408 0.2%
23409 0.7%
23410 0.0%
23411 0.3%
23412 0.0%
23413 0.0%
23414 0.4%
23415 0.6%
23416 0.0%
23417 0.0%
23418 0.7%
23419 0.5%
23420 0.0%
23421 0.7%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.5%
23426 0.2%
23427 0.5%
23428 0.5%
23429 0.2%
23430 0.0%
23431 0.2%
Fetch ratio 2.5%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 2.5% 0.0% 1.7% 0.8% 0.0%
23408 2.0% 0.0% 1.8% 0.2% 0.0%
23409 7.5% 0.0% 7.5% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 2.4% 0.0% 2.4% 0.0% 0.0%
23412 0.1% 0.0% 0.1% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 3.9% 0.0% 3.9% 0.0% 0.0%
23415 4.3% 0.0% 2.5% 1.8% 0.0%
23416 0.1% 0.0% 0.1% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 7.8% 0.0% 2.8% 5.0% 0.0%
23419 3.5% 0.0% 1.8% 1.8% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 7.5% 0.0% 4.1% 3.4% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.1% 0.0% 0.1% 0.0% 0.0%
23424 0.2% 0.0% 0.2% 0.0% 0.0%
23425 4.6% 0.0% 4.4% 0.2% 0.0%
23426 2.1% 0.0% 2.0% 0.1% 0.0%
23427 6.3% 0.0% 3.2% 3.1% 0.0%
23428 3.9% 0.0% 3.0% 1.0% 0.0%
23429 3.1% 0.0% 0.0% 3.1% 0.0%
23430 0.1% 0.0% 0.1% 0.0% 0.0%
23431 2.5% 0.0% 2.2% 0.2% 0.0%
Fetch utilization 56.3%
Thread Id Fetch utilization
Thread Average 56.3%
23408 25.1%
23409 16.5%
23410 100.0%
23411 42.2%
23412 100.0%
23413 100.0%
23414 27.9%
23415 32.3%
23416 100.0%
23417 100.0%
23418 22.7%
23419 13.1%
23420 100.0%
23421 21.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 37.8%
23426 28.3%
23427 19.1%
23428 37.1%
23429 58.4%
23430 100.0%
23431 100.0%
Write-back utilization 46.9%
Thread Id Write-back utilization
Thread Average 46.9%
23408 73.7%
23409 32.4%
23410 100.0%
23411 34.6%
23412 100.0%
23413 100.0%
23414 67.1%
23415 38.5%
23416 100.0%
23417 100.0%
23418 12.7%
23419 16.2%
23420 100.0%
23421 29.3%
23422 100.0%
23423 100.0%
23424 100.0%
23425 19.6%
23426 17.6%
23427 12.7%
23428 35.3%
23429 37.7%
23430 100.0%
23431 62.3%
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 22.2% ]
       "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 23.1% ]
          "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 23.1% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 23.9% ]
                "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 23.9% ]
                   "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 23.9% ]
                      "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 23.9% ]
                         "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 29.9% ]
                            "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 30.6% ]
                               "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 32.7% ]
                                  "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 51.4% ]
                                     "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 51.4% ]
                                        "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329
                                           "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374
                                              "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88
                                                 "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196
                                                    "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36
                                                       "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196
                                                          "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36
                                                             "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1a6 (0x9e40b6) [R], grid_fmm.cpp:809 0.3%
Thread Id % of misses
Thread Total 0.3%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.3%
Thread Id % of fetches
Thread Total 0.3%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.2%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.1%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.1% 0.1% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 2.0% 2.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 9.7%
Thread Id Fetch utilization
Thread Average 9.7%
23408 29.1%
23409 100.0%
23410 66.3%
23411 31.5%
23412 100.0%
23413 18.8%
23414 34.1%
23415 0.2%
23416 47.9%
23417 68.0%
23418 23.5%
23419 23.7%
23420 40.0%
23421 41.8%
23422 32.7%
23423 56.3%
23424 42.2%
23425 29.0%
23426 55.1%
23427 56.9%
23428 37.7%
23429 31.9%
23430 32.6%
23431 53.1%
Write-back utilization 100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23408 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23412 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
23431 100.0%
"octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 21.8% ]
       "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x11e (0x9597fe), node_server_actions_3.cpp:483 [ 21.8% ]
          "octotiger"!node_server::compute_fmm(gsolve_type, bool)+0x6bb (0xa2569b), packaged_continuation.hpp:430 [ 23.9% ]
             "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x98655b), future_data.hpp:552 [ 23.9% ]
                "octotiger"!hpx::lcos::detail::future_data<node_server::neighbor_gravity_type>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x98634a), basic_function.hpp:196 [ 24.6% ]
                   "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<node_server::neighbor_gravity_type> > const&)+0x11c (0xa23adc), packaged_continuation.hpp:105 [ 24.6% ]
                      "octotiger"!void hpx::lcos::detail::invoke_continuation<node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, hpx::lcos::future<node_server::neighbor_gravity_type>, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void> >(node_server::compute_fmm(gsolve_type, bool)::{lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}&, hpx::lcos::future<node_server::neighbor_gravity_type>&, hpx::lcos::detail::continuation<hpx::lcos::future<node_server::neighbor_gravity_type>, {lambda(hpx::lcos::future<node_server::neighbor_gravity_type>)#2}, void>&, std::integral_constant<bool, true>) [clone .isra.564] [clone .constprop.1360]+0x125 (0xa237c5), node_server.cpp:444 [ 24.6% ]
                         "octotiger"!grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)+0x49c (0x9e64dc), wait_all.hpp:329 [ 24.6% ]
                            "octotiger"!void hpx::lcos::wait_all<hpx::lcos::future<void> >(std::vector<hpx::lcos::future<void>, std::allocator<hpx::lcos::future<void> > > const&)+0x2bf (0x88828f), wait_all.hpp:306 [ 30.6% ]
                               "octotiger"!hpx::lcos::detail::future_data<void>::wait(hpx::error_code&)+0xb4 (0x85b074), future_data.hpp:567 [ 30.9% ]
                                  "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 32.5% ]
                                     "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472 [ 50.5% ]
                                        "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374 [ 50.5% ]
                                           "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329
                                              "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374
                                                 "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88
                                                    "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196
                                                       "libhpx.so.1.0.0"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*)(hpx::util::unique_function<void (), false>)> (hpx::util::unique_function<void (), false>&&)> >(void**, hpx::threads::thread_state_ex_enum&&)+0x46 (0x7faa7f4efdf6), invoke.hpp:36
                                                          "libhpx.so.1.0.0"!hpx::applier::thread_function_nullary(hpx::util::unique_function<void (), false>)+0xe (0x7faa7f5795be), basic_function.hpp:196
                                                             "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::deferred<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (*(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >&&))(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)> >(void**)+0x22 (0x84a452), invoke.hpp:36
                                                                "octotiger"!hpx::lcos::detail::task_base<void>::run_impl(boost::intrusive_ptr<hpx::lcos::detail::task_base<void> >)+0xd (0x84a3dd), future_data.hpp:782
"octotiger"!hpx::lcos::local::detail::task_object<void, hpx::util::detail::deferred<hpx::parallel::util::detail::partitioner_iteration<void, hpx::parallel::v2::detail::part_iterations<grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&, int, hpx::util::tuple<> > >& (grid::compute_boundary_interactions_monopole_monopole(gsolve_type, std::vector<boundary_interaction_type, std::allocator<boundary_interaction_type> > const&, gravity_boundary_type const&)::{lambda(unsigned long)#1}&<unsigned long, unsigned long, unsigned long> const&)>, hpx::lcos::detail::task_base<void> >::do_run()+0x1ac (0x9e40bc) [W], grid_fmm.cpp:809 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23412 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
23431 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23413 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
23431 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch utilization 56.3%
Thread Id Fetch utilization
Thread Average 56.3%
23408 25.1%
23409 16.5%
23410 100.0%
23411 42.2%
23412 100.0%
23413 100.0%
23414 27.9%
23415 32.3%
23416 100.0%
23417 100.0%
23418 22.7%
23419 13.1%
23420 100.0%
23421 21.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 37.8%
23426 28.3%
23427 19.1%
23428 37.1%
23429 58.4%
23430 100.0%
23431 100.0%
Write-back utilization 46.9%
Thread Id Write-back utilization
Thread Average 46.9%
23408 73.7%
23409 32.4%
23410 100.0%
23411 34.6%
23412 100.0%
23413 100.0%
23414 67.1%
23415 38.5%
23416 100.0%
23417 100.0%
23418 12.7%
23419 16.2%
23420 100.0%
23421 29.3%
23422 100.0%
23423 100.0%
23424 100.0%
23425 19.6%
23426 17.6%
23427 12.7%
23428 35.3%
23429 37.7%
23430 100.0%
23431 62.3%

Copyright (c) 2006-2012 Rogue Wave Software, Inc. All Rights Reserved.
Patents pending.