Issue #45: Loop fusion

It may be possible to merge the bodies of loop #172 and loop #68 by moving the first loop down or the second loop up.

Statistics for fusible instruction group, second loop #68

Accesses 1.88e+08
Thread Id Accesses
Thread Total 1.88e+08
23409 0.00e+00
23410 2.17e+07
23411 0.00e+00
23413 7.23e+06
23414 2.17e+07
23415 1.45e+07
23416 0.00e+00
23417 0.00e+00
23418 2.17e+07
23419 7.23e+06
23420 0.00e+00
23421 1.45e+07
23422 2.89e+07
23423 1.45e+07
23424 0.00e+00
23425 0.00e+00
23426 0.00e+00
23427 1.45e+07
23428 1.45e+07
23429 7.23e+06
23430 0.00e+00
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 0.7%
Thread Id % of misses
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of bandwidth 0.9%
Thread Id % of bandwidth
Thread Total 0.9%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.3%
23414 0.0%
23415 0.1%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.1%
23422 0.4%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of fetches 0.7%
Thread Id % of fetches
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of write-backs 1.4%
Thread Id % of write-backs
Thread Total 1.4%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.5%
23414 0.0%
23415 0.1%
23416 0.0%
23417 0.0%
23418 0.1%
23419 0.0%
23420 0.0%
23421 0.1%
23422 0.6%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of upgrades 0.0%
Thread Id % of upgrades
Thread Total 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Miss ratio 10.7%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 10.7% 0.0% 10.7% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 8.0% 0.0% 8.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.8% 0.0% 3.8% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 1.5% 0.0% 1.5% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch ratio 10.7%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 10.7% 0.0% 10.7% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 8.0% 0.0% 8.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.8% 0.0% 3.8% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 1.5% 0.0% 1.5% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
Write-back ratio 10.7%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 10.7% 0.0% 10.7% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 8.0% 0.0% 8.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.8% 0.0% 3.8% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 1.5% 0.0% 1.5% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
Upgrade ratio 0.0%
Thread Id Upgrade ratio
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Communication ratio 0.0%
Thread Id Comm. ratio
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Fetch utilization 0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23409 100.0%
23410 0.0%
23411 100.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%
Write-back utilization 40.5%
Thread Id Write-back utilization
Thread Average 40.5%
23409 100.0%
23410 100.0%
23411 100.0%
23413 12.5%
23414 100.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 20.3%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%
Communication utilization 100.0%
Thread Id Comm. utilization
Thread Average 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23409 Low
23410 Low
23411 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low
Worst instruction "octotiger"!grid::compute_fluxes()+0x428 (0x9ce998) [W], grid.cpp:1733

Fusible instruction group, first loop #172

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!node_server::regrid_gather(bool)+0x532 (0x83e432), node_server_actions_1.cpp:249 [ 70.0% ]
       "octotiger"!node_client::operator=(hpx::lcos::future<hpx::naming::id_type>&&)+0x18 (0xa17648), node_client.cpp:19 [ 70.0% ]
          "octotiger"!hpx::lcos::future<hpx::naming::id_type>::get()+0x1f (0x8f9d7f), future.hpp:904 [ 70.0% ]
             "octotiger"!hpx::lcos::detail::future_data<hpx::naming::id_type>::get_result(hpx::error_code&)+0x12 (0x8526b2), future_data.hpp:297 [ 70.0% ]
                "octotiger"!hpx::lcos::detail::future_data<hpx::naming::id_type>::wait(hpx::error_code&)+0xb4 (0x85a3b4), future_data.hpp:567 [ 70.0% ]
                   "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 81.0% ]
                      "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472
                         "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374
                            "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329
                               "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374
                                  "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88
                                     "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196
                                        "octotiger"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::actions::detail::continuation_thread_function<hpx::components::server::create_component_action<node_server, node_location, node_client, double, double>, hpx::actions::basic_action<hpx::components::server::runtime_support, hpx::naming::gid_type (node_location, node_client, double, double), hpx::components::server::create_component_action<node_server, node_location, node_client, double, double> >::invoker, unsigned long&, node_location&&, node_client&, double&, double&> >(void**, hpx::threads::thread_state_ex_enum&&)+0xfc (0x846acc), trigger.hpp:128
                                           "octotiger"!void hpx::actions::detail::trigger_impl<hpx::naming::id_type, hpx::naming::gid_type, hpx::util::detail::deferred<hpx::actions::basic_action<hpx::components::server::runtime_support, hpx::naming::gid_type (node_location, node_client, double, double), hpx::components::server::create_component_action<node_server, node_location, node_client, double, double> >::invoker (unsigned long&, node_location&&, node_client&&, double&&, double&&)>&>(std::integral_constant<bool, false>, hpx::actions::typed_continuation<hpx::naming::id_type, hpx::naming::gid_type>&&, hpx::util::detail::deferred<hpx::actions::basic_action<hpx::components::server::runtime_support, hpx::naming::gid_type (node_location, node_client, double, double), hpx::components::server::create_component_action<node_server, node_location, node_client, double, double> >::invoker (unsigned long&, node_location&&, node_client&&, double&&, double&&)>&)+0x74 (0x8dd394), component_action.hpp:64
                                              "octotiger"!
                                                 "octotiger"!hpx::components::component_factory<hpx::components::managed_component<node_server, hpx::components::detail::this_type> >::create_with_args(hpx::util::unique_function<void (void*), false> const&)+0x28 (0xa2c378), component_factory.hpp:197
                                                    "octotiger"!hpx::naming::gid_type hpx::components::server::create<hpx::components::managed_component<node_server, hpx::components::detail::this_type> >(hpx::util::unique_function<void (void*), false> const&)+0x75 (0xa2c1a5), basic_function.hpp:196
                                                       "octotiger"!void hpx::util::detail::callable_vtable<void (void*)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<hpx::util::functional::placement_new<hpx::components::managed_component<node_server, hpx::components::detail::this_type> > > (hpx::util::detail::placeholder<1ul> const&, node_location&&, node_client&&, double&&, double&&)> >(void**, void*&&)+0x53 (0x841433), managed_component_base.hpp:71
                                                          "octotiger"!node_server::node_server(node_location const&, node_client const&, double, double)+0x414 (0xa1ff14), node_server.cpp:361
                                                             "octotiger"!node_server::initialize(double, double)+0x314 (0xa1eac4), new_allocator.h:120
                                                                "octotiger"!grid::grid(std::function<std::vector<double, std::allocator<double> > (double, double, double, double)> const&, double, std::array<double, 3ul>)+0x4de (0x9caf4e), grid.cpp:1364
                                                                   "octotiger"!grid::allocate()+0x3d5 (0x9ca455), stl_vector.h:676
"octotiger"!std::vector<double, std::allocator<double> >::_M_default_append(unsigned long)+0xc0 (0x8982b0) [W], stl_algobase.h:766 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 100.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
0.0%
Thread Id Write-back utilization
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 100.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%

Fusible instruction group, second loop #68

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x81 (0x7faa7f5509f1), context_linux_x86.hpp:374 [ 26.8% ]
       "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 26.8% ]
          "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 26.8% ]
             "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x1f1 (0x9598d1), packaged_continuation.hpp:138 [ 56.9% ]
                "octotiger"!hpx::lcos::detail::future_data<void>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x867f5b), future_data.hpp:552 [ 56.9% ]
                   "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196 [ 59.4% ]
                      "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::bound<hpx::lcos::detail::transfer_result<hpx::lcos::future<void> > (boost::intrusive_ptr<hpx::lcos::detail::future_data<void> >&, boost::intrusive_ptr<hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >&)> >(void**)+0x36 (0x9551e6), packaged_continuation.hpp:51 [ 59.4% ]
                         "octotiger"!void hpx::lcos::detail::future_data<void>::set_value<hpx::util::unused_type>(hpx::util::unused_type&&, hpx::error_code&)+0x18f (0x887a1f), future_data.hpp:430 [ 59.4% ]
                            "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196 [ 59.4% ]
                               "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::bound<hpx::lcos::detail::transfer_result<hpx::lcos::future<void> > (boost::intrusive_ptr<hpx::lcos::detail::future_data<void> >&, boost::intrusive_ptr<hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >&)> >(void**)+0x36 (0x955306), packaged_continuation.hpp:51 [ 59.4% ]
                                  "octotiger"!void hpx::lcos::detail::future_data<void>::set_value<hpx::util::unused_type>(hpx::util::unused_type&&, hpx::error_code&)+0x18f (0x887a1f), future_data.hpp:430 [ 59.4% ]
                                     "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196
                                        "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<void> > const&)+0x11c (0x95edbc), packaged_continuation.hpp:210
                                           "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util::result_of&, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2244]+0x48 (0x95e518), node_server_actions_3.cpp:456
"octotiger"!grid::compute_fluxes()+0x428 (0x9ce998) [W], grid.cpp:1733 0.7%
Thread Id % of misses
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
0.7%
Thread Id % of fetches
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
10.7%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 10.7% 0.0% 10.7% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 8.0% 0.0% 8.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.8% 0.0% 3.8% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 1.5% 0.0% 1.5% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23409 100.0%
23410 0.0%
23411 100.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%
40.5%
Thread Id Write-back utilization
Thread Average 40.5%
23409 100.0%
23410 100.0%
23411 100.0%
23413 12.5%
23414 100.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 20.3%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%

Loop statistics, first loop #172

Accesses 5.06e+07
Thread Id Accesses
Thread Total 5.06e+07
23408 7.23e+06
23412 7.23e+06
23415 0.00e+00
23417 7.23e+06
23418 7.23e+06
23420 7.23e+06
23428 7.23e+06
23429 7.23e+06
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
% of bandwidth 0.0%
Thread Id % of bandwidth
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
% of fetches 0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
% of write-backs 0.0%
Thread Id % of write-backs
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
% of upgrades 0.0%
Thread Id % of upgrades
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
Miss ratio 0.0%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch ratio 0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
Write-back ratio 0.0%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
Upgrade ratio 0.0%
Thread Id Upgrade ratio
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
Communication ratio 0.0%
Thread Id Comm. ratio
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
Fetch utilization 0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 100.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
Write-back utilization 0.0%
Thread Id Write-back utilization
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 100.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
Communication utilization 100.0%
Thread Id Comm. utilization
Thread Average 100.0%
23408 100.0%
23412 100.0%
23415 100.0%
23417 100.0%
23418 100.0%
23420 100.0%
23428 100.0%
23429 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23408 Low
23412 Low
23415 Low
23417 Low
23418 Low
23420 Low
23428 Low
23429 Low

Loop instructions, first loop #172

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"octotiger"!node_server::regrid_gather(bool)+0x532 (0x83e432), node_server_actions_1.cpp:249 [ 70.0% ]
       "octotiger"!node_client::operator=(hpx::lcos::future<hpx::naming::id_type>&&)+0x18 (0xa17648), node_client.cpp:19 [ 70.0% ]
          "octotiger"!hpx::lcos::future<hpx::naming::id_type>::get()+0x1f (0x8f9d7f), future.hpp:904 [ 70.0% ]
             "octotiger"!hpx::lcos::detail::future_data<hpx::naming::id_type>::get_result(hpx::error_code&)+0x12 (0x8526b2), future_data.hpp:297 [ 70.0% ]
                "octotiger"!hpx::lcos::detail::future_data<hpx::naming::id_type>::wait(hpx::error_code&)+0xb4 (0x85a3b4), future_data.hpp:567 [ 70.0% ]
                   "libhpx.so.1.0.0"!hpx::lcos::local::detail::condition_variable::wait(std::unique_lock<hpx::lcos::local::spinlock>&, char const*, hpx::error_code&)+0xbf (0x7faa7fa0a8ef), thread_helpers.hpp:499 [ 81.0% ]
                      "libhpx.so.1.0.0"!hpx::this_thread::suspend(hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> const&, hpx::util::thread_description const&, hpx::error_code&)+0xf8 (0x7faa7f577fe8), thread_helpers.cpp:472
                         "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_self::yield(std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> >)+0xbc (0x7faa7f4f7f6c), context_linux_x86.hpp:374
                            "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329
                               "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374
                                  "libhpx.so.1.0.0"!void hpx::threads::coroutines::detail::lx::trampoline<hpx::threads::coroutines::detail::coroutine_impl>(hpx::threads::coroutines::detail::coroutine_impl*)+0x9 (0x7faa7f466e09), context_linux_x86.hpp:88
                                     "libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x12b (0x7faa7f550a9b), basic_function.hpp:196
                                        "octotiger"!std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > hpx::util::detail::callable_vtable<std::pair<hpx::threads::thread_state_enum, boost::intrusive_ptr<hpx::threads::thread_data> > (hpx::threads::thread_state_ex_enum)>::_invoke<hpx::actions::detail::continuation_thread_function<hpx::components::server::create_component_action<node_server, node_location, node_client, double, double>, hpx::actions::basic_action<hpx::components::server::runtime_support, hpx::naming::gid_type (node_location, node_client, double, double), hpx::components::server::create_component_action<node_server, node_location, node_client, double, double> >::invoker, unsigned long&, node_location&&, node_client&, double&, double&> >(void**, hpx::threads::thread_state_ex_enum&&)+0xfc (0x846acc), trigger.hpp:128
                                           "octotiger"!void hpx::actions::detail::trigger_impl<hpx::naming::id_type, hpx::naming::gid_type, hpx::util::detail::deferred<hpx::actions::basic_action<hpx::components::server::runtime_support, hpx::naming::gid_type (node_location, node_client, double, double), hpx::components::server::create_component_action<node_server, node_location, node_client, double, double> >::invoker (unsigned long&, node_location&&, node_client&&, double&&, double&&)>&>(std::integral_constant<bool, false>, hpx::actions::typed_continuation<hpx::naming::id_type, hpx::naming::gid_type>&&, hpx::util::detail::deferred<hpx::actions::basic_action<hpx::components::server::runtime_support, hpx::naming::gid_type (node_location, node_client, double, double), hpx::components::server::create_component_action<node_server, node_location, node_client, double, double> >::invoker (unsigned long&, node_location&&, node_client&&, double&&, double&&)>&)+0x74 (0x8dd394), component_action.hpp:64
                                              "octotiger"!
                                                 "octotiger"!hpx::components::component_factory<hpx::components::managed_component<node_server, hpx::components::detail::this_type> >::create_with_args(hpx::util::unique_function<void (void*), false> const&)+0x28 (0xa2c378), component_factory.hpp:197
                                                    "octotiger"!hpx::naming::gid_type hpx::components::server::create<hpx::components::managed_component<node_server, hpx::components::detail::this_type> >(hpx::util::unique_function<void (void*), false> const&)+0x75 (0xa2c1a5), basic_function.hpp:196
                                                       "octotiger"!void hpx::util::detail::callable_vtable<void (void*)>::_invoke<hpx::util::detail::bound<hpx::util::detail::one_shot_wrapper<hpx::util::functional::placement_new<hpx::components::managed_component<node_server, hpx::components::detail::this_type> > > (hpx::util::detail::placeholder<1ul> const&, node_location&&, node_client&&, double&&, double&&)> >(void**, void*&&)+0x53 (0x841433), managed_component_base.hpp:71
                                                          "octotiger"!node_server::node_server(node_location const&, node_client const&, double, double)+0x414 (0xa1ff14), node_server.cpp:361
                                                             "octotiger"!node_server::initialize(double, double)+0x314 (0xa1eac4), new_allocator.h:120
                                                                "octotiger"!grid::grid(std::function<std::vector<double, std::allocator<double> > (double, double, double, double)> const&, double, std::array<double, 3ul>)+0x4de (0x9caf4e), grid.cpp:1364
                                                                   "octotiger"!grid::allocate()+0x3d5 (0x9ca455), stl_vector.h:676
"octotiger"!std::vector<double, std::allocator<double> >::_M_default_append(unsigned long)+0xc0 (0x8982b0) [W], stl_algobase.h:766 0.0%
Thread Id % of misses
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
0.0%
Thread Id % of fetches
Thread Total 0.0%
23408 0.0%
23412 0.0%
23415 0.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23408 0.0% 0.0% 0.0% 0.0% 0.0%
23412 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.2% 0.0% 0.2% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 100.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%
0.0%
Thread Id Write-back utilization
Thread Average 0.0%
23408 0.0%
23412 0.0%
23415 100.0%
23417 0.0%
23418 0.0%
23420 0.0%
23428 0.0%
23429 0.0%

Loop statistics, second loop #68

Accesses 3.25e+08
Thread Id Accesses
Thread Total 3.25e+08
23409 1.45e+07
23410 2.17e+07
23411 7.23e+06
23413 7.23e+06
23414 2.89e+07
23415 2.17e+07
23416 0.00e+00
23417 0.00e+00
23418 3.61e+07
23419 7.23e+06
23420 7.23e+06
23421 1.45e+07
23422 2.89e+07
23423 3.61e+07
23424 7.23e+06
23425 7.23e+06
23426 7.23e+06
23427 1.45e+07
23428 2.89e+07
23429 2.17e+07
23430 7.23e+06
Fetch/Miss ratio
Write-back ratio
Utilization
% of misses 0.7%
Thread Id % of misses
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of bandwidth 0.9%
Thread Id % of bandwidth
Thread Total 0.9%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.3%
23414 0.0%
23415 0.1%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.1%
23422 0.4%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of fetches 0.7%
Thread Id % of fetches
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of write-backs 1.4%
Thread Id % of write-backs
Thread Total 1.4%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.5%
23414 0.0%
23415 0.1%
23416 0.0%
23417 0.0%
23418 0.1%
23419 0.0%
23420 0.0%
23421 0.1%
23422 0.6%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
% of upgrades 0.0%
Thread Id % of upgrades
Thread Total 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Miss ratio 6.2%
Thread Id Total Miss ratio Uncategorized Replacement Coherence Flush
Thread Average 6.2% 0.0% 6.2% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 5.3% 0.0% 5.3% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 2.3% 0.0% 2.3% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.3% 0.0% 0.3% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.8% 0.0% 0.8% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
Fetch ratio 6.2%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 6.2% 0.0% 6.2% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 5.3% 0.0% 5.3% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 2.3% 0.0% 2.3% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.3% 0.0% 0.3% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.8% 0.0% 0.8% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
Write-back ratio 6.2%
Thread Id Total Write-back ratio Uncategorized Replacement Coherence Flush
Thread Average 6.2% 0.0% 6.2% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 5.3% 0.0% 5.3% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 2.3% 0.0% 2.3% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.8% 0.0% 0.8% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
Upgrade ratio 0.0%
Thread Id Upgrade ratio
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Communication ratio 0.0%
Thread Id Comm. ratio
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Fetch utilization 0.7%
Thread Id Fetch utilization
Thread Average 0.7%
23409 100.0%
23410 0.0%
23411 100.0%
23413 0.0%
23414 0.9%
23415 1.6%
23416 100.0%
23417 100.0%
23418 1.4%
23419 0.0%
23420 100.0%
23421 0.0%
23422 0.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 31.9%
23427 0.0%
23428 5.9%
23429 100.0%
23430 100.0%
Write-back utilization 40.5%
Thread Id Write-back utilization
Thread Average 40.5%
23409 100.0%
23410 100.0%
23411 100.0%
23413 12.5%
23414 100.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 20.3%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%
Communication utilization 100.0%
Thread Id Comm. utilization
Thread Average 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23413 100.0%
23414 100.0%
23415 100.0%
23416 100.0%
23417 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
False sharing ratio 0.0%
Thread Id F-S. ratio
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
HW prefetch probability 0.0%
Thread Id HW prefetch probability
Thread Average 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
Access randomness Low
Thread Id Access randomness
Thread Average Low
23409 Low
23410 Low
23411 Low
23413 Low
23414 Low
23415 Low
23416 Low
23417 Low
23418 Low
23419 Low
23420 Low
23421 Low
23422 Low
23423 Low
23424 Low
23425 Low
23426 Low
23427 Low
23428 Low
23429 Low
23430 Low

Loop instructions, second loop #68

Stack Instruction % of misses % of fetches Fetch ratio Fetch utilization W-B Utilization
"libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x81 (0x7faa7f5509f1), context_linux_x86.hpp:374 [ 32.9% ]
       "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 32.9% ]
          "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 32.9% ]
             "octotiger"!node_server::nonrefined_step()+0x1f0 (0x9583b0), packaged_continuation.hpp:430 [ 56.4% ]
                "octotiger"!hpx::lcos::detail::future_data<void>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x867f5b), future_data.hpp:552 [ 56.4% ]
                   "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196
                      "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<void> > const&)+0x11c (0x95edbc), packaged_continuation.hpp:210
                         "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util::result_of&, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2244]+0x48 (0x95e518), node_server_actions_3.cpp:456
"octotiger"!grid::compute_fluxes()+0x420 (0x9ce990) [R], grid.cpp:1733 0.0%
Thread Id % of misses
Thread Total 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23414 0.0%
23415 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
0.0%
Thread Id % of fetches
Thread Total 0.0%
23409 0.0%
23410 0.0%
23411 0.0%
23414 0.0%
23415 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
0.0%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 0.0% 0.0% 0.0% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 0.0% 0.0% 0.0% 0.0% 0.0%
23418 0.0% 0.0% 0.0% 0.0% 0.0%
23419 0.0% 0.0% 0.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 0.0% 0.0% 0.0% 0.0% 0.0%
23422 0.0% 0.0% 0.0% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.3% 0.0% 0.3% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 0.0% 0.0% 0.0% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
100.0%
Thread Id Fetch utilization
Thread Average 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23414 100.0%
23415 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 31.9%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
100.0%
Thread Id Write-back utilization
Thread Average 100.0%
23409 100.0%
23410 100.0%
23411 100.0%
23414 100.0%
23415 100.0%
23418 100.0%
23419 100.0%
23420 100.0%
23421 100.0%
23422 100.0%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 100.0%
23428 100.0%
23429 100.0%
23430 100.0%
"libhpx.so.1.0.0"!hpx::threads::coroutines::detail::coroutine_impl::operator()()+0x81 (0x7faa7f5509f1), context_linux_x86.hpp:374 [ 26.8% ]
       "libhpx.so.1.0.0"!void hpx::threads::detail::scheduling_loop<hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo> >(unsigned long, hpx::threads::policies::local_priority_queue_scheduler<boost::mutex, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_fifo, hpx::threads::policies::lockfree_lifo>&, hpx::threads::detail::scheduling_counters&, hpx::threads::detail::scheduling_callbacks&)+0x21c (0x7faa7f508b3c), scheduling_loop.hpp:329 [ 26.8% ]
          "libhpx.so.1.0.0"!hpx::threads::thread_data::operator()()+0xcd (0x7faa7f50331d), context_linux_x86.hpp:374 [ 26.8% ]
             "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util&, hpx::util::result_of&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2248]+0x1f1 (0x9598d1), packaged_continuation.hpp:138 [ 56.9% ]
                "octotiger"!hpx::lcos::detail::future_data<void>::set_on_completed(hpx::util::unique_function<void (), false>)+0xeb (0x867f5b), future_data.hpp:552 [ 56.9% ]
                   "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196 [ 59.4% ]
                      "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::bound<hpx::lcos::detail::transfer_result<hpx::lcos::future<void> > (boost::intrusive_ptr<hpx::lcos::detail::future_data<void> >&, boost::intrusive_ptr<hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}::operator()(hpx::lcos::future<void>) const::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >&)> >(void**)+0x36 (0x9551e6), packaged_continuation.hpp:51 [ 59.4% ]
                         "octotiger"!void hpx::lcos::detail::future_data<void>::set_value<hpx::util::unused_type>(hpx::util::unused_type&&, hpx::error_code&)+0x18f (0x887a1f), future_data.hpp:430 [ 59.4% ]
                            "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196 [ 59.4% ]
                               "octotiger"!void hpx::util::detail::callable_vtable<void ()>::_invoke<hpx::util::detail::bound<hpx::lcos::detail::transfer_result<hpx::lcos::future<void> > (boost::intrusive_ptr<hpx::lcos::detail::future_data<void> >&, boost::intrusive_ptr<hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >&)> >(void**)+0x36 (0x955306), packaged_continuation.hpp:51 [ 59.4% ]
                                  "octotiger"!void hpx::lcos::detail::future_data<void>::set_value<hpx::util::unused_type>(hpx::util::unused_type&&, hpx::error_code&)+0x18f (0x887a1f), future_data.hpp:430 [ 59.4% ]
                                     "octotiger"!hpx::lcos::detail::future_data<void>::handle_on_completed(hpx::util::unique_function<void (), false>&&)+0x29a (0x85b9ea), basic_function.hpp:196
                                        "octotiger"!hpx::lcos::detail::continuation<hpx::lcos::future<void>, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >::run(boost::intrusive_ptr<hpx::lcos::detail::future_data<void> > const&)+0x11c (0x95edbc), packaged_continuation.hpp:210
                                           "octotiger"!std::enable_if<hpx::traits::detail::is_unique_future<hpx::util::result_of<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1} (hpx::lcos::future<void>)>::type, void>::value, void>::type hpx::lcos::detail::invoke_continuation<node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void>, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> > >(hpx::util::result_of&, node_server::nonrefined_step()::{lambda(hpx::lcos::future<void>)#1}&, hpx::lcos::detail::continuation<hpx::lcos::future<void>, {lambda(hpx::lcos::future<void>)#1}, hpx::lcos::future<void> >&) [clone .constprop.2244]+0x48 (0x95e518), node_server_actions_3.cpp:456
"octotiger"!grid::compute_fluxes()+0x428 (0x9ce998) [W], grid.cpp:1733 0.7%
Thread Id % of misses
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
0.7%
Thread Id % of fetches
Thread Total 0.7%
23409 0.0%
23410 0.0%
23411 0.0%
23413 0.2%
23414 0.0%
23415 0.0%
23416 0.0%
23417 0.0%
23418 0.0%
23419 0.0%
23420 0.0%
23421 0.0%
23422 0.3%
23423 0.0%
23424 0.0%
23425 0.0%
23426 0.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 0.0%
10.7%
Thread Id Total Fetch ratio Uncategorized Replacement Coherence Flush
Thread Average 10.7% 0.0% 10.7% 0.0% 0.0%
23409 0.0% 0.0% 0.0% 0.0% 0.0%
23410 0.0% 0.0% 0.0% 0.0% 0.0%
23411 0.0% 0.0% 0.0% 0.0% 0.0%
23413 100.0% 0.0% 100.0% 0.0% 0.0%
23414 0.0% 0.0% 0.0% 0.0% 0.0%
23415 8.0% 0.0% 8.0% 0.0% 0.0%
23416 0.0% 0.0% 0.0% 0.0% 0.0%
23417 0.0% 0.0% 0.0% 0.0% 0.0%
23418 3.8% 0.0% 3.8% 0.0% 0.0%
23419 8.0% 0.0% 8.0% 0.0% 0.0%
23420 0.0% 0.0% 0.0% 0.0% 0.0%
23421 8.0% 0.0% 8.0% 0.0% 0.0%
23422 30.8% 0.0% 30.8% 0.0% 0.0%
23423 0.0% 0.0% 0.0% 0.0% 0.0%
23424 0.0% 0.0% 0.0% 0.0% 0.0%
23425 0.0% 0.0% 0.0% 0.0% 0.0%
23426 0.0% 0.0% 0.0% 0.0% 0.0%
23427 0.0% 0.0% 0.0% 0.0% 0.0%
23428 1.5% 0.0% 1.5% 0.0% 0.0%
23429 0.0% 0.0% 0.0% 0.0% 0.0%
23430 0.0% 0.0% 0.0% 0.0% 0.0%
0.0%
Thread Id Fetch utilization
Thread Average 0.0%
23409 100.0%
23410 0.0%
23411 100.0%
23413 0.0%
23414 0.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 0.0%
23423 0.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%
40.5%
Thread Id Write-back utilization
Thread Average 40.5%
23409 100.0%
23410 100.0%
23411 100.0%
23413 12.5%
23414 100.0%
23415 0.0%
23416 100.0%
23417 100.0%
23418 0.0%
23419 0.0%
23420 100.0%
23421 0.0%
23422 20.3%
23423 100.0%
23424 100.0%
23425 100.0%
23426 100.0%
23427 0.0%
23428 0.0%
23429 0.0%
23430 100.0%

Copyright (c) 2006-2012 Rogue Wave Software, Inc. All Rights Reserved.
Patents pending.