Monitoring Performance Metrics in Low-Power Wireless Systems

Fabian Graf, Thomas Watteyne, Michael Villnow

Abstract:

Application Performance Monitoring (APM) is key for ensuring computer systems perform well. While most APM tools target servers and networking infrastructure, here we focus on APM for devices with strict resource constraints: extremely limited in terms of power, memory and bandwidth. We tailor this article to be both a survey and a tutorial. In the survey part, we investigate APM approaches for low-power wireless networks, with a particular focus on Time Synchronized Channel Hopping solutions, as they are well-suited for critical industrial applications. We survey performance metrics characterizing the network health condition and show how, to capture the health of a network universally, it is important to constantly monitor hardware-related, network-related and network-wide metrics. We present a collection of metrics that serves as a checklist for the design of an APM system, describe related work on APM concepts suitable for low-power wireless system, and provide core concepts for collecting, exporting and processing performance metrics. The tutorial part consists of a hands-on example of running commercial APM and networking solutions. We use the active APM framework from Memfault, which periodically creates heartbeats including the performance metrics. We run this framework on top of the SmartMesh IP protocol stack, a commercial product by Analog Devices that offers wired-like high reliability and a decade of battery lifetime, and integrate it with the Zephyr operating systems. This tutorial allows the readership to experiment with a complete ready-to-deploy mote-to-cloud APM chain.

 

Read the paper: https://hal.science/hal-04668822