Serialization - mjbots blog

In parts 1, 2, and 3 I covered some motivation for the updated mjlib diagnostics system and the serialization of individual structures. In this post, I’ll cover how those structures are written into a file from an embedded system like a robot and how diagnostic tools can access them efficiently.

Goals

The top level goals are:

Efficient to write live from an embedded system: The quad A1 generates log data currently at 400Hz, consisting of hundreds to thousands of telemetry data points in every update. It does this on a relatively low-end raspberry pi 3b+. The format should be able to support writing data at high rates without a significant CPU burden.
Efficient seeking by time and record: Readers of the file should be able to efficiently seek by time in the stream, as well as extract all of a single record without having to process unnecessary data from the log.
Self contained: While this property in the log comes from the underlying mjlib serialization format, it is worth re-iterating here. All information necessary to return a JSON or CSV like structure for each instance should be present within the log.

Design

The detailed design of the log format is documented at README.md, here I will give a brief summary.

In the previous issue in this series, I described the schema and data elements of the mjlib serialization format. Here, I’ll describe the API used to convert between C++ structures and the corresponding schema and data serializations.

First, I’ll start by saying this API is far from perfect. It hits a certain tradeoff in the design space that may not be appropriate for every system. I have developed and used similar APIs professionally both at Jaybridge and TRI, so it has seen use in millions of lines of code, but not billions by any stretch. It is also mostly orthogonal to the rest of the design, and alternate serialization APIs could be built while still maintaining the performance and schema evolution properties described in parts 1 and 2. Now with that out of the way, the library API:

As discussed previously, I recently significantly revised the serialization format used by the mjbots quad A1 based on experience in previous professional domains, and from studying newer external projects like Apache AVRO. Here I’ll describe the design of the serialized representation, which is more completely defined at: mjlib/telemetry/README.md

Refresher and definitions

As a brief refresher, this serialization format is intended to be used primarily to record telemetry from embedded systems, where that telemetry data may be persisted on disk for a long time. Secondarily, it can be used to inspect the results of a live system. The primitive it operates on is a “record”, which is logically a structure of elements which is emitted at some intervals over time. For any given record, it logically breaks it up into a “schema” and a “data” portion. The schema describes what types of elements are present in the structure, their names and relationships. The “data” portion contains the minimum amount of information necessary to communicate one instance of the structure, assuming that the receiver already has a copy of the schema.

Now that I have the qdd100 servo in beta phase, the IMU working at full rate, and the quad A1 is moving around I’m getting closer to actually working to improve the gaits that the machine can execute. To date, the gaits I have used completely ignore the IMU and only use the feedback from the joints in order to maintain force in 3D. With tuning and on controlled surfaces this can work well, but if you go outside the happy regime, then it can undergo significant pitch and roll movements during the leg swing phase, which at best results in a janky walk, and at worst results in oscillation or outright instability.

Archives: Serialization

Log file format (diagnostics part 4)

Goals

Design

C++ serialization API (diagnostics part 3)

Revised mjlib serialization design (diagnostics part 2)

Refresher and definitions

Updated serialization library (diagnostics part 1)