hey all you people

Applied’s Log Management Handbook: Powering Autonomy With Log Data (Part 1)

October 19, 2022

Log data management is one of the most important tasks that every autonomy program needs to master. Test fleets collect on average four terabytes of log data per vehicle per day, while production fleets (i.e., vehicles purchased by individual consumers) can generate millions of events per day. This firehose of data has enormous potential to power an autonomy program’s development efforts.

Due to the costs and risks involved in real-world testing, it is crucial that autonomy programs collect and manage their log data effectively. For example, to operate a test fleet, autonomy programs must purchase and maintain vehicles and sensors and pay a team of safety operators. Additionally, just one critical mistake during real-world testing can put human lives at risk. Autonomy programs should thus implement practices to scale their data collection efficiently, create a pipeline for effective log data processing, and build scalable workflows that extract the maximum value from all collected data.

Applied Intuition’s log management handbook discusses the technical building blocks, ideal workflows, and cost management strategies of an expansive log management process. This blog post is the first in a three-part series providing a short introduction to these topics. The full-length handbook is available for download below.

Read our log management handbook
Oops! Something went wrong while submitting the form. Please try again.

Introducing Applied’s Log Management Handbook

Definition of log data

In autonomous systems development, log data is any real-world data collected on the system corresponding to the autonomous task at hand. For autonomous vehicles, log data is collected during a drive and ranges from raw sensor inputs to pedal or wheel actuation commands.

Who should read this handbook?

The concepts, principles, and approaches laid out in our log management handbook apply to autonomy programs of all sizes and across industries. Most metrics and examples concern SAE Level 2-4 systems in automotive, but the contents of our handbook are equally relevant to autonomous trucking, construction, mining, and agriculture as well as warehouse robots, unmanned aerial systems, and other types of autonomous systems.

The log management life cycle

Our handbook’s structure follows the journey of a log file from inception to long-term storage (Figure 1). First, an autonomous system collects the log file. Next, data processing pipelines distribute the log file, and different teams explore it according to their specific use case. Finally, the log file lands in long-term storage. Our handbook discusses each of these steps in detail.

Figure 1: Components of a log data workflow.

Different workflows powered by log data

Log data powers various workflows for different teams within autonomous systems development. Our handbook covers the following workflows:

  • Data science: Building a platform to mine data, run analytics, and extract metrics from fleet data.
  • Diagnosing issues: Triaging issues from real-world testing effectively and assigning problems for development teams to solve.
  • Module development: Scaling the development of perception, prediction, motion planning, and localization modules in a cost-effective manner.
  • Curating labeled datasets: Detecting events of interest and creating labels for machine learning (ML) training.
  • Simulation: Creating simulations from log data to power development and triage.
  • Acceptance testing: Verifying if the release of a new software version is ready for further use.
  • Validating supplier solutions: Determining if a system provided by a Tier 1 or Tier 2 supplier performs to the required specifications.
  • Regression testing: Ensuring that previously solved issues do not reappear.

Conclusion

Log data is one of the most essential building blocks of autonomous systems development. Our log management handbook explains common log data management challenges and lays out recommended practices for autonomy programs across industries. The next part of this blog post series will summarize the handbook’s key insights regarding an important step in the log management life cycle: Log exploration. Stay tuned for the next blog post, or download the entire handbook today.

Download the full handbook
Oops! Something went wrong while submitting the form. Please try again.

Contact the Applied team with your questions about this handbook and learn more about Applied’s log management solutions on our website.