Edge Deployment Skill

Edge Deployment Skill

Overview

Expert skill for optimizing and deploying machine learning models on robot edge devices including NVIDIA Jetson and embedded systems.

Capabilities

•Configure TensorRT optimization for NVIDIA Jetson
•Set up ONNX model conversion and optimization
•Implement INT8 and FP16 quantization
•Configure DeepStream for video analytics
•Set up CUDA graph optimization
•Implement model pruning and distillation
•Configure DLA (Deep Learning Accelerator) deployment
•Set up multi-stream inference
•Implement ROS2 inference nodes
•Profile and benchmark on target hardware

Target Processes

•nn-model-optimization.js
•object-detection-pipeline.js
•rl-robot-control.js
•field-testing-validation.js

Dependencies

•TensorRT
•ONNX Runtime
•NVIDIA Jetson SDK
•DeepStream

Usage Context

This skill is invoked when processes require deploying ML models on edge devices with optimized inference performance.

Output Artifacts

•TensorRT engine files
•ONNX optimized models
•Quantization configurations
•DeepStream pipeline configs
•Inference benchmark reports
•ROS2 inference node implementations