Gridware exists to enhance and protect the mother of all networks: the electrical grid. The grid touches everyone and makes our modern economy possible. But it’s also fragile. When the grid is compromised, everything grinds to a halt, and the consequences can be dire: wildfires burn, land is destroyed, property is damaged, progress stops, and lives are lost.
Our team builds smart sensors that help utility companies immediately detect, locate, and fix outages, and take steps to prevent new outages and related disasters from happening at all. The need for power will only increase. We protect the grid of today while we build the grid of tomorrow.
Gridware is privately held and backed by the best climate-tech and Silicon Valley investors. We are headquartered in the Bay Area in Northern California.
About Gridware
Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a new class of grid management called Active Grid Response (AGR), which monitors the electrical, physical, and environmental aspects of the grid that affect reliability and safety. Gridware’s Active Grid Response platform uses high-precision sensors to detect potential issues early, enabling proactive maintenance and fault mitigation. This approach helps improve safety, reduce outages, and keep the grid operating efficiently. The company is backed by climate-tech and Silicon Valley investors. For more information, please visit www.Gridware.io.
Position Overview
The Fleet Engineer will be responsible for ensuring the health, reliability, and uptime of Gridware’s deployed devices through proactive monitoring, diagnostics, and cross-functional coordination. This role requires close collaboration with the engineering, manufacturing, operations, and customer success teams to identify root causes of fleet issues and implement preventative solutions at scale. The ideal candidate will bring a strong technical foundation across hardware, firmware, and software systems, a passion for hands-on problem solving, and a drive to develop tools and processes that support a rapidly growing network of remote IoT devices.
This describes the ideal candidate; many of us have picked up this expertise along the way. Even if you meet only part of this list, we encourage you to apply!
Benefits
Health, Dental & Vision (Gold and Platinum, with some providers' plans fully covered)
Paid parental leave
Alternating day off (every other Monday)
“Off the Grid,” a two-week paid break each year for all employees
Commuter allowance
Company-paid training
Responsibilities
Develop the multi-disciplinary skillset and knowledge base to monitor, analyze, and predict fleet health to enable 99% uptime for Gridware devices.
Serve as the technical and operational expert for deployed hardware to inform product development and operational efficiency.
Fleet Monitoring, Diagnostics & Root Cause Analysis
Continuously monitor device health, uptime, and telemetry data to detect anomalies.
Develop and refine dashboards and alerts to surface hardware/software/firmware issues proactively.
Identify root causes of performance degradation, communication failures, or hardware faults.
Automation & Tooling
Build or extend tools/scripts to automate playbooks for common diagnostics, issue classification, and corrective actions.
Analyze historical device health and environmental conditions to forecast device health and identify preventative measures that maximize device uptime.
Firmware and Configuration Management
Manage over-the-air (OTA) updates for firmware and configurations, including version rollout strategies, validation, and rollback procedures.
Coordinate staged rollouts with validation in test/staging environments.
Assess and characterize proposed fleet changes to balance device health stability with detection and operational improvements at scale.
Hardware Reliability & Field Feedback
Track and analyze hardware failure trends; work with engineering teams to improve design, durability, and manufacturability.
Incorporate field insights to inform future hardware revisions.
Extract failure analysis insights to develop symptom classification, diagnostics, and corresponding prognostics.
Cross-Functional Collaboration
Collaborate with customer success, engineering, manufacturing, and operations to resolve field issues efficiently.
Serve as technical point-of-contact for escalations related to fleet health.
Maintain documentation of device architecture, fleet operations processes, and known issues.
Required Skills
Bachelor's degree in Mechanical Engineering, Electrical Engineering, Computer Engineering, Computer Science, Mechatronics, Robotics, or a related field.
2+ years of experience in test engineering, failure analysis, fleet management, or technical operations roles, preferably in electronics, automotive, or a related field.
At least 2 years of coding in a high-level language (e.g., Python, SQL, MATLAB).
Strong understanding of embedded systems, wireless communication (e.g., LTE, NB-IoT, LoRa), and hardware/software interactions.
Strong analytical and communication abilities to translate highly technical and complex topics for non-technical, cross-functional stakeholders.
Ability to work both independently and collaboratively in a fast-paced environment, managing multiple priorities with a positive attitude.
A passion for sustainability and a keen interest in the energy industry are a strong advantage.
Bonus Skills
Experience scaling systems & processes from tens of thousands to hundreds of thousands of devices.
Familiarity with data analysis tools (e.g., Grafana, Databricks).
Exposure to power-constrained or solar-powered devices.