Apache Superset for Manufacturing OEE and Plant Performance
Build real-time OEE dashboards with Apache Superset. Monitor plant floor availability, performance, quality metrics—and reduce downtime without vendor lock-in.
Understanding OEE and Why It Matters for Modern Manufacturing
Overall Equipment Effectiveness (OEE) is a manufacturing metric that quantifies how well a production line or plant floor operates relative to its theoretical maximum. It’s the product of three components: availability (the percentage of time equipment runs when scheduled), performance (the speed at which it runs relative to design capacity), and quality (the percentage of products that meet specifications). When you multiply these three factors together, you get a decimal between 0 and 1—and that number drives every decision a manufacturing operations team makes.
The formula is straightforward: OEE = Availability × Performance × Quality. A plant running at 85% OEE is leaving 15% of its potential output on the table every single day. For a facility producing 1,000 units per shift, that’s 150 units of lost capacity, lost revenue, and lost competitive advantage. Understanding OEE in production measurement and improvement is critical because it forces you to identify where those losses actually occur—and that’s where data visibility becomes non-negotiable.
Traditional manufacturing operations teams rely on spreadsheets, manual data entry, and lag-time reporting. A shift supervisor might wait until end-of-day to learn that a bottleneck on Line 3 cost them 40 units of output. By then, the problem is already compounded. Real-time OEE dashboards change that equation. They give plant managers, engineers, and operations teams the visibility they need to act on downtime, speed losses, and quality issues the moment they occur—not after the damage is done.
This is where Apache Superset enters the picture. Apache Superset is an open-source data visualization and business intelligence platform that can be configured to ingest production data from your equipment, MES (Manufacturing Execution System), or ERP, and transform it into interactive, real-time OEE dashboards. Unlike proprietary BI platforms like Looker or Tableau, Superset gives you full control over your data stack, no vendor lock-in, and the flexibility to embed analytics directly into your production management systems.
The Three Pillars of OEE Measurement and Why Data Architecture Matters
Before you build dashboards, you need to understand what you’re measuring and where the data comes from. OEE breaks down into three distinct pillars, and each requires different data sources and calculation logic.
Availability: Tracking Scheduled vs. Actual Runtime
Availability measures the percentage of time your equipment actually runs during scheduled production time. The formula is simple: Availability = Run Time / Planned Production Time. But the data engineering is not.
To calculate availability, you need:
- Scheduled production windows: What hours is the line supposed to run? This often comes from your production schedule or MES.
- Actual run time: How many minutes did the equipment actually produce? This comes from your PLC (Programmable Logic Controller), SCADA system, or equipment sensors.
- Downtime events: Every stop, every pause, every maintenance window. You need timestamps, duration, and reason codes (unplanned breakdown, changeover, maintenance, etc.).
The challenge is that downtime data is often scattered. Your SCADA system logs equipment state changes. Your MES tracks scheduled maintenance. Your maintenance team logs repairs in a separate system. A robust OEE dashboard needs to pull from all three sources, normalize timestamps, and reconcile discrepancies.
OEE systems in manufacturing require this integration—and Apache Superset’s API-first architecture makes it possible to connect to multiple data sources, apply transformation logic at the SQL layer, and present a unified availability metric that everyone trusts.
Performance: Measuring Speed and Throughput
Performance (also called production efficiency) measures how fast your equipment runs relative to its design speed. The formula is: Performance = Actual Throughput / Theoretical Maximum Throughput.
If your production line is designed to produce 100 units per hour but you’re only producing 85 units per hour, your performance is 85%. This loss can stem from minor speed reductions, micro-stops (brief pauses under 1 minute), or intentional slowdowns due to material quality or operator constraints.
Measuring performance requires:
- Theoretical cycle time: How long should it take to produce one unit? This is typically defined in your engineering specs or MES.
- Actual cycle time: How long did it actually take? This comes from production counters, vision systems, or manual logging.
- Scrap and rework: Units that don’t count toward throughput because they’re defective or require rework.
Performance is where real-time dashboards shine. A plant floor manager can see within seconds that Line 2 is running at 78% performance instead of 92%, investigate why (is it a material issue, an operator issue, or equipment degradation?), and take corrective action before the shift ends.
Quality: Defect Rate and First-Pass Yield
Quality measures the percentage of units produced that meet specification without rework or scrap. The formula is: Quality = Good Units / Total Units Produced.
Quality data comes from:
- Inline inspection systems: Automated vision systems, coordinate measuring machines (CMMs), or other sensors that detect defects in real-time.
- Post-production inspection: Manual or automated quality checks after production.
- Customer returns and field failures: Defects that escape production but are caught downstream.
- Rework tracking: Units that failed initial inspection but were repaired.
A quality metric of 98% sounds good until you realize that 2% of your output is scrap or rework—which costs material, labor, and time. Visualizing production performance using OEE dashboards reveals where quality issues originate, whether they’re systemic (a particular machine consistently produces defects) or sporadic (a specific shift or operator).
Building Your Data Foundation: From Equipment to Dashboard
Apache Superset is a visualization and query layer, not a data warehouse. To build effective OEE dashboards, you need a solid data foundation underneath. Here’s the typical architecture:
Data Sources and Integration Points
Your OEE data likely lives in multiple systems:
- PLC/SCADA systems: Real-time equipment state, cycle counts, temperature, pressure, and other process variables. Usually accessed via OPC-UA, MQTT, or REST APIs.
- MES (Manufacturing Execution System): Production schedules, work orders, downtime logs, quality results, and material tracking.
- ERP system: Bill of materials, production costs, inventory, and historical production data.
- Historian database: Time-series data from equipment sensors, often stored in InfluxDB, TimescaleDB, or similar.
- Standalone quality systems: SPC (Statistical Process Control) software, lab management systems, or inspection equipment databases.
Apache Superset can query most of these directly via JDBC drivers or REST APIs. But the real power comes from staging this data in a central warehouse or data lake—a PostgreSQL database, Snowflake, BigQuery, or similar—where you can normalize timestamps, calculate OEE metrics, and store historical trends.
Calculating OEE at the Database Layer
Instead of calculating OEE in the dashboard (which is slow and brittle), calculate it in your data warehouse using SQL. Here’s a simplified example:
WITH availability_calc AS (
SELECT
line_id,
shift_date,
SUM(CASE WHEN equipment_state = 'running' THEN duration_minutes ELSE 0 END) as run_time_minutes,
SUM(duration_minutes) as planned_time_minutes,
ROUND(100.0 * SUM(CASE WHEN equipment_state = 'running' THEN duration_minutes ELSE 0 END) / SUM(duration_minutes), 2) as availability_pct
FROM equipment_logs
WHERE shift_date >= CURRENT_DATE - INTERVAL '30 days'
GROUP BY line_id, shift_date
),
performance_calc AS (
SELECT
line_id,
shift_date,
COUNT(*) as units_produced,
COUNT(*) * line_target_cycle_time_sec / (SUM(shift_duration_sec)) as performance_pct
FROM production_logs
GROUP BY line_id, shift_date
),
quality_calc AS (
SELECT
line_id,
shift_date,
COUNT(CASE WHEN quality_status = 'pass' THEN 1 END) as good_units,
COUNT(*) as total_units,
ROUND(100.0 * COUNT(CASE WHEN quality_status = 'pass' THEN 1 END) / COUNT(*), 2) as quality_pct
FROM quality_logs
GROUP BY line_id, shift_date
)
SELECT
a.line_id,
a.shift_date,
a.availability_pct,
p.performance_pct,
q.quality_pct,
ROUND(a.availability_pct * p.performance_pct * q.quality_pct / 10000, 2) as oee_pct
FROM availability_calc a
JOIN performance_calc p ON a.line_id = p.line_id AND a.shift_date = p.shift_date
JOIN quality_calc q ON a.line_id = q.line_id AND a.shift_date = q.shift_date
ORDER BY a.shift_date DESC, a.line_id;
This approach—calculating OEE upstream in your data warehouse—is far more efficient and maintainable than trying to build it inside Superset. Once this table exists, Superset simply queries it and visualizes the results.
Designing OEE Dashboards in Apache Superset
Now that you have clean, calculated OEE data, it’s time to build dashboards that your operations team will actually use. A good OEE dashboard answers three questions immediately:
- What is our current OEE, and how does it compare to target?
- Which line, shift, or product is dragging down overall performance?
- What is the root cause of the gap—availability, performance, or quality?
The Executive OEE Overview
Start with a single-screen dashboard that shows the big picture. This is what plant managers and operations directors see when they walk in:
- Large OEE gauge or KPI card showing current plant OEE (e.g., 82%) vs. target (e.g., 85%), with a color indicator (red, yellow, green).
- OEE by production line: A horizontal bar chart showing each line’s OEE, sorted from lowest to highest. This immediately highlights problem lines.
- Availability, Performance, Quality breakdown: Three side-by-side gauges showing the contribution of each pillar to overall OEE. If availability is 95%, performance is 88%, and quality is 97%, it’s obvious that performance is the bottleneck.
- Downtime events (last 24 hours): A table showing the most recent unplanned stops—duration, line, reason code, and impact. This drives immediate action.
- Trend chart: A line chart showing OEE over the last 7-30 days, with a target line overlaid. Is OEE improving, stable, or declining?
D23 is the modern BI platform built on Apache Superset and can help you deploy these dashboards quickly, with pre-built templates for manufacturing use cases and expert guidance on data architecture.
The Detailed Line Performance Dashboard
Plant engineers and shift supervisors need deeper visibility. A second dashboard drills into a specific production line:
- Line-level OEE gauge and target.
- Availability breakdown: A stacked bar chart showing planned production time, actual run time, and downtime. A pie chart showing downtime by reason code (unplanned breakdown, changeover, maintenance, etc.).
- Performance trend: A line chart showing actual throughput vs. target throughput, hour by hour. When does the line slow down? Is it consistent or sporadic?
- Quality by shift: A table showing defect rate by shift, with the ability to filter by defect type. Is quality worse on the night shift? For a particular product?
- Cycle time analysis: A histogram showing the distribution of cycle times. Are most units produced at target speed, or is there a long tail of slow cycles?
The Root Cause Investigation Dashboard
When OEE drops, you need to investigate. A third dashboard is designed for troubleshooting:
- Date and line filters so you can zoom into a specific time period and equipment.
- Detailed downtime log: Timestamp, duration, reason code, operator notes. Are there patterns? Does a particular operator have more downtime?
- Correlation analysis: Quality vs. throughput. When defect rate spiked on Tuesday, did throughput also drop? This suggests a common root cause (e.g., material batch issue).
- Equipment health metrics: Temperature, pressure, vibration trends. Is equipment degradation causing the OEE loss?
- Operator and shift analysis: Is performance correlated with specific operators, shifts, or days of the week? This informs training and scheduling decisions.
OEE dashboards based on ISO 22400 standards help you standardize these metrics across your organization, so every plant measures OEE the same way.
Advanced Capabilities: AI, Alerts, and Embedded Analytics
Apache Superset’s core strength is visualization, but modern manufacturing dashboards need more.
Text-to-SQL and Natural Language Queries
Many plant floor users aren’t SQL experts. They ask questions like “Why did downtime spike on Line 3 yesterday?” or “Which products have the highest defect rate this month?” With D23’s AI-powered text-to-SQL capabilities, you can enable operators and supervisors to ask these questions in plain language, and the system automatically generates and executes the appropriate SQL query.
This democratizes analytics. Your shift supervisor doesn’t need to wait for an analyst to pull a report—they get answers in seconds.
Alerting and Notifications
Dashboards are passive. Alerts are active. Apache Superset supports alerts on chart thresholds: if OEE drops below 80%, if a line hasn’t produced a unit in 15 minutes, if defect rate exceeds 3%, send an email or Slack notification to the relevant team.
For manufacturing, this is critical. The faster you know about a problem, the faster you can respond. A 15-minute detection delay on a production line costs you hundreds of dollars in lost output.
Embedding Analytics in Your Production Systems
Many manufacturing teams use custom production management systems—homegrown applications, industry-specific software, or systems built on top of their MES. Apache Superset’s API and embedding capabilities allow you to embed OEE dashboards directly into these systems.
Instead of operators toggling between their production system and a separate BI tool, they see OEE metrics and drill-down reports right where they work. This reduces friction, increases adoption, and accelerates decision-making.
D23 provides API-first BI and MCP server integration specifically for this use case—embedding self-serve analytics into production and engineering platforms without the overhead of managing Superset infrastructure yourself.
Comparing Apache Superset to Proprietary Alternatives for Manufacturing
Looker, Tableau, and Power BI are powerful BI platforms, but they come with tradeoffs for manufacturing organizations.
Cost
Tableau and Looker are licensed per user, with annual costs ranging from $70-$200+ per user depending on features. For a plant with 50 operators, supervisors, and engineers, that’s $3,500-$10,000 per year just in licensing. Apache Superset is open-source and free. D23’s managed Superset service charges for hosting and support, but typically costs 40-60% less than proprietary platforms at scale.
Data Residency and Security
Manufacturing data is sensitive. Intellectual property, production volumes, quality metrics—all of it can be competitively sensitive. With proprietary platforms, your data flows through the vendor’s infrastructure. With Apache Superset, you control where your data lives. It stays in your data warehouse, in your cloud account, or on-premises. No data leaves your network unless you explicitly configure it to.
Customization and Flexibility
Propriety platforms have opinionated UI/UX. You build dashboards within the constraints of their design system. Apache Superset is open-source. You can customize the interface, add custom visualizations, integrate with your existing tools, and modify the code to fit your specific needs.
For manufacturing, this matters. You might want to embed OEE dashboards in a custom production control system. You might want to integrate with your MES via REST APIs. You might want to build a custom visualization that shows your plant layout with real-time OEE overlaid on each production line. Apache Superset’s flexibility makes all of this possible.
Speed of Implementation
Top OEE software solutions for manufacturing vary widely in deployment speed. Proprietary platforms often require lengthy implementations—months of configuration, data mapping, and training. Apache Superset can be up and running in weeks, especially if you partner with a consulting firm experienced in manufacturing analytics.
Real-World Implementation: From Data to Dashboards
Let’s walk through a realistic manufacturing scenario. You’re a plant manager at a mid-sized automotive parts supplier. You have three production lines, each running two shifts per day. Your MES logs production data, your quality system logs defects, and your SCADA system logs equipment state. But you have no real-time visibility into OEE. You find out about problems at end-of-shift or the next morning.
Week 1-2: Data Architecture
You work with a data engineer to:
- Export historical production, quality, and equipment data from your MES, quality system, and SCADA into a staging area (S3, Azure Blob, or a local database).
- Set up real-time data pipelines using tools like Kafka, Airflow, or AWS Lambda to continuously ingest new data.
- Build a data warehouse (PostgreSQL, Snowflake, or similar) with normalized tables for production events, quality results, and equipment logs.
- Write SQL queries that calculate OEE, availability, performance, and quality metrics.
Week 3: Dashboard Development
You deploy Apache Superset and build the three dashboards described earlier: executive overview, line-level detail, and root cause investigation. You configure filters for date range, production line, and shift. You set up alerts for OEE < 80%.
Week 4: Rollout and Training
You train your shift supervisors, maintenance team, and plant management on how to use the dashboards. You establish a daily stand-up where the team reviews overnight OEE trends and discusses action items.
Month 2+: Optimization
You start seeing patterns. Line 2 consistently underperforms on the night shift—is it a staffing issue, equipment degradation, or material batch issue? You correlate OEE with external factors (weather, supplier, operator, product type) and identify root causes. You implement corrective actions and measure the impact in real-time.
Within 3-6 months, you’ve improved plant OEE from 78% to 84%—a 6-point improvement that translates to 8-10% more output without additional capital investment.
Measuring Success: KPIs and ROI
How do you know if your OEE dashboard investment is working? Track these metrics:
Operational KPIs
- OEE improvement: Baseline vs. current. A 5-10% improvement is realistic within 6 months of implementing real-time dashboards.
- Mean Time to Detection (MTTD): How long before you notice a problem? With real-time dashboards, this should be minutes, not hours.
- Mean Time to Resolution (MTTR): How long to fix a problem once you know about it? Real-time root cause data accelerates this.
- Downtime reduction: Unplanned downtime as a percentage of scheduled time. Real-time visibility and alerts reduce unplanned stops.
- Quality improvement: Defect rate reduction. When you can see quality issues in real-time and correlate them with equipment, material, and operator factors, you fix them faster.
Financial ROI
- Revenue impact: OEE improvement directly translates to more output. If you improve OEE by 5%, and your plant produces $10M in annual output, that’s $500K in incremental revenue.
- Cost savings: Fewer unplanned downtime events mean less emergency maintenance, less overtime, and less scrap. These savings compound.
- Implementation cost: Data architecture, dashboard development, training, and ongoing support. With Apache Superset, this is typically 40-60% cheaper than proprietary platforms.
How to calculate OEE performance provides detailed guidance on measurement methodologies that ensure your improvements are real and auditable.
Scaling Across Multiple Plants and Facilities
If you manage multiple manufacturing facilities, a centralized OEE dashboard becomes even more valuable. You can:
- Compare performance across plants: Which facility has the best OEE? What are they doing differently?
- Identify best practices: If Plant A’s Line 2 achieves 92% OEE and Plant B’s equivalent line achieves 78%, you can investigate and replicate the better practices.
- Allocate resources: If one plant is consistently underperforming, you can prioritize maintenance, training, or capital investment there.
- Benchmark against industry standards: Typical OEE benchmarks vary by industry, but 85%+ is generally considered world-class. Are you hitting that across all facilities?
Maximizing production potential with OEE knowledge helps you establish realistic targets and drive continuous improvement across your organization.
Apache Superset scales to support this easily. A single Superset instance can serve dozens of plants, with row-level security ensuring that Plant A operators only see their facility’s data.
Integration with Existing Manufacturing Systems
Your OEE dashboards don’t exist in isolation. They need to integrate with your existing manufacturing ecosystem.
MES Integration
Your MES is the source of truth for production schedules, work orders, and downtime logs. Apache Superset can query your MES database directly via JDBC, or you can set up real-time data pipelines that push MES data into your warehouse.
ERP Integration
Your ERP contains bill of materials, production costs, and inventory data. By joining ERP data with production and quality data, you can analyze OEE in the context of product profitability. Maybe Line 1 has lower OEE but produces higher-margin products, so the lower efficiency is acceptable. Maybe Line 2 has high OEE on low-margin products, suggesting you should shift capacity to higher-margin work.
Equipment Integration
Modern production equipment (CNC machines, injection molders, assembly robots) often have built-in connectivity. You can pull real-time cycle counts, temperature, pressure, and other process variables directly into your dashboards. This enables predictive maintenance—when you see equipment parameters degrading, you schedule maintenance before failure occurs.
Quality System Integration
Automated inspection systems, CMMs, and SPC software generate quality data. Integrating this into your OEE dashboard allows you to see quality trends in real-time and correlate quality issues with equipment, material, and operator factors.
Security, Compliance, and Data Governance
Manufacturing data is sensitive. Your OEE dashboards must be secure and compliant.
Access Control
Apache Superset supports role-based access control (RBAC). You can define roles (operator, supervisor, plant manager, executive) with different dashboard and data access levels. Plant A operators only see Plant A data. Maintenance teams see downtime details but not quality data. Executives see aggregated metrics across all plants.
Data Encryption
Data should be encrypted in transit (TLS/SSL) and at rest. D23’s managed service handles encryption and security infrastructure, so you don’t have to.
Audit Logging
Who accessed which dashboards, when, and what data did they view? Apache Superset logs all of this, enabling audit trails for compliance and security investigations.
Data Retention and Deletion
Manufacturing data can have retention requirements (e.g., quality data must be kept for 5 years for traceability). Your data architecture should support archiving old data and purging data when retention periods expire.
Review D23’s terms of service and privacy policy to understand how data is handled in a managed environment.
Getting Started: Roadmap and Next Steps
If you’re a manufacturing operations leader considering Apache Superset for OEE dashboards, here’s a realistic roadmap:
Phase 1: Assessment and Planning (Weeks 1-2)
- Audit your current data sources (MES, SCADA, quality systems, ERP).
- Define OEE targets and KPIs for your organization.
- Identify key stakeholders (shift supervisors, maintenance, plant management, executive team).
- Estimate data volume and refresh frequency.
- Decide: self-hosted Superset or managed service like D23?
Phase 2: Data Architecture (Weeks 3-6)
- Set up a data warehouse or data lake.
- Build data pipelines to ingest production, quality, and equipment data.
- Write SQL queries to calculate OEE metrics.
- Validate data quality and reconcile discrepancies.
Phase 3: Dashboard Development (Weeks 7-10)
- Deploy Apache Superset.
- Build the three core dashboards (executive overview, line detail, root cause investigation).
- Configure alerts and notifications.
- Set up access control and security.
Phase 4: Rollout and Adoption (Weeks 11-12)
- Train users on dashboard navigation and interpretation.
- Establish daily/weekly review cadences.
- Gather feedback and iterate on dashboard design.
- Document processes and best practices.
Phase 5: Optimization (Ongoing)
- Monitor OEE trends and identify improvement opportunities.
- Correlate OEE with external factors (material, operator, equipment age).
- Implement corrective actions and measure impact.
- Expand dashboards to include predictive analytics, anomaly detection, and AI-assisted root cause analysis.
Conclusion: The Case for Apache Superset in Manufacturing
OEE is the north star metric for manufacturing operations. It quantifies efficiency, drives decision-making, and directly impacts profitability. Real-time OEE dashboards transform operations from reactive (finding out about problems hours later) to proactive (detecting and responding to issues in minutes).
Apache Superset, especially when paired with expert data consulting and managed infrastructure, provides the visibility, flexibility, and cost-effectiveness that manufacturing organizations need. Unlike proprietary platforms, Superset gives you control over your data, the ability to customize for your specific needs, and the freedom to integrate with your existing systems.
Whether you’re a plant manager trying to improve a single facility, a manufacturing director managing multiple plants, or a private equity firm standardizing analytics across a portfolio of companies, Apache Superset for OEE dashboards is a pragmatic, proven approach that delivers real operational and financial results.
Start with a clear understanding of your data sources and OEE targets. Build a solid data foundation in your warehouse. Deploy focused dashboards that answer the questions your team actually asks. Train your users. Measure impact. Iterate. Within months, you’ll see OEE improvements, downtime reductions, and quality gains that justify the investment many times over.