What is FMEA in simple terms?

FMEA (Failure Mode and Effects Analysis) is a structured way to ask: How can this fail, what happens when it does, and how do we prevent it? On site, that means listing real failure modes – belt mistracking, plugged chutes, overheated bearings, missed lubrication, wrong lockout point – and ranking them so you focus on what actually threatens uptime or safety.

What’s the difference between FMEA, PFMEA, and DFMEA?

FMEA is the overall method. DFMEA (Design FMEA) looks at equipment design, such as new crushers, retrofits, and guard changes. It asks: Did we design out the risk? PFMEA (Process FMEA) looks at how you operate and maintain: start-ups, inspections, PMs, shift handovers. It asks: Where can our process break down? If you're running a plant, PFMEA is usually where the biggest gains are.

What is a failure mode?

A failure mode is the specific way something fails. Not “conveyor downtime.” Instead:

Belt mistracking on a carry-side idler
Bearing overheating on a primary crusher
Hydraulic hose chafing against a frame
Batch plant scale drifting out of tolerance

The more specific you are, the easier it is to prevent a repeat.

What is a Risk Priority Number (RPN)?

Risk Priority Number (RPN) is commonly calculated as: Severity × Occurrence × Detection

Severity: How bad is it? Safety issue? Environmental spill? Full plant stop?
Occurrence: How often does it happen?
Detection: Will we catch it early – or only after failure?

RPN helps you decide where to spend limited maintenance hours. But high-severity safety risks should always move to the front of the line, regardless of the math.

What’s the biggest mistake teams make with FMEA?

They score the risks and then stop there. An FMEA only works if high-risk items turn into action:

New inspection checkpoints
Adjusted PM intervals
Design improvements
Better lockout/tagout steps
Stocking the right spares

If nothing changes on the floor, the FMEA becomes paperwork. If it drives daily routines, it protects uptime.

Why Your FMEA Is Sitting in a Binder (And What It Should Be Doing Instead)

Published: 2026-05-07
Written by: Anju Khanna Saggi

There’s usually a three‑ring binder somewhere on site with “FMEA” (Failure Mode and Effects Analysis) on the spine. It was built during a shutdown or a quality push, printed, filed, and likely forgotten while the day-to-day work on site kept moving.

An FMEA only matters if it changes decisions on the ground, and the ground gets expensive fast. In ABB’s “Value of Reliability” survey of 3,215 maintenance decision-makers, over two-thirds of industrial businesses said they experience unplanned outages at least once a month, with a typical hourly cost close to $125,000. Yet 21% still rely on run-to-fail maintenance.

In this article, we’ll look at why FMEA matters and how you should be using it day to day, including:

What FMEA Does
The Purpose of an FMEA
When an FMEA Doesn’t Get Used
FAQ

What FMEA Does

Failure Mode and Effects Analysis (FMEA) is a step‑by‑step way to identify failure modes, understand their effects, and prioritize what to prevent first. “Failure mode” is plain language: how a thing fails – belt mistracking, plugged chute, cracked weld, wrong lockout point, missed lubrication. Most teams rank each item with a risk priority number (RPN), often Severity × Occurrence × Detection.
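As a rough illustration of the Severity × Occurrence × Detection calculation, here is a minimal sketch. The 1 to 10 scales, the convention that a higher Detection score means the failure is harder to catch early, the FailureMode class, and the example scores are all assumptions for illustration; use whatever scales your site has agreed on.

```python
from dataclasses import dataclass


@dataclass
class FailureMode:
    """One row of an FMEA worksheet (hypothetical structure)."""
    description: str
    severity: int    # assumed scale: 1 (minor) to 10 (worst case)
    occurrence: int  # assumed scale: 1 (rare) to 10 (frequent)
    detection: int   # assumed scale: 1 (caught early) to 10 (only found after failure)

    def __post_init__(self):
        # Reject scores outside the assumed 1-10 range.
        for name in ("severity", "occurrence", "detection"):
            value = getattr(self, name)
            if not 1 <= value <= 10:
                raise ValueError(f"{name} must be between 1 and 10, got {value}")

    @property
    def rpn(self) -> int:
        # Risk priority number: Severity x Occurrence x Detection.
        return self.severity * self.occurrence * self.detection


# Example scores are made up for illustration.
belt = FailureMode(
    "Belt mistracking on a carry-side idler",
    severity=6, occurrence=7, detection=4,
)
print(belt.rpn)  # 6 * 7 * 4 = 168
```

The number only has meaning relative to other items scored on the same scales, so treat it as a way to sort a list, not as an absolute measure of risk.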

Two distinct branches of FMEA matter in industrial operations:

DFMEA looks at design decisions: a new crusher, a retrofit, a guard redesign, a dust collector upgrade.
PFMEA looks at how you run and maintain: start‑up/shutdown, PM routines, inspection points, handovers, and material flow.

Both are heavily formalized in sectors like automotive (AIAG & VDA publish a DFMEA/PFMEA handbook), but the thinking applies to any industrial setting.

The Purpose of an FMEA

A living FMEA is not a compliance tick-box exercise. It’s a prioritized list of “how this plant breaks,” connected to controls your crew can execute. Keep it simple. Start with one constraint asset (primary crusher line, batch plant, your worst conveyor) and do this:

List failure modes in shop terms. “Tail pulley bearing cooks.” “Belt slices on skirt steel.” “Hydraulic hose rubs through.” “Screen blinds in wet feed.”
Score Detection honestly. If you only catch the problem when you smell burning rubber, Detection is poor. If a five-minute walkaround, a temperature-gun route, vibration check, or oil sample catches it early, score it better (on most scales, that means a lower Detection number).
Use RPN to rank work, then force an action. An RPN without a control is just math. Convert your top items into something real, such as an added inspection point, a changed PM interval, a stocked seal kit, or a training step: whatever closes the gap between the score and the fix.
Don’t let RPN hide severity. High-severity safety and compliance items deserve action even if occurrence feels “rare.” A low probability doesn’t make the consequence acceptable.
Close the loop after breakdowns. If the same failure mode hits twice, update occurrence and update the control plan. If you solved it, capture what worked so the fix survives the next crew and the next shift.
Make it accessible where the work happens. If your FMEA, PFMEA, or DFMEA lives in a spreadsheet on someone’s desktop, it won’t survive shift change. The highest-risk failure modes should show up inside daily inspections, PM tasks, and deviation reports – on the tablet in the loader cab or the screen in the shop. When crews can see the top risks, log photos, and update actions in the same place they document the work, the analysis and its resolutions stay alive.
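The ranking steps above can be sketched in a few lines. This is one possible approach, not a standard: the SEVERITY_OVERRIDE threshold of 9, the Item structure, the top_n cut-off, and the example scores are all hypothetical. It sorts by RPN but moves high-severity items to the front regardless of their RPN, then lists the top items that still have no action assigned.

```python
from typing import List, NamedTuple, Optional

# Hypothetical threshold: severity at or above this jumps the queue.
SEVERITY_OVERRIDE = 9


class Item(NamedTuple):
    description: str
    severity: int
    occurrence: int
    detection: int
    action: Optional[str] = None  # the control that closes the gap, if any


def rpn(item: Item) -> int:
    return item.severity * item.occurrence * item.detection


def prioritize(items: List[Item]) -> List[Item]:
    # High-severity items first (False sorts before True), then RPN descending.
    return sorted(items, key=lambda i: (i.severity < SEVERITY_OVERRIDE, -rpn(i)))


def missing_actions(items: List[Item], top_n: int = 3) -> List[Item]:
    # Top-ranked items that have a score but no control attached.
    return [i for i in prioritize(items)[:top_n] if not i.action]


# Example scores are made up for illustration.
items = [
    Item("Tail pulley bearing cooks", 7, 6, 5, action="Add temperature-gun route"),
    Item("Belt slices on skirt steel", 6, 5, 4),
    Item("Wrong lockout point", 10, 2, 3),
    Item("Screen blinds in wet feed", 4, 7, 3, action="Shorten PM interval"),
]

for item in prioritize(items):
    print(rpn(item), item.description)

print([i.description for i in missing_actions(items)])
```

In this example the wrong lockout point has the lowest RPN (60) but ranks first because of its severity, and it shows up in the missing-actions list alongside the belt item until someone assigns a control.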

When an FMEA Doesn’t Get Used

Most site FMEAs die because they’re disconnected from work. They’re written in “meeting room language,” they don’t tie to pre‑shift checks and PMs, and nobody owns updates after the next changeout or workaround.

And that’s where most FMEAs fall apart.

If high-risk failure modes aren’t built into daily inspections, they get forgotten. If repeat breakdowns don’t automatically trigger a review of Occurrence scoring, the RPN never changes. If deviations live in one system and PMs in another, nobody sees the pattern forming.

A living FMEA needs to sit where the work actually happens – in the same flow as pre-shift checks, workplace exams, PM tasks, and incident reporting. When a tech logs a failed bearing with a photo, that data should feed the risk picture. When a supervisor closes a deviation, it should affect how Detection is scored next time.
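As a sketch of what “repeat breakdowns trigger a review of Occurrence scoring” could look like, the snippet below counts logged breakdowns per failure mode and flags any that repeat within a time window. The log format, the 90-day window, the threshold of two, and the function name are assumptions for illustration, not a description of any particular system.

```python
from collections import Counter
from datetime import date, timedelta
from typing import List, Tuple


def modes_needing_review(
    breakdown_log: List[Tuple[str, date]],
    today: date,
    window_days: int = 90,      # assumed look-back window
    repeat_threshold: int = 2,  # assumed: two hits means the score is stale
) -> List[str]:
    """Return failure modes logged at least repeat_threshold times in the window."""
    cutoff = today - timedelta(days=window_days)
    counts = Counter(mode for mode, when in breakdown_log if when >= cutoff)
    return sorted(mode for mode, n in counts.items() if n >= repeat_threshold)


# Example log entries are made up for illustration.
log = [
    ("Tail pulley bearing overheating", date(2026, 3, 2)),
    ("Hydraulic hose rubbed through", date(2026, 3, 20)),
    ("Tail pulley bearing overheating", date(2026, 4, 18)),
    ("Hydraulic hose rubbed through", date(2025, 11, 5)),
]

print(modes_needing_review(log, today=date(2026, 5, 7)))
# ['Tail pulley bearing overheating']
```

The hose failure appears twice in the log but only once inside the window, so it is not flagged. Where to set the window and threshold is a judgment call for each site.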

That’s when FMEA stops being a workshop exercise.

FAQ

Risk Priority Number (RPN) is commonly calculated as: Severity × Occurrence × Detection

Severity: How bad is it? Safety issue? Environmental spill? Full plant stop?
Occurrence: How often does it happen?
Detection: Will we catch it early – or only after failure?

RPN helps you decide where to spend limited maintenance hours. But high-severity safety risks should always move to the front of the line, regardless of the math.

