PowersYourTeam

โš ๏ธ Failure Mode Reference

Root Cause Analysis / Defect Elimination Guide

Failure Modes
P-F Intervals
RCA Methods
DE Checklist
Bad Actors
โš™๏ธ Pumps - Common Failure Modes

๐Ÿ”„ Centrifugal Pumps

Cavitation VibrationUltrasound

Vapor bubble collapse causing pitting, noise, and reduced flow. Caused by insufficient NPSH, high suction lift, or restrictions.

Seal Failure VisualThermal

Leakage at mechanical seal due to wear, misalignment, dry running, or improper flush. Often secondary to other issues.

Impeller Wear Vibration

Erosion or corrosion of impeller vanes causing imbalance, reduced head/flow. Common with abrasive or corrosive fluids.

Bearing Failure VibrationThermalUltrasound

Spalling, pitting, or seizure. Caused by misalignment, lubrication issues, contamination, or overload.

Misalignment VibrationThermal

Angular or offset misalignment between pump and driver. Causes high vibration at 1X and 2X, coupling wear, seal damage.

Dry Running ThermalElectrical

Operation without fluid causing rapid seal and bearing damage. High motor current drop indicates loss of load.

โšก Motors - Common Failure Modes

๐Ÿ”Œ Electric Motors

Winding Insulation Breakdown MCE/MCAThermal

Degradation of stator insulation due to heat, contamination, or voltage stress. Leads to ground faults or phase-to-phase shorts.

Bearing Failure VibrationUltrasoundThermal

Most common motor failure. Caused by lubrication issues, contamination, shaft currents, or misalignment.

Rotor Bar/End Ring Defects ESA/CSAVibration

Cracked or broken rotor bars causing slip-related sidebands and current signature changes. Causes efficiency loss.

VFD-Induced Bearing Damage VibrationUltrasound

Shaft voltage discharge through bearings causing fluting/pitting. Requires shaft grounding or insulated bearings.

Shaft Imbalance Vibration

Uneven mass distribution causing 1X vibration. From manufacturing defects, buildup, or coupling issues.

Overheating ThermalElectrical

Excessive temperature from overload, voltage imbalance, poor cooling, or high ambient. Accelerates insulation aging.

๐Ÿ”ฉ Bearings - Common Failure Modes

โญ• Rolling Element Bearings

Fatigue Spalling VibrationUltrasound

Subsurface crack propagation leading to material flaking. Normal wear-out mode with identifiable bearing frequencies (BPFO, BPFI, BSF, FTF).

Lubrication Failure UltrasoundThermalOil Analysis

Insufficient, excessive, contaminated, or wrong lubricant. Causes metal-to-metal contact, heat generation, and rapid wear.

Contamination Oil AnalysisVibration

Ingress of particles, water, or process fluids causing abrasive wear and surface damage. Often from poor sealing.

False Brinelling Vibration

Wear marks from small oscillations while stationary. Common in standby equipment exposed to external vibration.

Misalignment Loading VibrationThermal

Uneven load distribution from shaft misalignment causing localized wear patterns and reduced life.

Electrical Fluting VibrationVisual

Washboard pattern on raceways from electrical discharge. VFD-related or poor grounding. Distinct high-frequency noise.

โš™๏ธ Gearboxes - Common Failure Modes

๐Ÿ”ง Gear Reducers/Increasers

Gear Tooth Wear VibrationOil Analysis

Progressive wear from normal operation, misalignment, or contamination. Shows gear mesh frequencies and harmonics.

Gear Tooth Breakage Vibration

Fracture from fatigue, overload, or impact. Strong 1X component with sidebands at gear mesh frequency.

Pitting/Spalling VibrationOil Analysis

Surface fatigue on tooth faces. Early pitting may self-heal; advanced pitting leads to tooth failure.

Lubrication Breakdown Oil AnalysisThermal

Oil degradation, contamination, or wrong viscosity causing inadequate film and accelerated wear.

Shaft Misalignment Vibration

Input/output shaft misalignment causing uneven tooth loading, high vibration, and bearing stress.

๐Ÿ“Š Failure Mode Categories & Detection Methods
Category Examples Primary Detection Secondary Detection Root Cause Areas
Mechanical Misalignment, imbalance, looseness, resonance Vibration Analysis Thermal, Ultrasound Installation, foundation, coupling
Electrical Insulation breakdown, rotor defects, connections Motor Testing (MCE/ESA) Thermal, Current Power quality, VFD, overload
Lubrication Under/over lubrication, contamination, wrong type Ultrasound, Oil Analysis Thermal, Vibration Procedures, training, sealing
Process Cavitation, dry running, off-design operation Process Parameters Vibration, Thermal Operation, design, control
Wear Fatigue, erosion, corrosion, abrasion Vibration, Oil Analysis Visual, NDT Material, environment, age
๐Ÿ“ˆ The P-F Curve
Time Asset Condition Normal Operation Ultrasound/Oil Analysis Vibration Analysis Thermal/Audible Visual/Obvious P Potential Failure F Functional Failure P-F Interval Task Interval (< P-F รท 2)
โฑ๏ธ P-F Intervals by Detection Method
1-9MONTHS
Ultrasound / Oil Analysis Early

Earliest detection of bearing wear, lubrication issues. Particle counts, wear metals, ultrasonic dB levels.

1-6MONTHS
Vibration Analysis Medium

Detects bearing defect frequencies, imbalance, misalignment. Requires baseline and trending.

1-4WEEKS
Infrared Thermography Late

Hot spots visible when significant friction/resistance present. Electrical connections, bearings, motors.

1-7DAYS
Audible Noise / Operator Observation Very Late

Obvious symptoms - grinding, squealing, smoke, visible damage. Failure often imminent.

โš ๏ธ Key Rule

Task Interval โ‰ค P-F Interval รท 2
This ensures you catch the failure between inspections. Example: If P-F = 3 months, inspect at least every 6 weeks.

๐Ÿ“‹ P-F Intervals by Failure Mode
Failure ModeP-F IntervalDetection Technology
Bearing fatigue (spalling)1-9 monthsVibration, Ultrasound, Oil
Lubrication breakdown1-6 monthsUltrasound, Oil Analysis
Gear tooth wear1-6 monthsVibration, Oil Analysis
Pump cavitation1-3 monthsVibration, Ultrasound
Motor insulation6-12 monthsMCE, Megger, PD
Belt wear1-3 monthsVisual, Vibration
Coupling wear1-4 monthsVibration, Visual
Seal degradation2-8 weeksVisual, Leak detection
Electrical connections1-4 weeksThermal, Resistance
Structural fatigueMonths-YearsNDT, Visual inspection
๐Ÿ” 5 Whys Method

Ask "Why?" repeatedly (typically 5 times) to drill down from symptom to root cause. Simple but effective for straightforward failures.

Problem: Pump bearing failed
Why 1: Bearing overheated โ†’ Inadequate lubrication
Why 2: Why inadequate lube? โ†’ Grease not applied on schedule
Why 3: Why not on schedule? โ†’ PM not in CMMS
Why 4: Why not in CMMS? โ†’ Never entered after installation
Why 5: Why not entered? โ†’ No commissioning checklist requirement
Root Cause: Inadequate commissioning procedures
๐ŸŸ Fishbone (Ishikawa) Diagram

Categorize potential causes into standard groups. Use the 6 M's for manufacturing/maintenance analysis.

EFFECT Machine Method Material Manpower Measurement Mother Nature Wear Procedure Training

The 6 M's

  • Machine: Equipment, tools, technology
  • Method: Procedures, processes
  • Material: Raw materials, consumables
  • Manpower: People, training, skills
  • Measurement: Inspection, data, calibration
  • Mother Nature: Environment, conditions
๐ŸŒณ Fault Tree Analysis (FTA)

Top-down deductive analysis starting from the undesired event (top event) and working down to identify all possible causes using logic gates.

Pump Failure OR Bearing Failure Seal Failure Motor Failure AND Poor Lube Misalign Contam OR Dry Run Wear Legend Top Event Intermediate Basic Event
๐Ÿ“Š ABC's of Failure

Research by Winston Ledet shows that failures originate from three source categories:

10%
Design & Engineering
A - ASSET
6%
Manufacturing & Materials
B - BUILD
84%
Operation & Maintenance
C - CARELESS

84% From "Careless" Work Habits

"Careless" doesn't mean irresponsible - it means not providing the care the equipment needs. This includes design engineers specifying wrong materials, maintenance engineers omitting failure modes from PM programs, or managers not acting on performance trends. These are controllable!

โœ… RCA Best Practices
  • Preserve evidence - photos, failed parts, data
  • Gather data before memories fade (within 24-48 hrs)
  • Include operators, maintainers, and engineers
  • Focus on system/process, not blame
  • Ask "Why?" until you reach controllable causes
  • Look for latent causes (decisions made months/years ago)
  • Verify root cause with data/evidence
  • Develop SMART corrective actions
  • Track implementation and verify effectiveness
  • Share lessons learned across organization
๐Ÿ“ Defect Elimination Checklist Generator

Select equipment type to generate a customized defect elimination checklist for inspections and walkdowns.

Equipment Checklist

0 of 0 items completed
๐ŸŽฏ Defect Elimination Process
1. Identify Bad Actors

Use Pareto analysis on downtime, maintenance cost, and failure frequency. 5-10% of equipment typically causes 80% of losses.

2. Conduct RCA

For significant failures, use 5 Whys, Fishbone, or FTA to find root causes. Focus on controllable causes.

3. Develop Corrective Actions

Create SMART actions addressing root causes. Consider design changes, procedure updates, training, and PM optimization.

4. Implement & Track

Enter actions into CMMS. Track completion and verify effectiveness at 6, 12, and 24 months.

5. Sustain - Fix Forever

Share lessons learned. Update procedures and training. The goal is "Fix Forever, Not Forever Fixing."

๐Ÿ”ง Common Defect Categories to Inspect
Category What to Look For Detection Method Typical Causes
Looseness Loose bolts, guards, covers, foundation Visual, vibration, touch Vibration, improper torque, thermal cycling
Leakage Oil, grease, water, process fluid leaks Visual, UV dye, ultrasound Seal wear, gasket failure, over-lubrication
Contamination Dirt, debris, water ingress, product buildup Visual, oil analysis Poor sealing, housekeeping, environment
Corrosion Rust, pitting, discoloration, scaling Visual, UT thickness Environment, chemical exposure, coating failure
Wear Belts, couplings, guards, seals showing wear Visual, measurement Normal operation, misalignment, contamination
Heat Hot spots on motors, bearings, connections Thermal imaging, touch Friction, electrical resistance, overload
Noise/Vibration Unusual sounds, excessive vibration Listen, feel, instruments Imbalance, misalignment, looseness, wear
Documentation Missing labels, outdated procedures, P&IDs Review, audit MOC gaps, poor practices
๐ŸŽฏ Bad Actor Analysis

"Bad Actors" are equipment that consistently underperforms or causes repeated failures. Typically 5-10% of equipment causes 80% of losses.

Identifying Bad Actors

Analyze your CMMS data to find equipment with:

  • Highest maintenance costs (parts + labor)
  • Most work orders (frequency)
  • Greatest downtime impact
  • Repeat failures of same component
  • Chronic condition monitoring alerts

โš ๏ธ Don't Confuse Symptoms with Causes

A "bad actor" pump may actually be a victim of upstream process issues, poor installation, or inadequate procedures. Always investigate the root cause before labeling equipment as problematic.

๐Ÿ“Š Pareto Analysis Example
$0 $50K $100K 0% 50% 100% 80% P-101 C-201 M-305 P-102 F-401 Other Top 3 = 80% of cost
๐Ÿ“‹ Bad Actor Action Plan Template
Step Action Owner Timeline Deliverable
1 Extract maintenance data from CMMS (12-24 months) Reliability Engineer Week 1 Data export, Pareto chart
2 Identify top 10 bad actors by cost/downtime Reliability Engineer Week 1 Prioritized equipment list
3 Form cross-functional team for top 3 Maintenance Manager Week 2 Team charter, meeting schedule
4 Conduct detailed failure analysis (RCA) RCA Team Weeks 2-4 RCA reports with root causes
5 Develop corrective actions with cost/benefit RCA Team Week 4 Action plan with ROI
6 Present and approve recommendations Management Week 5 Approved action plan
7 Implement changes (CMMS, procedures, design) Assigned owners Weeks 6-12 Completed actions
8 Monitor and verify effectiveness Reliability Engineer 6, 12, 24 months Performance tracking report
๐Ÿ“ˆ Metrics to Track
  • MTBF - Mean Time Between Failures
  • MTTR - Mean Time To Repair
  • Availability - Uptime percentage
  • Maintenance Cost - Per equipment/system
  • Work Order Count - CM vs PM ratio
  • Repeat Failures - Same failure mode recurrence
  • PdM Alerts - Condition monitoring findings
  • Safety Incidents - Related to equipment
๐Ÿ† Expected Results

Research by Winston Ledet (DuPont/Manufacturing Game) shows:

30%
Downtime reduction from PM/Planning
60%
Additional reduction from DE

โœ“ Total Potential: 90% Downtime Reduction

Best-performing sites achieve >98% uptime through the combination of effective planning/scheduling, preventive/predictive maintenance, and defect elimination. You cannot achieve world-class reliability without DE!