Search
Filters

Thermal protection interlock control for transistor modules

Transistor Module Overheat Protection Linkage Control: How to Save Your Power Stage Before It Dies

Overheating does not always announce itself with smoke and flames. More often, a transistor module silently degrades over thousands of thermal cycles — bond wires lift, solder layers crack, and the junction temperature climbs higher with each cycle until one day the device fails catastrophically. Overheat protection is not a nice-to-have feature. It is the last line of defense between a controlled shutdown and a destroyed module. But protection alone is not enough. The real question is how you link that protection to the rest of your system so the response is fast, coordinated, and actually saves the hardware.

Why Simple Temperature Cutoff Is Not Enough

Most engineers implement overheat protection by monitoring the junction temperature and shutting down the gate drive when a threshold is crossed. That works in theory. In practice, it fails for three reasons.

First, temperature sensors have lag. A thermistor or diode sensor mounted on the baseplate or heatsink reads the case temperature, not the junction temperature. By the time the case reaches the trip point, the junction could already be 20 to 40 degrees hotter. For IGBT modules, that delay of a few milliseconds can be the difference between a safe shutdown and thermal runaway.

Second, a hard shutdown creates its own problems. When you kill the gate drive instantly, the inductive load dumps its stored energy into the freewheeling diodes or snubber network. This generates voltage spikes that can exceed the module's safe operating area. So the protection that was supposed to save the module ends up stressing it in a different way.

Third, temperature-only protection ignores the root cause. A module can overheat because of overcurrent, excessive switching frequency, poor cooling, or a combination of all three. If your protection only reacts to temperature, you are treating the symptom, not the disease. The module survives this time, but the underlying problem remains, and it will fail again — probably sooner.

Building a Linked Protection System That Actually Works

A proper overheat protection scheme does not live in isolation. It talks to the gate driver, the current sensor, the cooling system, and the main controller. When one parameter goes out of bounds, the others adjust proactively instead of waiting for a hard trip.

Combining Temperature and Current Feedback for Early Warning

The most effective protection strategy uses both junction temperature and collector current as inputs. Current is the leading indicator — it rises before temperature does. If the current exceeds a safe threshold for even a few microseconds, the protection system can reduce the PWM duty cycle or lower the switching frequency before the junction has time to heat up.

This is called derating. Instead of waiting for the temperature to hit 150 degrees and then shutting down, the controller gradually reduces power output as current climbs. For IGBT modules, a common approach is to scale the maximum allowable duty cycle inversely with the measured current. At half the rated current, you can run at full duty. At 80 percent of rated current, the duty cycle drops to 60 percent. At 100 percent rated current, the duty cycle is limited to 40 percent. This keeps the junction temperature in a safe band without ever triggering a hard shutdown.

The linkage between current sensing and gate drive is critical here. The gate driver must be able to accept an external analog signal or a digital command that reduces the PWM duty cycle in real time. Most modern gate driver ICs support this through a fault input pin or a PWM input with variable reference. Use it. Do not rely on the main MCU to kill the PWM after the fact — the latency is too high.

Using Desaturation Detection as a Fast Overheat Proxy

Desaturation detection is one of the most underused protection features in transistor module drive circuits. When an IGBT module is fully on, the collector-emitter voltage should be low — typically 1.5 to 3 volts. If the device is not fully saturated, that voltage rises sharply to the bus voltage level. This happens when the module is overloaded, shorted, or overheating and losing gain.

A desat detection circuit monitors the VCE voltage during the on-state. If it exceeds a threshold — usually 7 to 9 volts — the driver immediately pulls the gate low and triggers a fault. This response happens in under 5 microseconds, which is orders of magnitude faster than any temperature-based protection.

The trick is linking desat detection to your overheat system. When a desat fault fires, do not just shut down and wait. Log the event, reduce the allowable current for the next startup, and alert the controller that the module is under stress. If desat faults happen repeatedly, the system should refuse to restart until a cooling period has elapsed. This prevents the module from being hammered back into service before it has recovered.

Cooling System Integration: The Forgotten Half of Overheat Protection

Active Cooling Response Triggered by Temperature Feedback

Most protection schemes focus on reducing power when temperature rises. But you can also increase cooling at the same time. This is where linking the protection circuit to the fan or liquid cooling system pays off.

When the junction temperature reaches 80 percent of the maximum rating, ramp up the fan speed or increase the coolant flow rate. This is a proportional response — not on-off. A sudden jump to full fan speed creates mechanical stress and acoustic noise. A gradual ramp keeps the system quiet and extends fan life. For liquid-cooled systems, increase the pump speed or open a bypass valve to push more coolant through the cold plate.

The key is that this cooling response must happen before the temperature trip point. If you wait until the module is already at 150 degrees to turn on the fan, you have already lost the battle. The cooling system should be running at full capacity by the time the module reaches 100 degrees. This requires a control loop with enough bandwidth to react in seconds, not minutes.

Thermal Runaway Prevention Through Power Derating Curves

Every transistor module has a safe operating area defined by current, voltage, and temperature. Outside that area, the device fails. The protection system must enforce this boundary dynamically, not just at a single fixed point.

Implement a derating curve in your firmware. At low temperature, the module can handle full current. As temperature climbs, the maximum allowable current drops linearly or according to the datasheet curve. At the maximum junction temperature, the current limit drops to zero — which means shutdown. This curve should be stored in the controller's memory and applied to the PWM duty cycle in real time.

The linkage works like this: temperature sensor feeds the junction temperature to the controller. The controller looks up the derating curve and calculates the maximum allowable current. That current limit is sent to the gate driver as a reference voltage or a digital command. The gate driver then adjusts the PWM duty cycle to stay within the limit. If the load demands more current than the derated limit allows, the controller either reduces the output or triggers a controlled shutdown with a cool-down timer.

Fault Sequencing: What Happens First Matters

When multiple faults occur simultaneously — overcurrent, overtemperature, desaturation, undervoltage — the protection system must decide what to do first. Getting this sequence wrong can make a bad situation worse.

Desaturation Beats Temperature Every Time

If a desaturation fault and an overtemperature fault happen at the same time, always respond to desaturation first. Desaturation means the module is already in a destructive state — the collector-emitter voltage is high while current is flowing, which means massive instantaneous power dissipation. Temperature is a lagging indicator. By the time temperature trips, the module may already be damaged. Kill the gate drive within 5 microseconds of desat detection, then handle the temperature fault as a secondary response.

Undervoltage Lockout Protects the Gate Itself

Undervoltage on the gate driver supply is a silent killer. If the gate voltage sags below the minimum required level, the module enters the linear region and dissipates enormous power. Most gate driver ICs include undervoltage lockout, but the threshold is often set too low. Raise the UVLO threshold to at least 10 volts for IGBT modules and 8 volts for MOSFET modules. This ensures the gate never receives a weak signal that could cause partial turn-on and overheating.

Link the UVLO fault to the main controller so that a restart sequence is required after the fault clears. Do not allow automatic restart — a momentary voltage dip should not bring the module back online at full power without a controlled ramp-up.

Real-World Testing of Linked Overheat Protection

Test at the Edges, Not the Middle

Most engineers test overheat protection at nominal conditions — rated current, room temperature, clean power supply. That tells you nothing about how the system behaves when it matters most. Test at maximum bus voltage, maximum ambient temperature, and maximum load current. Then test with degraded cooling — partial fan failure, reduced coolant flow, clogged heat sink fins.

The protection system must work under all of these conditions. If it only trips at 175 degrees when the cooling is perfect but fails to trip at 150 degrees when the fan is slow, you have a protection system that only works in the lab.

Measure Junction Temperature Directly When Possible

Case temperature sensors are convenient but inaccurate. If your application demands high reliability, use a temporary junction temperature measurement during development. The VCE method — measuring the collector-emitter voltage during a short current pulse — gives you a direct reading of junction temperature without an external sensor. Use this data to calibrate your case-to-junction thermal model so that your protection thresholds are based on real junction temperature, not an estimate.

The protection thresholds you set based on accurate junction temperature data will be far more reliable than thresholds pulled from a generic datasheet curve. Every module has slightly different thermal characteristics, and every heatsink has different thermal resistance. Generic numbers are a starting point, not a finish line.


footer Upper Image