Triumfant’s Automated Remediation – Not Voodoo, Sensible Can-Do
June 14, 2010 Leave a Comment
It has come to my attention that many have a hard time getting their heads around our automated remediation capabilities. The concepts around the way Triumfant performs automated remediation are really quite simple, so allow me to explain:
We know what changed. We continuously scan the machine for changes and if we see an indication that the machine is under attack we perform an accelerated full scan to kick off the analysis process. So when Triumfant’s patented analytics perform the analysis of a malicious incident, each and every change to the machine is available for consideration. Triumfant not only sees what has changed, but we are uniquely able to group changes to identify what changes are part of each specific incident. The analytics leverage over 25 different correlation algorithms to determine all of the primary and secondary artifacts from any given attack. We identify the attack and all of the changes associated withe the attack such as configuration changes and opened ports. The changes break down into three basic change types: unexpectedly present means that something new has been added, unexpectedly absent means that something that was there is no longer there, and unexpectedly modified means that the value has been changed.
We know what the attribute or file looked like before it changed. The first step performed by the Triumfant agent is to take a snapshot of the over 200K attributes we monitor. This includes an MD5 hash of every file on the machine. A copy of this snapshot is continuously maintained on the endpoint and on the Triumfant server. Therefore, Triumfant has a very logical and unique set of data that serves as the ingredients to write the remediation: we know what has changed, we know the current (changed) value, and we know the value prior to the change. Brutally simple in concept, but elegantly and efficiently executed.
We therefore can build a script to modify the things that changed back to what they used to be before they were changed. Once you know what attribute or file has changed and know what the attribute of file looked like before it was changed, it is not hard to construct a script to change things back. Actually, there are some challenges, but luckily our engineers have made it look simple. For example, it is easy to delete things that are not supposed to be on the machine, and it is easy to restore modified or deleted attribute values. It is not that simple to restore missing or corrupted files. That is why Triumfant’s donor technology (patent pending) is so remarkable. Triumfant uses our knowledge base (automatically generated) to find a donor machine that has the same missing or corrupted file (version, OS, validated by the MD5 hash) and uses that donor machine to provide a copy to move to the affected machine. I will explore the donor technology and the context that powers it in a future post, suffice to say the capability is completely unique to Triumfant and is an elegant solution to a very difficult problem when considering automated remediation.
Makes sense when you lay it out this way, doesn’t it? Triumfant uses this very simple logic flow to build a custom remediation script for each and every incident that is contextual, situational, and surgical. The script is constructed without the need for human intervention at the server and sent to the agent for execution after confirmation by an administrator. The remediation only affects those attributes and files that were part of the attack and does not affect any of the changes done to the machine outside of the incident. None of the user’s work or any of the benign changes to the machine are lost. And you should not have to re-image the machine out of fear that there may be artifacts of the attack still lurking on the machine.
This is not a rollback to an image, there is no interaction required by the end user, including the requirement (accept in the most extreme cases) to reboot. We are not pulling from a library of pre-written remediations that can’t possibly know enough to address all of the primary and secondary artifacts of an attack.
This is not VooDoo, but sound, sensible science. It takes the concepts of change detection and extrapolates it to the logical end – not only can Triumfant see the attacks that evade other defenses, it can build a remediation that stops the attack and removes all of the collateral damage of the attack. We are not a shield, but we go from infection (not detection, which for many tools takes days, weeks, even months) to remediation in less than five minutes. So given that the shields miss so much, the fact that malware exists on the machine for five minutes is a more than equitable trade-off for those organizations dealing with the advanced persistent threat, zero day attacks, and rootkits.
Finally, I know the term “automated” gives everyone heartburn. Everyone likes the concept, but is skittish on actually implementing. Not to worry. We build the remediation automatically, but by default it does not run automatically. The administrator will get an alert that malware has been detected, and the administrator can then evaluate Triumfant’s findings and validate the remediation before it is executed. And every remediation is completely reversible. We provide all of the analysis and write the remediation script, you actually put it into motion.
