In the digital age, the recovery of deleted data is a key challenge in digital forensics. With the constant increase in data volumes and storage methods, conventional methods are reaching their limits. This is where the Carve-DL research project comes in: an AI-based solution that can recover files that are difficult to reconstruct by the use of learning algorithms to sustainably improve the efficiency and accuracy of digital data reconstruction.
Find out more about the project on the official website: Carve-DL project page
Traditionally, forensic examiners use standardised, often manual processes to recover deleted data. While these methods rely on fixed file signatures or file system metadata, Carve-DL breaks new ground. Using advanced deep learning technologies, in particular Swin Transformer V2 and ResNet, the software can not only recover complete files but also reconstruct highly fragmented data. This enables precise recovery even in cases where traditional techniques prove to be insufficient.
Carve-DL is aimed at digital forensics specialists who need to reconstruct deleted or fragmented data. One example is the recovery of automatically deleted cache data from websites that is relevant to an investigation. Manipulated or deliberately destroyed digital evidence can also be reconstructed using AI.
Further background information on Carve-DL in criminal investigations can be found in the article: Carve-DL: Artificial intelligence in the fight against crime
The accompanying video uses a fictional crime story to show how Carve-DL can reconstruct deleted image data. In the fictional scenario, the Mona Lisa is stolen and all digital traces of the crime are deleted. The video illustrates how Carve-DL reconstructs the original record of the stolen painting from fragmented memory data of the thief, thus enabling forensic analysis.
This example is intended to illustrate the practical benefits of the developed AI methods: the system can identify, classify, group and correctly arrange deleted image fragments – a process that can also be crucial for real digital evidence.
Since the project kick-off in November 2022 significant progress has been made. The AI-Workflow has continousily been optimized to tackle the complex demands of digital forensics and data recontruction competently:
The use of Swin Transformer V2 and ResNet has significantly increased the efficiency of the system. In particular, Supportive Clustering with Contrastive Learning (SCCL) has increased clustering accuracy to around 85%.
One of the biggest challenges during the project was the indeterminate number and nature of the fragments to be reconstructed. Carve-DL solved this problem by processing this uncertainty early in the pipeline through iterative clustering.
Another problem was the scalable and efficient reordering of the fragments. To address these issues, a combination of digital signal processing and low-rank approximation (LoRA) was integrated in order to use computing resources more efficiently.
In addition to police investigations, Carve-DL shows promising potential for other fields of application:
With the Carve-DL project due to come to an end in October 2025, the research team draws a positive balance. The developed technologies show that AI-based data reconstruction can revolutionise digital forensics. Through innovative methods, it is possible to recover deleted or fragmented data with unprecedented precision.
Further information and current developments:www.carve-dl.de
Teamlead "GeT-AI" at the Department Smart Data & Knowledge Services, DFKI