Dossier (formerly DPACK-F) is designed to bring insight to unstructured data through rapid file characterization. Once file characterization has completed, the collected data can be used to make suggestions regarding capacity utilization, de-duplication, compression, and archiving, along with a variety of related insights. On completion, Dossier creates a PPT report that is available for download.
Dossier operates in three phases: discovery, collection, and reporting.
For the prototype, users manually browse to and select the root folders on the local host to analyze. The tool can optionally pre-populate some default folders. Default folders will include:
Once the folder paths have been selected, Dossier begins walking the file system tree under each target folder. For large file systems, this process may take up to several hours.
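To make the idea of walking a tree and characterizing files concrete, here is a minimal Python sketch. It is not Dossier's implementation: the function name characterize and the choice to summarize by file extension are assumptions made purely for illustration. It walks each selected root folder and tallies file counts and total bytes per extension, skipping anything it cannot read.

```python
import os
import stat
from collections import defaultdict

def characterize(roots):
    """Walk each root folder and tally file count and bytes per extension.

    Hypothetical helper for illustration; not part of Dossier.
    """
    summary = defaultdict(lambda: {"files": 0, "bytes": 0})
    for root in roots:
        # followlinks=False avoids descending into symlinked directories.
        for dirpath, _dirnames, filenames in os.walk(root, followlinks=False):
            for name in filenames:
                path = os.path.join(dirpath, name)
                try:
                    st = os.lstat(path)
                except OSError:
                    continue  # unreadable or vanished entry: skip it
                if not stat.S_ISREG(st.st_mode):
                    continue  # count regular files only
                ext = os.path.splitext(name)[1].lower() or "(none)"
                summary[ext]["files"] += 1
                summary[ext]["bytes"] += st.st_size
    return dict(summary)
```

Calling characterize with a list of folder paths returns a dictionary keyed by extension. A per-extension summary of this kind is one plausible input to capacity and archiving suggestions, though the metadata Dossier actually collects may differ.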
The Dossier collector walks the file systems visible to the host server, as configured by the user. In terms of the CPU, memory, and disk resources used during collection, walking the file system has an impact similar to that of a malware/anti-virus scan. A series of customizable options allows the user to deselect more invasive operations and to govern the CPU and memory resources allocated to the Dossier collector.
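As a rough illustration of how such options could shape a collection pass, the Python sketch below uses a hypothetical CollectorOptions class. The option names (hash_contents, pause_every, pause_seconds) are assumptions and do not reflect Dossier's actual settings. It treats content hashing as an example of a more invasive operation that can be switched off, and uses periodic pauses as a simple way to limit CPU and disk pressure.

```python
import hashlib
import os
import time
from dataclasses import dataclass

@dataclass
class CollectorOptions:
    # Hypothetical option names for illustration; not Dossier's real settings.
    hash_contents: bool = False  # reading file contents is the more invasive step
    pause_every: int = 1000      # number of files processed between pauses
    pause_seconds: float = 0.0   # sleep time used to yield CPU and disk

def collect(root, opts):
    """Return (path, size, sha256-or-None) for each readable file under root."""
    records = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                size = os.path.getsize(path)
                digest = None
                if opts.hash_contents:
                    h = hashlib.sha256()
                    with open(path, "rb") as f:
                        # Read in 1 MiB chunks to keep memory use bounded.
                        for chunk in iter(lambda: f.read(1 << 20), b""):
                            h.update(chunk)
                    digest = h.hexdigest()
            except OSError:
                continue  # skip files that cannot be read
            records.append((path, size, digest))
            if opts.pause_seconds > 0 and len(records) % opts.pause_every == 0:
                time.sleep(opts.pause_seconds)  # simple throttle
    return records
```

With hash_contents left at its default of False, the pass reads only metadata, which is the lighter-weight mode. Enabling it reads every file in full, so matching digests can flag duplicate content at the cost of considerably more disk I/O.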