Semantic File Inspector

Hello!

I have made a tool that can describe any file or piece of data, including
its formats and contents, in RDF: the Semantic File Inspector, available at
https://sfi.is4.site/ (requires WebAssembly)! It currently supports over 30
different formats including common media formats, archives, executables and
documents, and is able to collect rich metadata including common file
properties and format-specific properties such as image dimensions and
others, it can also compute hashes using various algorithms to describe and
identify the data, and it encodes all of this in RDF using common
vocabularies found on the semantic web, with the possibility to save the
result in one of the many RDF serialization formats, or use SPARQL to
extract information or data.

Additional cool abilities:
- It can represent and describe files at different levels of abstraction,
for example: the file node itself, its binary/text content, the XML
document it encodes, and the object it represents.
- It can derive multiple formats from a single file, for example both ISO
and UDF from images.
- It looks recursively into archives or other resources storing resources.
- It can emulate MS-DOS executables and store their output as
dcterms:description.
- All components can be configured or disabled, if needed. Plugins may be
developed for additional functionality.
- No data is sent anywhere; all runs in the browser.

I would be happy for any feedback. Code will be available soon.

Enjoy!

Received on Thursday, 30 March 2023 13:55:44 UTC