Supported formats
Datagrok supports over 50 file formats, including domain-specific like SDF, FASTA, and others. It also understands how data is structured or represented within those files (e.g., assay plates within an XLSX, SDTM conventions within a CSV, and notations like SMILES and HELM).
Structured and analytical data
Tabular and semi-structured data
| Format | Description | Required plugins |
|---|
.csv | Comma-separated values | -- |
.tsv | Tab-separated values | -- |
.txt | Plain text | -- |
.xlsx | Microsoft Excel | -- |
.d42 | Datagrok [dashboadr] | -- |
.json | JavaScript Object Notation | -- |
.xml | Extensible Markup Language | -- |
.HTML | HyperText Markup Language | -- |
| Format | Description | Required plugins |
|---|
.rds, .rda | R serialized data | -- |
.mat | MATLAB array and matrix data | -- |
.h5 | Hierarchical data in HDF5 | -- |
.nc, .netcdf | NetCDF scientific data file | -- |
.sas7bdat | SAS database file | -- |
.kxl | KeyCreator engineering data format | -- |
.pzfx | Prism 5 | Curves |
.prism | Prism 10 | Prism |
| Format | Description | Required plugins |
|---|
.parquet | Columnar storage format | Arrow |
.feather | Arrow-based binary format | Arrow |
.sqlite | SQLite database | SQLite |
Archives and packaging
| Format | Description | Required plugins |
|---|
.zip | ZIP archive (auto-extracted for supported types) | -- |
.tar | Tape archive | -- |
.gz, .gzip | Gzip-compressed files | -- |
Notebooks and documents
Notebooks and computations
| Format | Description | Required plugins |
|---|
.ipynb | Jupyter Notebook | Notebooks |
.ivp | Interactive differential equations | Diff Studio |
Docs and reports
Domain-specific formats
Molecular structures
| Format | Description | Required plugins |
|---|
.sdf, .sd | Structure-data file | Chem |
.mol | MDL Molfile | Chem |
.mol2 | SYBYL molecule representation | Chem |
.smi | SMILES file format | Chem |
SMILES | Linear representation of molecular structure | Chem |
SMARTS | Substructure search patterns | Chem |
InChI, InChIKey | IUPAC identifiers | Chem |
SMIRKS | Reaction transformation notation | Chem |
Macromolecules and sequences
| Format | Description | Required plugins |
|---|
.fasta | FASTA format (fa, fna, ffn, faa, frn, fa, fst) | Bio |
HELM | Biopolymer notation for peptides and oligonucleotides | Bio |
BILN | Linear notation for complex biopolymers | Bio |
Phylogenetic trees
| Format | Description | Required plugins |
|---|
.nwk, .newick | Newick format for phylogenetic trees | PhyloTree Viewer |
3D structures and density maps
Electron density and volumetric data
Bioassays and experimental data
| Format | Description | Required plugins |
|---|
.xlsx | Assay plate data (auto-detected in file) | Curves |
Spectroscopy and NMR
| Format | Description | Required plugins |
|---|
.jdx | Spectroscopy data (JCAMP-DX) | Chem Spectra Viewer |
.dx | NMR spectroscopy data (JCAMP-DX variant) | NMRium |
.nmrium | NMR experiment data for NMRIum viewer | NMRium |
Biosignals
| Format | Description | Required plugins |
|---|
.edf | Multichannel biosignal recordings (e.g., EEG, EMG) | -- |
Clinical and regulatory data
| Format | Description | Required plugins |
|---|
.csv | SDTM-formatted datasets (auto-detected from folder structure) | Clinical Case |
| Format | Description | Required plugins |
|---|
.kml | Geospatial annotation file (Google Earth) | GIS |
.kmz | Compressed geospatial annotation file | GIS |
.geojson | Geospatial data encoded in JSON | GIS |
.topojson | Compressed topological geospatial data | GIS |
Images and diagrams
| Format | Description | Required plugins |
|---|
.jpg, .jpeg | Photo, scan | |
.png | Screenshot, icon, UI graphic | |
.svg | Logo, diagram, UI asset | |
.bmp | Raw pixel data | |
.excalidraw | JSON-based vector drawings | Excalidraw |