Skip to main content

FAQ

Quick reference

Drug design and discovery

Q: Can Datagrok handle multiple complex data types in one interface?

Datagrok allows access to many different and complex data types in a single user interface, supporting use cases like DMTA and other integrated chemical and assay data workflows. See Example: Complex data types in one user interface

Q: Can Datagrok support visualization and analysis for different biological and chemical modalities?

Datagrok supports multiple modalities, including:

Q: What advanced molecular analyses does Datagrok support?
Q: Is there a sample dashboard for chemists?
  • This dashboard has multiple visualizations, including chemical spreadsheet, multiple scatterplot views, R-groups, and molecule cards
  • This DMTA spreadsheet shows multiple modalities in one view
  • The Demo App showcases common analyses like MMP or chemical space, and how they look in Datagrok
Q: Is there a sample dashboard for biologists?

The Demo App showcases common analyses like sequence space or Peptide SAR, and how they look in Datagrok.

Assays, plates, curves

Q: Does Datagrok support compound-centric and assay-centric data views?

Yes. Datagrok supports both:

Users can configure coloring, highlighting, and switch between compounds as rows or columns, and filter, search, and sort the data.
Common actions | Substructure search

Q: How can I ingest raw assay data into Datagrok?

Raw data can be ingested through multiple channels, including drag-and-drop, file shares, database connections, OpenAPI, and integrations with services like Benchling or Revvity Signals. Datagrok is developing a Plates application to support predefined plate templates, batch ingestion, and integrated analysis in one place.

Q: How can I perform curve fitting, normalization, QC calculations from raw assay data in Datagrok?

The Curves plugin provides a complete workflow for converting raw assay data into fitted curves with QC calculations. It transforms well-level assay data into fitted dose–response curves, supporting functions such as 4PL and sigmoid, and can be easily extended to additional analyses, including Km/Vmax, dose ratio, melt curves (DSF Tm), and qPCR, by defining the corresponding functions and parameters.

For modeling dynamic systems, see the Diff-Studio plugin that allows users to solve sets of differential equations interactively through an intuitive UI and a declarative approach.

Q: Can Datagrok visualize dose-response curves for multiple compounds?

Datagrok supports visualizing DR curves for multiple compounds simultaneously and allows plotting curves using custom equations and parameters on the fly (e.g., 4-parameter curve fit over a 10-point dose-response).
Multicurve viewer | Use case for assay plates and DRC

Deploy

Q: How is Datagrok deployed?

Datagrok supports Docker, Kubernetes, on-premises, and cloud deployments (AWS, GCP, Azure).
Learn more about Deployment

Q: What are the end-user computer and browser requirements?

Datagrok runs completely in the browser. There is no need to install any additional software. Datagrok is compatible with modern browsers, including Chrome, Edge, and Safari.

Q: How is logging and monitoring handled?

Datagrok uses AWS CloudWatch to collect and analyze logs and metrics.

Q: How are backups and restores managed?

AWS provides automated RDS and S3 backups; RDS can also be restored as a standard PostgreSQL database.

Q: What is the disaster recovery strategy?

High availability is supported via Docker and AWS clusters that automatically restart failed instances.
See Disaster recovery demo for more information.

Access

Q: What data sources can Datagrok connect to?

Datagrok connects to any data source, including databases, file storage systems, web services and APIs. The platform supports 50+ file formats, including domain-specific like SDF, FASTA, and others.

Q: Can I connect Datagrok to BigQuery or other data stores?

Yes. Datagrok connects to BigQuery and most other popular databases out of the box. In addition, any machine-readable data source can be easily integrated.

Q: Can Datagrok accommodate users with different levels of experience for query creation and usage?

Yes. Power users (e.g., modelers) can create or edit queries and share them with their teams, while light users (e.g., experimentalists) can use these queries directly without creating or maintaining them. See Working with queries and Scripting.

Q: Can users create data queries in Datagrok without prior knowledge of data sources, structures, or SQL?

Yes. Users can build queries using the Visual Query Editor or the AI Query Builder.

Q: Can data queries in Datagrok be exported for storage, sharing, or reuse?

Yes, see Sharing query results.

Q: Can users access data in real-time with responsive performance in Datagrok?

Yes. Users can access data in real-time, and the UI loads data quickly. Dynamic dashboards enable interactive exploration.

Govern

Q: Do you follow secure development standards and industry best practices?

Yes, we adhere to secure development and industry best practices across infrastructure, development, and enterprise security:

  • Security-first infrastructure – designed with security from the ground up. Features include secure credentials management, flexible authentication (OAuth, SSO, Active Directory), and role-based access control.
    Learn more | Authentication & authorization | Role-based access

  • Quality assurance – automated check for vulnerabilities (Snyk, Grype) and multi-layered testing including unit, integration, UI, and performance tests.
    Learn more | Testing

  • Secure development lifecycle – CI/CD pipelines enforce automated build, test, and security checks; semantic versioning; and secure credential management.
    Learn more | Version control | Credential management

Governance & access

Authentication & authorization

Q: How does Datagrok handle authentication and authorization?

Datagrok uses role-based access control and integrates with enterprise identity providers such as LDAP, SSO, and OAuth. Learn more about access control

Q: Is data in Datagrok restricted to users with proper permissions?

Yes. Data is accessible only to users with appropriate permissions, supporting cross-project access and differentiation between internal and external (CRO) users. See User permissions and Authorization.

Q: Who has access to Bitbucket, and what are the authentication requirements?

Only datagrok core developers have access. MFA is being enabled to strengthen authentication. Bitbucket also provides built-in security scanning features.

Q: How is MFA enforced for GitHub accounts?

GitHub requires all code contributors to enable two-factor authentication (2FA) as of March 2023. Developers comply by using device-tied passcodes.

Data privacy & usage

Q: What happens to my data when I open a local file in Datagrok?

When you open a local file in Datagrok (like dragging and dropping a file to your browser), you can analyze it without saving. This data stays in your browser's memory and isn't sent to the server unless you run resource-intensive server-side computations. Your data is gone when you close the browser tab. To save your work, you need to upload it to the server. Note that uploading data does not make it accessible to others. Your data stays private and visible to you only until you explicitly share it. Learn how to share and share data.

Q: What data or telemetry is sent back to Datagrok?
  • No automatic telemetry: Datagrok does not send any data to Datagrok servers by default.
  • Optional error report/feedback: Users can optionally send feedback or error reports to Datagrok by selecting the "Email" checkbox in the corresponding dialog. This feature can be disabled by the administrator. For details, see Feedback
  • Package/image pulls: Datagrok can download images from Docker Hub or packages from NPM.
Q: When deployed on-prem, is my data ever gets sent anywhere?

Your data is safe and secure with Datagrok, and nothing gets sent outside your security perimeter by default. All computations are done either in-the-browser, or in docker containers deployed on your virtual cloud. However, some of the optional plugins are explicitly designed for connectivity with external systems. good examples are CDD Vault Link, Benchling Link or Chemspace.

Data security

Encryption

Q: Is data encrypted at rest?

Yes, Datagrok relies on Amazon's built-in encryption for RDS and S3 buckets.

Q: Is data encrypted in transit?

Yes, all client-server communications use HTTPS, which means it is secure and encrypted.

Vulnerability & patch management

Q: How do you manage vulnerabilities in the application and cloud infrastructure?

We monitor CISA alerts and run daily Snyk scans on container builds. Vulnerabilities are triaged, remediated via infrastructure-as-code pipelines, and verified through CI/CD testing. Customers are notified and supported in version upgrades as needed. Learn more about Infrastructure

Q: How are OS and server patching handled?

Jenkins and development servers are rebuilt quarterly with a fresh OS. Documentation for builds is maintained in internal repositories. Learn more about Deployment

Endpoint & device security

Device controls

Q: What controls mitigate risks on BYOD devices?

We have a policy that all devices must have active malware protection, updated signatures, and drive encryption enabled.

Q: How is data exfiltration via removable media prevented?

We have a policy that sensitive data must not be stored on removable media. Passwords or credentials must not be transmitted unencrypted.

EDR

Q: What is Datagrok’s current EDR approach?

No dedicated EDR is deployed on associates’ devices. However, all devices must have up-to-date and active malware protection. Associates are required to report any suspicious activity via the alert channel. Incident response team triages and executes the incident management process.

Q: How are security logs collected and monitored?

AWS resources use centralized logging. Currently, there is no proactive log review, but reporting of failed logins with alert thresholds is being implemented.

Transform

Q: Can I add a calculated column from a user interface?

Yes. The Add new column feature is very powerful. You can:

  • write your own expressions manually
  • use 500+ available functions, or
  • add your own functions using custom packages and scripts.

For example, from a SMILES column you can generate molecular properties (e.g., MW, cLogP) or ADMET predictions.

Explore

Q: Can Datagrok handle complex scientific data types in a single interface?

Datagrok supports a wide range of capabilities, including:

Multiple complex data types in one interface | Explore data

Visualize

Q: What are the maximum dataset sizes?

Datagrok handles millions of data points interactively for visualization and exploration.

The fundamental limitation is the amount of RAM allowed to use by the browser tab, which is different between browsers (currently no limit for Firefox, and 4GB for Chrome). With Datagrok's efficient in-memory data engine, even the 4GB lets you work with the following datasets:

  • 10 numerical columns: 100,000,000 rows
  • 100,000 numerical columns: 1,000 rows
  • Small molecules as SMILES: 10,000,000 rows

And if the dataset won't fit in the memory, Datagrok provides powerful ways to work with the database.

Q: What visualization options does Datagrok provide for data analysis and chemical structures?

Datagrok supports rich visualization for both chemical and general data, including:

See also: Viewer gallery, Table view, Grid.

Q: Can I create custom visualizations?

In addition to the built-in viewers, which are highly customizable and extendable, you can create fully custom visualizations using various approaches:

developers

You can visualize tabular data (viewers, cell renderers), individual objects (e.g. molecules), file and folder contents. See Can developers create custom visualizations? for more information.

Q: How can I visualize old and new data side by side?

Multiple viewers and grids can be combined in a single dashboard to display different datasets side by side. Each viewer (scatterplots, charts, etc.) has configurable settings, allowing plots to source data from different tables. There are also multiple ways to link tables, join, pivot, and apply other transformations to data within the intuitive UI.

Q: Can Datagrok tables be exported for presentations or analysis?

Tables can be exported in multiple formats, including images, CSV/TSV, and SDF.
Learn more about Downloading

Q: Can I merge multiple columns into a single "form cell" that displays label-value pairs (e.g., cLogP: 2.09) for these columns as a formatted list within that cell?

Yes. Use smart forms.

See visual

smart forms

Q: Can I pivot a table so each object (e.g., molecule) becomes a column and values (e.g., molecular properties) appear as rows?

Yes. Use Forms viewer.

See visual

Forms viewer

Q: How does color coding work in Datagrok?

Datagrok supports several color coding modes applied at the column level, with colors automatically synchronized across viewers:

  • Linear: Maps numeric values to a color gradient. You can configure the color palette, min/max values, below/above min colors, and invert the scheme. Absolute Linear mode allows assigning exact colors to specific numeric values.
  • Conditional: Applies colors based on value ranges or rules. You can configure the rules dictionary (ranges or comparison operators) and choose whether to apply colors to text or background.
  • Categorical: Assigns colors to categorical values (strings, booleans). You can configure the category-to-color map, fallback color, and select the color scheme via UI or API.

Domain-specific color coding: For chemical data analysis, the Scaffold tree assigns colors to scaffold structures that can be used as the color source in scatterplot. Scaffold colors follow inheritance rules where child nodes inherit parent colors unless explicitly overridden.

Color coding remains consistent across the grid and all viewers that reference the same column. Color coding persists in saved projects and layouts.

See visual

Forms viewer

Q: How can I search and filter data quickly and intuitively?

Datagrok provides advanced and intuitive search and filtering capabilities:

Collaborate

Q: Can users save and share custom dashboards, analyses, and visualizations in Datagrok?

Users can create projects that include data, analyses, and dashboards, then share them with others. Dashboards are dynamic and customizable.
Creating and managing projects | Creating dynamic dashboards

The Sticky meta feature allows tagging compounds, sequences, or other entities with meta-information, such as calculated properties, analysis results, or comments. Tagged information can be recalled anywhere in Datagrok when the same entity appears, supporting collaborative workflows and effective communication between team members.

Q: Can users subscribe to notifications about new or updated data in Datagrok?

Datagrok keeps you informed about relevant activity. See Collaborate, Notifications, and Discussions.

Develop

Architecture

Q: How is Datagrok architected?

Datagrok is built as a modular, service-oriented platform with client-server separation.
Learn more about Datagrok's architecture

Interoperability

Q: Can Datagrok call external web services?

Yes, both client and server can call authenticated web services and integrate responses with the Datagrok API. See Developer guide for details

Q: How do I interact with Datagrok from the server?

Use Datagrok’s REST server API with authentication tokens.
See Proof of concept video for more information

Q: How do I interact with Datagrok from the client?

Use the JavaScript API for custom applications and extensions.
Learn more about JavaScript API

Developer experience

Q: Does Datagrok support programmatic integration through customized scripts and APIs?

Datagrok provides robust capabilities for programmatic integration:

  • Scripting: users can automate tasks and extend functionality using JavaScript or Python scripts. See Scripting
  • APIs: Datagrok provides two primary types of APIs: JavaScript and Python. See API
  • Develop: the platform supports the development of custom plugins and modules to enhance its capabilities. See Develop
Q: How do I develop and debug Datagrok customizations?

Follow the step-by-step guide for creating your own package. If your package is based on a non-standard template, you may need to configure it differently for debugging. See also a debug demo for details.

Q: How does Datagrok handle package development and deployment?

The process includes package deployment, dependency management, and versioning.
See Developer guide for details

Q: Can multiple developers work concurrently?

Yes, with versioning, branching, and package management.
Learn more about Concurrent development

Q: Are Datagrok’s functions and APIs well documented?

Datagrok provides detailed documentation for all functions and APIs to help developers integrate, automate, and extend the platform. See APIs.

Q: Does Datagrok provide support and training for users?

Datagrok offers extensive help, videos, community forum, self-guided learning, tutorials, and on-platform demos to help scientists gain familiarity and mastery of the platform.

Scalability and performance

Q: How stable is the platform under heavy load?

Datagrok is designed for horizontal scaling, ensuring stability with concurrent users.
See Scaling and stability for more information

Extensibility

Q: Can developers create custom visualizations?

Datagrok offers multiple options for creating and embedding custom visualizations:

Q: Can I build server-side components?

Yes, custom back-end logic can be added.
See Admetica example

Q: Can I add scripts and reuse them in components?

Yes, Datagrok supports multiple scripting languages.
Learn more about Scripting

Q: Can I reskin Datagrok?

Yes, you can build tailored UI applications.
See Example app

Q: Can I build full custom applications?

Yes, including workflows, data models, state management, and persistence.
See Custom application packages

Frontend

Q: What frontend capabilities does Datagrok provide for scientific data analysis?

Datagrok delivers a high-performance, interactive frontend for handling, visualizing, and analyzing large-scale scientific datasets. Key capabilities include:

  • Data capacity and client-side handling – load and explore millions of rows directly in the browser using Datagrok’s in-memory data engine. Supports high-throughput and virtual screening scenarios with GPU-accelerated visualizations.
    See also: Example: 2.7M ChEMBL molecules | WebGPU acceleration
  • Cross-connecting multiple large data tables – perform sophisticated data manipulation including joins, filters, and aggregation. Linked tables enable interactive cross-referencing across datasets.
    Learn more about Transformations
  • Visualization capabilities – Over 50 interactive viewers for synchronized dashboards, including chemically-aware viewers
  • 2D/3D structure rendering and interaction.
    See Cheminformatics for details
  • Interactivity features – live data masking, filter by selection, synchronized updates across viewers, multiple input methods including 2D sketching.
    See Common actions, Structure search, and Sketching and editing for details
  • API integration and extensibility. See JavaScript API for details
  • Frontend scripting scalability. See Compute for details