FAQ

What steps are involved in switching production workloads to Firebolt once migration validation is complete?

Switching production workloads to Firebolt typically involves updating configuration to point to Firebolt endpoints. If all validation is complete and data is already present, this process is straightforward.

Deployment & Architecture

COPY LINK TO ANSWER

what-steps-are-involved-in-switching-production-workloads-to-firebolt-once-migration-validation-is-complete

https://firebolt.io/faqs-v2-knowledge-center/what-steps-are-involved-in-switching-production-workloads-to-firebolt-once-migration-validation-is-complete

What are Firebolt’s best practices for handling complex dashboard queries with varying granularity (e.g., daily, weekly, monthly)?

Firebolt recommends using aggregating indexes where possible for regularly queried granularities (e.g., daily or weekly), and employing pre-joined or pre-aggregated tables to simplify and speed up dashboard queries. Ensure indexes align closely with filter criteria to optimize query performance across various granularities.

SQL

COPY LINK TO ANSWER

what-are-firebolts-best-practices-for-handling-complex-dashboard-queries-with-varying-granularity-e-g-daily-weekly-monthly

https://firebolt.io/faqs-v2-knowledge-center/what-are-firebolts-best-practices-for-handling-complex-dashboard-queries-with-varying-granularity-e-g-daily-weekly-monthly

How can performance risks be mitigated when dealing with tenants of significantly different data sizes?

When a tenant comprises a large percentage of data (e.g., 20-25% of all data), avoid subqueries or joins that initially select large volumes of data and subsequently discard most rows. Instead, optimize queries and table structures to filter data as early and narrowly as possible, potentially using aggregated or pre-joined tables.

SQL

COPY LINK TO ANSWER

how-can-performance-risks-be-mitigated-when-dealing-with-tenants-of-significantly-different-data-sizes

https://firebolt.io/faqs-v2-knowledge-center/how-can-performance-risks-be-mitigated-when-dealing-with-tenants-of-significantly-different-data-sizes

What are Firebolt’s best practices regarding the use of views versus pre-joined tables for aggregations?

Firebolt supports both using views and pre-joined tables. However, if most of the query execution time is spent on joins rather than aggregations, pre-joining tables (i.e., creating wider, denormalized tables during data ingestion) is often more performant. Views are effective for reusable SQL but may become slower with complex joins at scale. Aggregating indexes, which can pre-materialize aggregation results for fast query responses, work best on single tables without cross-table joins.

SQL

COPY LINK TO ANSWER

what-are-firebolts-best-practices-regarding-the-use-of-views-versus-pre-joined-tables-for-aggregations

https://firebolt.io/faqs-v2-knowledge-center/what-are-firebolts-best-practices-regarding-the-use-of-views-versus-pre-joined-tables-for-aggregations

Are primary indexes critical to Firebolt's query performance, and how should they be managed during migration?

Yes, primary indexes significantly impact query performance in Firebolt. Ensuring correct and optimized indexes is crucial, especially during migration. Indexes should be carefully reviewed and implemented based on query patterns and use cases.

SQL

COPY LINK TO ANSWER

are-primary-indexes-critical-to-firebolts-query-performance-and-how-should-they-be-managed-during-migration

https://firebolt.io/faqs-v2-knowledge-center/are-primary-indexes-critical-to-firebolts-query-performance-and-how-should-they-be-managed-during-migration

How do I invite more users to join my account?

You can add more users to your Firebolt account by either adding them through the web application under or with SQL commands. First create a login, using the email address of your invitee as the login_id. Next, associate the login to a user and assign them the appropriate permissions. Your invitee wiill automatically receive an email invitation to join your account. For more information visit our documentation.

Miscellaneous

COPY LINK TO ANSWER

how-do-i-invite-more-users-to-join-my-account

https://firebolt.io/faqs-v2-knowledge-center/how-do-i-invite-more-users-to-join-my-account

What are the best practices for setting up Superset to connect to Firebolt for dashboarding?

Setting up Apache Superset with Firebolt involves: - Installing Superset locally or on a server. - Configuring the Firebolt connector with appropriate credentials and connection parameters. - Testing queries in Superset to ensure Firebolt’s indexing structure is leveraged efficiently. - Optimizing queries for dashboard performance by using Firebolt’s indexing features to minimize latency. In this case, there were some challenges with reinstalling Superset, but Firebolt’s team is available to assist with setup and troubleshooting.

Integrations

COPY LINK TO ANSWER

what-are-the-best-practices-for-setting-up-superset-to-connect-to-firebolt-for-dashboarding

https://firebolt.io/faqs-v2-knowledge-center/what-are-the-best-practices-for-setting-up-superset-to-connect-to-firebolt-for-dashboarding

How should primary indexes be selected in Firebolt to optimize query performance?

Primary indexes should include the most frequently used filters, such as tenant_id and date/time columns if queries consistently filter data by tenant and date ranges. A well-chosen primary index ensures queries access only relevant data partitions, maintaining fast performance even as data volumes scale significantly.

SQL

COPY LINK TO ANSWER

how-should-primary-indexes-be-selected-in-firebolt-to-optimize-query-performance

https://firebolt.io/faqs-v2-knowledge-center/how-should-primary-indexes-be-selected-in-firebolt-to-optimize-query-performance

What factors significantly impact query performance when joining high-cardinality tables in Firebolt?

Query performance in high-cardinality joins is significantly impacted by data cardinality, joins resulting in large intermediate row outputs, and data shuffles across nodes. Firebolt users should leverage the EXPLAIN ANALYZE functionality to identify expensive operations such as table scans, joins, and shuffles. Reducing data volume before joins through effective indexing, semi-joins, or aggregation indexes can mitigate these impacts.

SQL

COPY LINK TO ANSWER

what-factors-significantly-impact-query-performance-when-joining-high-cardinality-tables-in-firebolt

https://firebolt.io/faqs-v2-knowledge-center/what-factors-significantly-impact-query-performance-when-joining-high-cardinality-tables-in-firebolt

Are semi-joins (WHERE IN clauses) generally more performant in Firebolt than explicit joins for filtering datasets?

Yes, semi-joins (implemented via WHERE IN clauses) can be more performant than explicit joins, as Firebolt has built-in optimizations that leverage semi-joins for better data pruning. Using semi-joins helps reduce intermediate row counts earlier in query execution, especially beneficial for high-cardinality datasets.

SQL

COPY LINK TO ANSWER

are-semi-joins-where-in-clauses-generally-more-performant-in-firebolt-than-explicit-joins-for-filtering-datasets

https://firebolt.io/faqs-v2-knowledge-center/are-semi-joins-where-in-clauses-generally-more-performant-in-firebolt-than-explicit-joins-for-filtering-datasets

How do I query my S3 buckets using IAM roles?

First, on your S3 account, confirgure the permission policy found in the help center article, https://docs.firebolt.io/Guides/loading-data/configuring-aws-role-to-access-amazon-s3.html#use-aws-iam-roles-to-access-amazon-s3. While still in your AWS Identity and Access Management (IAM) Console, start the process to upload data through the plus sign icon in the develop space. After selecting an ingestion engine, you can select 'IAM Role' as your authetnication method and you can create an IAM role in the application. Copy the trust policy here and follow the rest of the instructions in the article to apply to your AWS account. Note that you don't actually have to upload anything to create the IAM role.

SQL

COPY LINK TO ANSWER

how-do-i-query-my-s3-buckets-using-iam-roles

https://firebolt.io/faqs-v2-knowledge-center/how-do-i-query-my-s3-buckets-using-iam-roles

What is the difference between CPU time and thread time in Firebolt's query profile analysis, and why is it important?

In Firebolt's query profiling, CPU time refers to the actual processing time on CPU cores, while thread time represents the total wall-clock time across all threads and nodes. When thread time is significantly higher than CPU time, it typically indicates waits due to data loading from storage (like S3) or node concurrency constraints. This distinction helps diagnose bottlenecks related to IO-bound or compute-bound workloads.

Low Latency

COPY LINK TO ANSWER

what-is-the-difference-between-cpu-time-and-thread-time-in-firebolts-query-profile-analysis-and-why-is-it-important

https://firebolt.io/faqs-v2-knowledge-center/what-is-the-difference-between-cpu-time-and-thread-time-in-firebolts-query-profile-analysis-and-why-is-it-important

What's the best practice for setting up a Firebolt engine to support high concurrency?

For high concurrency, use multiple clusters within your engine. Clusters help handle more simultaneous queries by distributing the load. Keep in mind that cache is shared across nodes in a cluster, but not between clusters, so the right balance depends on your workload. You can also consider using auto-scaling to dynamically adjust resources based on demand.

SQL

COPY LINK TO ANSWER

whats-the-best-practice-for-setting-up-a-firebolt-engine-to-support-high-concurrency

https://firebolt.io/faqs-v2-knowledge-center/whats-the-best-practice-for-setting-up-a-firebolt-engine-to-support-high-concurrency

How can I learn if there is an issue causing interruption to Firebolt services or applications?

Firebolt proatively maintains a status page at https://firebolt.statuspage.io/ where we keep you notified about any active incidents that may cause interruption to your access or services. From this page, you can also hit the 'subscribe' button to stay informed by phone, RSS, email, or Slack.

Miscellaneous

COPY LINK TO ANSWER

how-can-i-learn-if-there-is-an-issue-causing-interruption-to-firebolt-services-or-applications

https://firebolt.io/faqs-v2-knowledge-center/how-can-i-learn-if-there-is-an-issue-causing-interruption-to-firebolt-services-or-applications

What's the easiest way to label a query when running it from the Python SDK?

You can label a query by setting the query_label system setting before running it:

cursor.execute("set query_label = '<label>';")
cursor.execute("your_query_here")

‍

Here’s a full example using the Firebolt Python SDK:

id = '****'
secret = '****'

connection = connect(
    database="<db_name>",
    account_name="<account_name>",
    auth=ClientCredentials(id, secret)
)

cursor = connection.cursor()
cursor.execute("start engine <engine_name>")
cursor.execute("use engine <engine_name>")
cursor.execute("use database <database_name>")
cursor.execute("set query_label = '123';")
cursor.execute("select 1;")

print(cursor.fetchone())
connection.close()

SQL

COPY LINK TO ANSWER

whats-the-easiest-way-to-label-a-query-when-running-it-from-the-python-sdk

https://firebolt.io/faqs-v2-knowledge-center/whats-the-easiest-way-to-label-a-query-when-running-it-from-the-python-sdk

How can I make sure that my engines are not sitting idle and incurring infrastructure costs?

You can use the AUTO_STOP feature available in Firebolt engines to make sure that your engines are automatically stopped after a certain amount of idle time. Engines in stopped state will not be charged, hence do not incur any costs. As with other engine operations, this can be done via SQL or the UI. For example, while creating an engine, you can specify the idle time, using AUTO_STOP, as below:

CREATE ENGINE IF NOT EXISTS MyEngine WITH
TYPE = “S” NODES = 2 CLUSTERS =1 AUTO_STOP = 15;

The above command will ensure that MyEngine will be automatically stopped if it has been idle for 15 minutes continuously. Alternatively, you can achieve the same after an engine has been created.

ALTER ENGINE MyEngine SET AUTO_STOP = 15;

For mor information, please see the Engine Consumption Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/how-can-i-make-sure-that-my-engines-are-not-sitting-idle-and-incurring-infrastructure-costs

Engines

COPY LINK TO ANSWER

Button Text

How do I start and stop engines in Firebolt?

To start an engine:

sql START ENGINE MyEngine;

To stop an engine:

vbnet STOP ENGINE MyEngine;

For more information, please refer to the Work with Engines Using DDL article in the Firebolt Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/how-do-i-start-and-stop-engines-in-firebolt

Engines

COPY LINK TO ANSWER

Button Text

Will the creation of an engine automatically result in the creation of the underlying cluster(s)?

Yes. By default, creating an engine would result in the creation of the underlying engine clusters and start the engine. This would enable the engine to be in a running state where it is ready to start serving the queries. However, you have the option to defer the creation of the underlying clusters for an engine by setting the property “INITIALLY STOPPED” to True while calling CREATE ENGINE. You can start the engine at a later point, when you are ready to start running queries on the engine. Note that you cannot modify this property after an engine has been created.

CREATE ENGINE IF NOT EXISTS MyEngine WITH

TYPE = “S” NODES = 2 CLUSTERS =1 START_IMMEDIATELY = FALSE;

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/will-the-creation-of-an-engine-automatically-result-in-the-creation-of-the-underlying-cluster-s

Engines

COPY LINK TO ANSWER

Button Text

What happens to my currently running queries when I perform a scaling operation?

Your queries will continue to run uninterrupted during a scaling operation. When you perform horizontal or vertical scaling operations on your engine, Firebolt adds additional compute resources per your new configuration. While new queries will be directed to the new resources, the old compute resources will finish executing any queries currently running, after which they will be removed from the engine.

For more information, check out our Engine Consumption Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/what-happens-to-my-currently-running-queries-when-i-perform-a-scaling-operation

Engines

COPY LINK TO ANSWER

Button Text

Do scaling operations result in any downtime for my applications?

No. Scaling operations in Firebolt are dynamic and do not require stopping the engine, so your applications will not experience downtime.

For more information, check out our Engine Fundamentals Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/do-scaling-operations-result-in-any-downtime-for-my-applications

Engines

COPY LINK TO ANSWER

Button Text

How does scaling work with Firebolt engines?

In Firebolt, you can scale an engine across multiple dimensions. All scaling operations in Firebolt are dynamic, meaning you do not need to stop your engines to scale them.

Scale Up/Down You can vertically scale an engine by using a different node type that best fits the needs of your workload.

Scaling Out/In You can horizontally scale an engine by modifying the number of nodes per cluster in the engine. Horizontal scaling can be used when your workload can benefit by distributing your queries across multiple nodes.

Concurrency Scaling Firebolt allows you to add or remove clusters in an engine. You can use concurrency scaling when your workload has to deal with a sudden spike in the number of users or number of queries. Note that you can scale along more than one dimension simultaneously. For example, the command below changes both the node type to “L” and the number of clusters to two.

ALTER ENGINE MyEngine SET TYPE = “L” CLUSTERS = 2;

All Scaling operations can be performed via SQL using the ALTER ENGINE statement or via UI. For more information on how to perform scaling operations in Firebolt, see the Guides section in documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/how-does-scaling-work-with-firebolt-engines

Engines

COPY LINK TO ANSWER

Button Text

How do I create an engine in Firebolt?

All operations in Firebolt can be performed via SQL or UI. To create an engine, you can use the “CREATE ENGINE” command (shown above), specifying a name for the engine, number of clusters the engine will use, number of nodes in each cluster and the type of the nodes used in the engine. After the engine is successfully created, users will get an endpoint that they can use to submit their queries. For example, you can create an engine named MyEngine with two clusters, each with two nodes of type “M” as below:

CREATE ENGINE IF NOT EXISTS MyEngine WITH TYPE = “M” NODES = 2 CLUSTERS = 2;

This creates an engine named "MyEngine" with two clusters, each containing two nodes of type "M". For more details, see the documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/how-do-i-create-an-engine-in-firebolt

Engines

COPY LINK TO ANSWER

Button Text

What is the typical start-up time for the Firebolt engine? Is it Guaranteed?

The typical start-up time for a Firebolt engine is 10-15 seconds, but this is not guaranteed due to potential resource constraints on AWS.

For more information, check out our Sizing Engines Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/what-is-the-typical-start-up-time-for-the-firebolt-engine-is-it-guaranteed

Engines

COPY LINK TO ANSWER

Button Text

Is there a limit on the number of databases a given engine can support?

No. While there is no theoretical limit on the number of databases you can use with a given engine, note that the configuration of your engine will determine the performance of your applications. Based on the performance demands of your applications and the needs of your business, you may want to create the appropriate number of engines.

For more information, check out our Engine Permissions Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/is-there-a-limit-on-the-number-of-databases-a-given-engine-can-support

Engines

COPY LINK TO ANSWER

Button Text

Do engines and databases have a one-to-one relationship?

No. Engines and databases are fully decoupled in Firebolt. A given engine can be used with multiple databases, and conversely, multiple engines can be used with a given database. On Firebolt, all engines can write to the same database. No need to segregate engines as read-write and read-only.

For more information, check out our Engine Permissions Documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/do-engines-and-databases-have-a-one-to-one-relationship

Engines

COPY LINK TO ANSWER

Button Text

How many clusters can I use per engine?

You can use up to 10 clusters per engine.

For more information, check out our documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/how-many-clusters-can-i-use-per-engine

Engines

COPY LINK TO ANSWER

Button Text

How many nodes can I use for each cluster in a given engine?

You can use anywhere from 1-128 nodes per cluster in a given engine.

For more information, check out our documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/how-many-nodes-can-i-use-for-each-cluster-in-a-given-engine

Engines

COPY LINK TO ANSWER

Button Text

What are the different types of nodes available in Firebolt?

There are four node types available in Firebolt: Small, Medium, Large, and X-Large. Each node type provides a certain amount of CPU, RAM, and SSD. These resources scale linearly with the node type. For example, an "M" type node provides twice as much CPU, RAM, and SSD as a "S" type node.

For more information, check out the Engine Fundamentals article in our documentation.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/what-are-the-different-types-of-nodes-available-in-firebolt

Engines

COPY LINK TO ANSWER

Button Text

What are the key dimensions of an engine that determine its topology?

An engine has three key dimensions:

Type - This refers to the type of nodes used in an engine.

Cluster - A collection of nodes of the same type.

Nodes - The number of nodes in each cluster.

An engine comprises one or more clusters. Every cluster in the engine has the same type and the same number of nodes.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/what-are-the-key-dimensions-of-an-engine-that-determine-its-topology

Engines

COPY LINK TO ANSWER

Button Text

What are Firebolt Engines?

In Firebolt, an “engine” refers to a virtual compute resource that provides the processing power to execute queries, load data, and perform various SQL driven tasks. Unlike traditional cloud data warehouses, Firebolt engines can be resized, paused, and resumed in a much more granular, and cost effective way to optimize performance and cost.

https://firebolt013marketing.webflow.io/faqs-v2-knowledge-center/what-are-firebolt-engines

Engines

COPY LINK TO ANSWER

Button Text

What steps are involved in switching production workloads to Firebolt once migration validation is complete?

What are Firebolt’s best practices for handling complex dashboard queries with varying granularity (e.g., daily, weekly, monthly)?

How can performance risks be mitigated when dealing with tenants of significantly different data sizes?

What are Firebolt’s best practices regarding the use of views versus pre-joined tables for aggregations?

Are primary indexes critical to Firebolt's query performance, and how should they be managed during migration?

How do I invite more users to join my account?

What are the best practices for setting up Superset to connect to Firebolt for dashboarding?

How should primary indexes be selected in Firebolt to optimize query performance?

What factors significantly impact query performance when joining high-cardinality tables in Firebolt?

Are semi-joins (WHERE IN clauses) generally more performant in Firebolt than explicit joins for filtering datasets?

How do I query my S3 buckets using IAM roles?

What is the difference between CPU time and thread time in Firebolt's query profile analysis, and why is it important?

What's the best practice for setting up a Firebolt engine to support high concurrency?

How can I learn if there is an issue causing interruption to Firebolt services or applications?

What's the easiest way to label a query when running it from the Python SDK?

Explore All FAQs

How can I make sure that my engines are not sitting idle and incurring infrastructure costs?

How do I start and stop engines in Firebolt?

Will the creation of an engine automatically result in the creation of the underlying cluster(s)?

What happens to my currently running queries when I perform a scaling operation?

Do scaling operations result in any downtime for my applications?

How does scaling work with Firebolt engines?

How do I create an engine in Firebolt?

What is the typical start-up time for the Firebolt engine? Is it Guaranteed?

Is there a limit on the number of databases a given engine can support?

Do engines and databases have a one-to-one relationship?

How many clusters can I use per engine?

How many nodes can I use for each cluster in a given engine?

What are the different types of nodes available in Firebolt?

What are the key dimensions of an engine that determine its topology?

What are Firebolt Engines?

We use cookies to give you a better online experience