The Snowflake documentation says I can view my warehouse utilization in the Snowflake Classic UI:
https://docs.snowflake.com/en/user-guide/warehouses-load-monitoring.html
Is this available in the new UI?
Yes, it is available and it's explained here.
It basically looks like this:
Related
Hi :) I'm a novice when it comes to setting up a DWH from scratch. I have chosen Snowflake as our DWH, and am now trying to set up the ELT flows.
I understand that Snowflake has a Task object, which can run SQL and be scheduled as well.
However, there is a big push from the data community to use dbt for managing the T part of ELT.
Please can you tell me what's the difference between using Snowflake's Objects of Streaming and Task scheduling to dbt.
Thank you.
Here are two good links that explain why dbt is so great for data transformation tasks.
If I should sum it up:
Modularity (models, references, lineage)
Environments (development, deploy, scheduling)
Quality maintenance (version control (GitHub) & collaboration, tests, documentation)
Links
DBT's viewpoint: https://docs.getdbt.com/docs/about/viewpoint
And even the Snowflake community agrees: https://community.snowflake.com/s/article/Don-t-Do-Analytics-Engineering-in-Snowflake-Until-You-Read-This-Hint-dbt
Is there any tools or good formats to keep a documentation of NoSQL database?
The use cases are:
Keep track of the current db collections and their properties defined in schema
Having detailed description of what each properties means
Maintain a change log
Design the collection schemas before implementing
Help building new projects by new teams when they're using the same db
All these can be done using Google doc, confluence or other documentation tools. I want it know if there's any standard practises to make this documentation and if there are any existing tool out there to help with this.
I am curious to know how exactly elasticsearch manages data at its server? Does it have an inbuilt nosql database or does it store in files or is it using some existing db like mysql, mongodb etc?
Elasticsearch internally uses Lucene which uses the segments(stored in file system) to store the actual data and it uses the inverted index to enable the fast search capabilities.
Please refer elasticsearch official blog on bottom up which explains above statment in quite detail with examples.
How can I read multiple tables from salesforce?
Can I use groovy script? Or Salesforce connector provide to facility reading multiple tables??
Is there any other way ??
The SFDC connector supports Salesforce Query language (SOQL).
It's similar to SQL but it has it's limitations, JOINS are one of them.
SOQL does offers a way to query more than one table at the time by using relationship queries, it may not always be what you need but it get's you closer.
Please take a look at SFDC documentation, http://www.salesforce.com/us/developer/docs/officetoolkit/Content/sforce_api_calls_soql_relationships.htm
Again this language works with the SFDC connector so if you need more information about it you'll be better of asking in a SFDC forum ;)
HTH
you can use Native query language instead of Datasense query Language in your salesforce connector for query.
I am new to Semantic Web and I have a very basic question about
the JENA RDF Dataset. I read it from the documentation that a dataset is
a collection of graphs (or Models in the Java API). If I view the graph
(the model) as a RDF Alternative to Relational DB's 'Table', can I view
the dataset as a 'Database' ?
If so, then with TDB I should be able to create multiple
datasets. However in the documentation it says 'Every dataset is
obtained via TDBFactory.createDataset(Location) within a JVM is the same
dataset.' (http://jena.apache.org/documentation/tdb/datasets.html). I
also recall that the documentation said somewhere that the TDB does not
support multiple JVM now. Does that mean with TDB I can create ONLY ONE
dataset?
This is Andy's answer to my question in the jena users mailing list. Thanks, Andy.
Hi, Everyone
I am new to Semantic Web and I have a very basic question about
the JENA RDF Dataset. I read it from the documentation that a dataset is
a collection of graphs (or Models in the Java API). If I view the graph
(the model) as a RDF Alternative to Relational DB's 'Table', can I view
the dataset as a 'Database' ?
yes - sort of.
If so, then with TDB I should be able to create multiple
datasets. However in the documentation it says 'Every dataset is
obtained via TDBFactory.createDataset(Location) within a JVM is the same
dataset.' (http://jena.apache.org/documentation/tdb/datasets.html).
... for the same "location" argument ...
TDBFactory.createDataset("DB1") ;
TDBFactory.createDataset("DB2") ;
are different datasets
I
also recall that the documentation said somewhere that the TDB does not
support multiple JVM now. Does that mean with TDB I can create ONLY ONE
dataset?
TDB is the core database engine, and when used directly, you are using
in a kind of embedded mode.
You can use Jena Fuseki for sharing a dataset between applications (just
like you might share an SQL database between apps, except it's HTTP not
JDBC).
Andy