I have installed Rational Software Architect(RSA) for data modeling purpose. I'm very new to this area. Can someone help me with the connection establishment between RSA and the database.
You need to use IBM InfoSphere Data Architect (previously known as Rational Data Architect) as an intermediate conversion step if you want to reverse engineer a data model from a database into RSA.
InfoSphere Data Architect will allow you to connect to a database and reverse engineer it into a Physical Data Model (PDM). Once you have the PDM, you should convert it to a Logical Data Model (LDM) then transform to a UML model using the "data / transform" option in InfoSphere Data Architect.
Once you have done that you can load the UML representation of the data model into Rational Software Architect.
You can use a New "Physical Data Model", and in the "model file" choose from a database or from a Script, and that´s all. You do a reverse engineer, you dont need (IBM InfoSphere Data Architect)
Related
I'm an entry level data analyst for a medium sized franchisor overseeing ~100+ franchise locations.
I'm looking to implement a BI tool (such as Power BI), however we do not have direct access to any of our source data (no APIs or DB access). i.e. we must download our data in the form of reports from each of our 5+ IT platforms.
I'm currently using Excel and Power Query to convert these reports to a usable format for ad-hoc data analysis, however this is not an ideal solution for future BI requirements and historical data analysis work. I'm not sure the best method or tools to use to essentially create a new database from these reports (which are not always in a flat format).
Considering my situation, does anyone have any recommendations on any database platforms (i.e. MS Access, AWS Redshift, Azure, etc.) and/or ETL solutions so I could simply download the files and have them automatically "cleansed" and uploaded to a database?
Thank you
what exactly i want to ask u is like
In congnos we are using frame work manager as metadata modeler and semantic layer
similarly is there any tool or semantic layer like frame work manager in microsoft power BI
Yes - it's Power BI Desktop. It has a Power Query (Query Editor) for data integrations, Relationship View for table relationships, and Quick Measures backed by the DAX language for added calculations. Here's some starting points for deeper info:
https://powerbi.microsoft.com/en-us/documentation/powerbi-desktop-query-overview/
https://powerbi.microsoft.com/en-us/documentation/powerbi-desktop-relationship-view/
Published datasets can be resused on other reports.
Data warehousing seems to be a big trend these days, and is very interesting to me. I'm trying to acquaint myself with its concepts, and am having a problem "seeing the forest through the trees" because all of the data warehouse models and descriptions I can find online are theoretical, but don't gives examples with actual technologies being used. I'm a contextual learner, so abstracted, theoretical explanations don't really help me out all that much.
Now there seem to be many "data warehousing models", but all of them seem to have some similar characteristics. There is ually an "ODS" (operational data store that aggregates data from multiple sources into the same place. A process known as "ETL" then converts data in this ODS into a "data vault", and again into "data" and/or "strategy marts."
Can someone provide an example of the technologies that would be used for each of these components (ODS, ETL, data vault, data/strategy marts)?
It sounds like the ODS could just be any ordinary database, but the data vault seems to have some special things going on because it is used by these "marts" to pull data from.
ETL is the biggest thing I'm choking on by far. Is this a language? A framework? An algorithm?
I think once I see a concrete example of what's going on at each step of the way, I'll finally get it. Thanks in advance!
ETL is a process. The abbreviation stands for Extract-Transform-Load which describes what is being done with data during the process. The process can be implemented anywhere where you need to create a bridge between two systems with differenet data formats. First, you need to pull (exract) data from a source system (database, flat files, web service etc.), Then data are being processed (transform) to comply with format of a target storage (again it can vary: databases, files, API calls). During the transform step, further actions can be performed on the data set as enrichment with data from other sources, cleansing and improving its quality. The last step is loading transformed data into a target storage.
Typically, an ETL process is employed for loading a datawarehouse, migrating data from one system or database to another during moving from a legacy system to new one, synchronizing data between two or more systems. It is also used as an intermediate layer in broader MDM and BI solutions.
In terms of specific software, there are many ETL tools on the market ranging from robust solutions from big players as Informatica, IBM DataStage, Oracle Data Integrator, to more affordable and open source providers as CloverETL, Talend, or Pentaho. The most of these tools offer a GUI where flow and processing of data is defined through diagrams.
For Microsoft SQL Server 2005 and later the ETL tool is called SSIS (SQL Server Integration Services). If you install at least the Standard version of the SQL Server you get the Business Intelligence Developer Studio with which you can design your data flows. Basically what an ETL tool does is take data from one or more sources (tables, flat files, ...) then transform it (add columns, join, filter, map to different data types, etc.) and finally store it again to one or more tables or files.
To get a basic understanding of how something works you can watch e.g. this video or this one (both from midnightdba). They're a bit lengthy, but you get an idea. They certainly helped me in understanding the basic functionality of an ETL tool.
Unfortunately I have not yet digged into other platforms or tools.
I'd highly recommend checking out some of the books by Ralph Kimball and Margy Ross (The Data Warehouse Toolkit, The Data Warehouse Lifecycle Toolkit) for an introduction to data warehousing.
My company's data warehouse is built using the Oracle Warehouse Builder tool for ETL. The OWB is a GUI tool that generates PL/SQL code on the database to manipulate the data. After manipulation and cleansing, the data is published to an Oracle datamart. The datamart is a database instance that users access for ad-hoc querying via Oracle Discoverer (Java software).
I would like to know if there is any tool able to extract the design from an existing Oracle database.
I have a read access to it and I would like to have the design. Or simply export it to a *.sql and then read it with any tool able to create the design with the links between the tables.
Something like that
(source: fileguru.com)
Thank you very much.
There are a number of tools which do reverse engineering from an Oracle database, but most require a license.
Oracle has its own Data Modeller. Previously Oracle tried to charge an exorbitant licence fee but obviously they had few takers because it is now free. It is not a great product but it does a decent job on reverse engineering.
If you are a site which uses TOAD then you should definitely consider Quest's TOAD Data Modeler. The additional cost is not a great stretch from the basic TOAD fee.
There are not many other free tools left on the market but SQL Power offer a community edition of their SQL Architect tool which does reverse engineering. Check it out.
Oracle SQL Data Modeler answered to all my questions.
It has a reverse engineering tool that creates the logical and physical model from the database.
In Visio 2003 Architect version I can generate a SQL Server Database on the fly from UML ER diagram. But there is no such a way by using Visio 2007. My question is: Any other good and latest tools can support ER-> DB or DB->ER very well for SQL2005/2008?
I think Enterprise Architect can do that too.
http://www.sparxsystems.com.au/
You can do the ER diagram on Visual Studio creating a dbml file and then export to a Sql Server script using the huagati plugin. http://www.huagati.com/dbmltools/
If you're looking for a free/open source tool to achieve this, you could try:
DBDesigner
From the website:
DBDesigner 4 is a visual database design system that integrates database design, modeling, creation and maintenance into a single, seamless environment.
It combines professional features and a clear and simple user interface to offer the most efficient way to handle your databases.
DBDesigner 4 compares to products like Oracle's Designer©, IBM's Rational Rose©, Computer Associates's ERwin© and theKompany's DataArchitect© but is an Open Source Project available for Microsoft Windows© 2k/XP and Linux KDE/Gnome. It is release on the GPL.
And it supports the following features:
Available on Linux / MS Windows
User Interfaced based on industry standard layouting software
Design Mode / Query Mode
Reverse engineering MySQL, Oracle, MSSQL and any ODBC databases
Userdefined schema-generation
Model-To-Database syncronisation
Index support
Automatic foreign key placement
Weak entity support
Standard Inserts storing and sync
Full documentation capabilities
Advanced model printing
Output as image
All MySQL datatypes with all options
User defined datatypes
Database storage, ability to save model within database
Network-/Multiuser access through database storage
Version control*
SQL Query Builder
SQL Command History
SQL Command storage within model
Plugin interface
I think the bolded ones above (my own emphasis) should give you what you require (i.e. effectively two-way database/model synchronization).
I have used the tool in the past to reverse engineer an existing database to an ER diagram. From what I remember, it wasn't the most stable software I'd ever used (I seem to recall it crashed a couple of times) however, this was a little while ago so it may be more stable now.
I remember that in Visio 2003 that was only possible, I believe in the Enterprise Edition. Could that be your issue with 2007?
Most of the tools don't handle SQL 2005/2008 correctly. But CA ERwin® Process Modeler does the best job (at a cost though).
There is a community edition but it is limited to 25 objects per model.
Give it a try and let us know.