I am bit confused between fact and dimension tables and I am not able to clear my doubt . Thing is I have to design a schema where there is one keyword table . And corresponding to each and every keyword we have a date table and site table(that keyword is generated for which site) . Now having this scenario to work on I am very much confused regarding which table be assigned as fact and which one as dimension tables . Keyword table contains key_id and keyword name . Date table contains month , year and week . Site table contains name of site to which keyword belongs.Please suggest me architecture of this schema.
schema designing in data warehousing and mining
180 Views Asked by DEVANG PANDEY At
1
There are 1 best solutions below
Related Questions in SCHEMA
- Elasticsearch schema for multiple versions of the same text
- SQL Schema recommendations
- How does an LDAP Client eg Apache Directory Studio get schema from an LDAP Server?
- Looping functoid, Mapping
- No Synchronize Model with Database in Mysql Workbench
- XML Namespace URI with HTTPS?
- sqlite schema has quotes around table name after a table rename
- Group many postgresql databases into separate schemas into same database
- MongoDB: updating an array in array
- How to open an android application from browser
- Error when using JAXB to create JAva classes from XSD
- MongoDB data modeling issue
- Solr 5.1.0: How to set the unique key via Schema API
- Compare Array with Collection-Array containing Objects
- Updating MySQL database schema on update
Related Questions in DATA-WAREHOUSE
- Big data with spatial queries/indexing
- Joining date and time field in Tableau
- Talend Open Studio for Big Data
- spark stream and spark sql with data warehouse
- Errors in the OLAP storage engine: The attribute key cannot be found when processing
- Anchor modeling - tie: make first role?
- Is star schema still necessary for a big-data-warehouse?
- How to batch export raw data from Omniture (SiteCatalyst or Adobe Analytics)
- Omniture Data Warehouse Segments Issue
- Plotting data cubes
- SQL Server Storing DateTime as Integer
- When we use Datamart and Datawarehousing?
- How to merge two or more queries with different where conditions? I have to reuse the code which is being used in 1st where code
- Structural difference between Relational Databases vs. Multidimensional Databases
- Shell Script to Validate Filename
Related Questions in STAR-SCHEMA
- Star schema role in the Pentaho Mondrian OLAP server
- How to create Star Schema Benchmark(SSB) tables?
- SSIS Surrogate Key incrementation
- Can non-additive facts be part of transactional fact table
- Collapsing Rarely Changing Diminsions to a Fact Column
- Snowflake dimension with multiple levels mondrian
- schema designing in data warehousing and mining
- How are fact tables formed in relation to the dimension tables?
- what is the best way to join records of multiple dimension tables that are all connected by a common fact table
- How to populate dimensions with no matching columns?
- Table A's PK is being referenced by Table B's FK. Cannot drop Table A's PK
- SQL DWH create new foreign logic
- Creating a relationship between unconnected tables
- Order of duplicate keys in non-clustered indexes [SQL Server]
- OLTP-STAGGING-DW-DUPLICATES
Related Questions in SAIKU
- MDX Query in Saiku Analytics (date string to date)
- Does Saiku use Mondrian 4?
- Mondrian/Saiku - Closure Table - Null Pointer Exception
- How to avoid scentific notation in Saiku Attribute
- how to use formula in Pentaho Mondrian schema XML
- Saiku Analytics vs. Pivot4J
- schema designing in data warehousing and mining
- Cube in Analysis View not showing in Saiku Analytics
- Saiku with Pentaho is dead?
- Saiku ui deployed separately saiku No query found with name
- Saiku cannot find JDBC driver for oracle connection
- Saiku report is exporting to excel with uncorrect values
- Ugly colors on Saiku charts after upgrading to 3.0 CE
- Saiku OLAP wizard - "OK" button not working in Pentaho 5.2 Community Edition(CDE)
- How to use Expressions in Measures in Schema Workbench?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
What are you measuring?
Are you counting how many times a keyword is generated for each site? All three of those tables are dimensions (assuming your date table has every date in it regardless). You need another table which is your fact, which tells you how many times a keyword was generated for the day (or even the hour - you should start as low as possible)
To put it another way - for a given site, can a keyword been generated more than once in a week? Was it generated 10 times in a week? Here's your fact table record:
In this example,
1joins to the primary key of your Date dimension,6joins to the primary key of your Site dimension and7joins to the primary key of your Keyword dimension.