I am bit confused between fact and dimension tables and I am not able to clear my doubt . Thing is I have to design a schema where there is one keyword table . And corresponding to each and every keyword we have a date table and site table(that keyword is generated for which site) . Now having this scenario to work on I am very much confused regarding which table be assigned as fact and which one as dimension tables . Keyword table contains key_id and keyword name . Date table contains month , year and week . Site table contains name of site to which keyword belongs.Please suggest me architecture of this schema.
schema designing in data warehousing and mining
185 Views Asked by DEVANG PANDEY At
1
There are 1 best solutions below
Related Questions in SCHEMA
- Should Organization schema be on every page?
- How to save multiple string inside an array column in MongoDB
- How does Big Query differentiate between a day and month when we upload any CSV or text file?
- Does the language used in schema need to match the HTML lang attribute?
- graphql-java extension type redefine error from version 15
- In SilverStripe/GraphQL 4 how do I get the bulkLoader auto generated schemas to allow has_many and many_many updates?
- How to validate URL param with AJV in Fastify?
- AJV ignores 'required' rule for properties in nested, referenced object
- Form validation with yup - how to strip a field after validating it?
- JSON Schema draft v4 - additional Properties error with anyOf
- FME - Specify schema name in PostgreSQL database writer (ArcSde)
- JSONLD Issue: Works via IP but Fails with Domain/Sub-domain IN next JS 14
- How to drop all views in cascade manner from a schema in HSQL database
- Spark Not Null constrains in combination with badrecordspath for reading (delta) tables
- "Schema hasn't been registered for model" Error in mongoose only while populating
Related Questions in DATA-WAREHOUSE
- How to blind data in data warehouse when sending from preanonymized layer to anonymized layer while keeping referential integrity of all key columns
- Run Pyspark job using Matillion
- the right grain of a fact table in data warehouse
- Model the number of available spots in dimension or fact table?
- What is the most efficient way to generate a change data set given two SQL Server backup files?
- Does it make sense to use an IDENTITY column in a raw layer?
- Power BI star (constellation) schema: 2 fact tables with relation between each other
- Rolling Period Table or CTE
- Representation of sequential rules in data mining (sequence pattern mining)
- Snowflake Bulk Inserts vs. Single Row Inserts
- Data not showing in power bi report
- Data Warehouse to Power BI Desktop
- Error initialize process greenplum major upgrade 5.29.12 to 6.25.2 on centos 7
- Modify column type in Parquet file with ruby (using parquet Gem)
- Translating Snowflake warehouse usage to BigQuery
Related Questions in STAR-SCHEMA
- Star schema role in the Pentaho Mondrian OLAP server
- How to create Star Schema Benchmark(SSB) tables?
- SSIS Surrogate Key incrementation
- Can non-additive facts be part of transactional fact table
- Collapsing Rarely Changing Diminsions to a Fact Column
- Snowflake dimension with multiple levels mondrian
- schema designing in data warehousing and mining
- How are fact tables formed in relation to the dimension tables?
- what is the best way to join records of multiple dimension tables that are all connected by a common fact table
- How to populate dimensions with no matching columns?
- Table A's PK is being referenced by Table B's FK. Cannot drop Table A's PK
- SQL DWH create new foreign logic
- Creating a relationship between unconnected tables
- Order of duplicate keys in non-clustered indexes [SQL Server]
- OLTP-STAGGING-DW-DUPLICATES
Related Questions in SAIKU
- Pentaho ERROR: "every derived table must have alias"
- How to use Expressions in Measures in Schema Workbench?
- Problem with calculated member in Schema Workbench
- Saiku with Pentaho is dead?
- Saiku: How to obtain licence for Saiku Pentaho plugin?
- SAIKU UI SHARE GLOBAL VARIABLE BETWEEN VIEWS
- Saiku CE as a plugin for Pentaho 8.3
- How to count a total percentage for calculated measure in MDX (Mondrian)?
- can saiku mondrian make queries based on transactional data
- saiku application stuck at loading instead of loading login page
- Unable to deploy Saiku on Tomcat. Saiku gets stuck on the loading screen
- Saiku/Hitachi-Pentaho Schema Workbench
- Does Apache Kylin can still work with Mondrian 4.4 and Saiku CE?
- Exploring Apache Kylin cubes with Power BI?
- Why i cant install saiku in pentaho server 8.2
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
What are you measuring?
Are you counting how many times a keyword is generated for each site? All three of those tables are dimensions (assuming your date table has every date in it regardless). You need another table which is your fact, which tells you how many times a keyword was generated for the day (or even the hour - you should start as low as possible)
To put it another way - for a given site, can a keyword been generated more than once in a week? Was it generated 10 times in a week? Here's your fact table record:
In this example,
1joins to the primary key of your Date dimension,6joins to the primary key of your Site dimension and7joins to the primary key of your Keyword dimension.