Could you please provide me with an article that gives an example of a DB design that is in 3NF but not in BCNF and then illustrates how to convert it to BCNF? All the articles that I saw which try to explain BCNF give examples of tables that are in 1NF and then convert them to BCNF. This doesn't let me see the difference between 3NF and BCNF.
Thanks in advance
An example with overlapping keys reveals the difference; having the predicate
[P]
and matching constraints(c x.y)
.[P] Employee
EMP
, with emailEMAIL
, took courseCRS
in yearYR
.(c 1.1) For each employee and course; that employee took that course at most once; it is possible that more than one employee took that course.
(c 1.2) For each each employee and course; that employee took that course in exactly one year.
(c 1.3) For each employee and year; it is possible that the employee took more than one course in that year.
(c 1.4) For each course and year; it is possible that more than one employee took that course in that year.
(c 2.1) For each employee, that employee has exactly one email.
(c 2.2) For each email, exactly one employee has that email.
(c 3.1) For each email and course; employee with that email took that course at most once; it is possible that more than one employee with that email took that course.
(c 3.2) For each each email and course; employee with that email took that course in exactly one year.
(c 3.3) For each email and year; it is possible that employee with that email took more than one course in that year.
(c 3.4) For each course and year; it is possible that more than one employee with specific email took that course in that year.
The FDs for this are
So, considering each one of these as
FD X --> Y
it holds that either,Therefore the
R
is in the 3rd NF.For BCNF, the requirement is that for any nontrivial
FD X --> Y
inR
, X is a superkey.Here is a check-list for 2NF to BCNF
Now we could decompose
R
into:which eliminates those two FDs to subkeys.
And finally, note that after decomposition into
{EMP, EMAIL} {EMP, CRS, YR}
or into{EMP, EMAIL} {EMAIL, CRS, YR}
these tables are now all in5NF
-- actually in6NF
, but that's not important now. It is important to observe that it is possible to get into5NF
-- and hence into: (4, BCNF, ..., 1) -- just by using logic, verbalizing predicate and constraints. In other words, for a developer: