代做UFCFLR-15-M Data Management Fundamentals 2020/2021代写Python编程
- 首页 >> DatabaseUFCFLR-15-M
Data Management Fundamentals
2020/2021
ONLINE EXAM
Instructions to Students:
· Exam has two sections (A,B). Answer ALL questions in A and ANY TWO questions in B.
· Section A carries 40 marks. Section B carries 60 marks.
· As is usual for an exam, for this assessment you are not expected to include full referencing, but are encouraged to cite the sources of key theories, models, case studies, statutes etc.
· This is an individual assessment: do not copy and paste work from any other source or work with any other person during this exam. Text-matching software will be used on all submissions.
· 3000 words. There is no +/- 10% on word count and anything after the maximum word count will not be marked, in line with UWE’s Word Count Policy.
Formatting
Please use the following file format(s) (Word or pdf) . We cannot ensure that other formats are compatible with markers’ software and cannot guarantee to mark incorrect formats.
Please include the module name and number and your student number (not your name).
Please indicate clearly which questions you are answering.
Instructions for submission
· Multiple submissions can be made to the portal, but only the final one will be accepted. Please save your work frequently.
· It is your responsibility to submit exam in a format stipulated above
Your marks may be affected if your tutor cannot open or properly view your submission.
· Do not leave submission to the very last minute. Always allow time in case of technical issues.
· The date and time of your submission is taken from the Blackboard server and is recorded when your submission is complete, not when you click Submit.
· It is essential that you check that you have submitted the correct file(s), and that each complete file was received. Submission receipts are accessed from the Coursework tab.
· There is no late submission permitted on this timed assessment.
Instructions to Candidates:
Answer all questions in Section A and any two (2) questions in Section B.
SECTION A
Answer ALL questions in this section.
Consider the following flat file database:
cust-id |
name |
address |
product-id |
desc |
quantity |
price |
order-id |
order-date |
12 |
Mr Joe Bloggs |
5 Muller Rd .. |
75 |
widget |
7 |
14.00 |
34 |
01/3/10 |
12 |
Mr Joe Bloggs |
5 Muller Rd .. |
12 |
nibbet |
3 |
9.00 |
34 |
01/3/10 |
13 |
Mr Harry Hill |
43 Dry Av.. |
75 |
widget |
4 |
8.00 |
36 |
01/03/10 |
13 |
Mr Harry Hill |
43 Dry Av.. |
23 |
gibbet |
2 |
3.50 |
36 |
01/03/10 |
14 |
Ms Jean Grey |
34 Tree Hill… |
12 |
nibbet |
5 |
12.50 |
43 |
03/03/10 |
Note: Only 5 records are shown and address field is concatenated.
QUESTION 1
Explain the major problems associated with holding and using data in this format. (12 marks)
QUESTION 2
Identify and explain the major advantages a RDMS offers over a flat-file database when holding and processing large amounts of data. (12 marks)
QUESTION 3
Analyse & normalize the above data to 3NF, create any entities (including implied entities) and draw a low-level ER diagram showing all entities, attributes, keys and foreign keys (these must be clearly marked).
NOTE: You do need to show your Normalization steps but only the final E-R model. (16 marks)
Question 4
Consider the following CSV file – 4 rows shown:
quote|source|dob-dod|category
There is no remedy but to love more.|Henry David Thoreau|1817-1862|romantic
Work is the curse of the drinking classes.|Oscar Wilde|1854-1900|humourous
The paradise of the rich is made out of the hell of the poor.|Victor Hugo|1802-1885|political
Religion is what keeps the poor from murdering the rich.|Napoleon Bonaparte|1769-1821|political
a) Not using any external modules, write a PYTHON script. to transform. the above data to a valid XML structure using the header fields as element names. You should also generate a row-id as an attribute for each row in the file, (12 marks)
b) Only using the pymysql & csv modules, write a PYTHON script. to create the three required tables and insert the CSV data above to a MySQL database. (18 marks)
Question 5
a) Identify and briefly explain the four different types of NoSQL data stores, giving examples of typical usage and available products. (16 marks)
b) Explain how the key business drivers of Volume, Velocity, Variability and Agility are driving commercial and public service based organizations to adopt NoSQL based systems and solutions. (14 marks)
Question 6
a) Identify & describe at least six factors that can be used to evaluate the quality and veracity of a Data Model. (12 marks)
b) With the aid of a diagram, distinguish between the Conceptual, Logical and Physical Data Models. Show how the different models relate to each other. (12 marks)
c) Identify the different agents involved in the Data Modelling process and their roles. (6 marks)
Question 7
a) With reference to the notion of “Polyglot Persistence”, describe and explain the reasons why organizations in both the private and public sectors are taking up NoSQL based solutions to meet their data storage and processing needs. (8 marks)
b) There are a range of parameters used to measure the strength and weaknesses of NoSQL & SQL databases. Scalability is one of these measures. What is the difference between horizontal and vertical scaling in and of these databases? (10 marks)
c) You are creating a web-based interface to allow for the ad-hoc querying and reporting on a large (and growing) collection of various types of customer invoice, where the main key is an invoice ID (but the customers and products purchased are also of high interest). With reference to key-value, document and graph store NoSQL systems, evaluate each as a potential persistence layer for the application. (12 marks)