BIG DATA MANAGEMENT

Teaching in italian
BIG DATA MANAGEMENT
Teaching
BIG DATA MANAGEMENT
Subject area
ING-INF/05
Reference degree course
COMPUTER ENGINEERING
Course type
Master's Degree
Credits
9.0
Teaching hours
Frontal Hours: 81.0
Academic year
2020/2021
Year taught
2021/2022
Course year
2
Language
ENGLISH
Curriculum
PERCORSO COMUNE
Location
Lecce

Teaching description

Teaching program is provisional and may be subject to changes

Good knowledge of Object Oriented Languages (at least 1), techniques and tools. Elements of computer networks and Web technologies.

The aim is to provide the basics about the main database theories, techniques and tools to design / implement databases and database applications.

 

Topics:

•Database, relational databases, NoSQL and NewSQL;

•DataBase Management Systems;

•Relational Model and Relational Algebra;

•SQL: data definition and manipulation;

•Basics of Human-Computer Interaction and interface design for DB;

•Architectural aspects: Clients, Servers, Peers, Devices, IoT, …

•Big data, data lakes, data analytics, machine learning, AI;

Knowledge and understanding. Students must have a solid background related to the basics of big data management and information systems:

  • They must have the basis to think analytically, creativelly and critically and being able to create abstraction and problem solving skills to cope with complex systems
  • They must have a basic knowledge of design and implementation of big data management systems
  • They must have the tools to design transactional and analytical databases applied to different contexts
  • They must have the skills to argument data in different scenario, the tools for managing them, together with its impact.

Applying knowledge and understanding. After the course the student should be able to:

  • Describe the model and frameworks of an Information System; illustrate the main components of an information system from the technical and application perspective.
  • Distinguish conceptual, logical and physical models in big data management.
  • Model Online Transaction processing systems from a big data perspective, distinguishing among conceptual models, relational models and physical models
  • Model Online Analytical processing systems form a data perspective, distinguishing among conceptual, logical and physical models, being able to describe the relationships among them and the processes

Making judgements. Students are guided to critically approach the topics treated during the class, to compare different solutions to a problem, to identify and propose the most effective or efficient solution in an autonomous way.

Communication. Students must learn to communicate with heterogeneous audiences, explaining their position, in logical, coherent and effective way. During the course students will be provided with domain specific vocabulary and the proper scientific knowledge and methods to expose and argument in precise and formal way the main topics related to big data management and information systems

Learning skills. Students must acquire the critical ability to autonomously relate to the typical problems of  data and information management and, in general, cultural issues related to information systems and their management. They should be able to develop an approach to independently structure knowledge and methods learnt with a view to possible continuation of studies at higher (doctoral) level or in the broader perspective of cultural and professional self-improvement of lifelong learning. Therefore, students should be able to switch their learning approach according to different learning sources and the objectives they must achieve in terms of results and audience

The course aims to provide students with tools and knowledge for data management in business organizations. The course consists of frontal lessons and classroom hands on exercises. The frontal lessons are aimed at improving students' knowledge and understanding through the presentation of theories, models and methods; students are invited to participate in the lesson with autonomy of judgement, by asking questions and presenting examples. The exercises are aimed at using tools which supports the models and approaches presented

The exam is an interview made up of both practical and descriptive aspects

The practical part aims at evaluating to what extent the student has: 1) the ability to design data models according to the methodologies presented during the call, 2) reasoning about his/her choices and the capacity to integrate different concepts and tools.

The descriptive part follows the practical part and is aimed to verify to what extent the student has gained knowledge and understanding of selected topics and he is able to communicate them.

Office Hours

By appointment; contact the instructor by email or at the end of class meetings.

Chapters:
- 1: Databases and Database Users
- 2: Database System Concepts and Architecture
- 3: Data Modeling Using the Entity–Relationship (ER) Model
- 4: The Enhanced Entity–Relationship (EER) Model
- 5: The Relational Data Model and Relational Database Constraints
- 6: Basic SQL
- 7: More SQL: Complex Queries, Triggers, Views, and Schema Modification
- 8: The Relational Algebra and Relational Calculus
    8.1: Unary Relational Operations: SELECT and PROJECT
    8.2: Relational Algebra Operations from Set Theory
    8.3: Binary Relational Operations: JOIN and DIVISION
    8.4: Additional Relational Operations
    8.5:  Examples of Queries in Relational Algebra
- 9: Relational Database Design by ER- and EER-to-Relational Mapping
- 10: Introduction to SQL Programming Techniques
- 11: Web Database Programming Using PHP
- 12: Object and Object-Relational Databases
- 14: Basics of Functional Dependencies and Normalization for Relational Databases
    14.1: Informal Design Guidelines for Relation Schemas
    14.2: Functional Dependencies
    14.3: Normal Forms Based on Primary Keys
    14.4: General Definitions of Second and Third Normal Forms
    14.5: Boyce-Codd Normal Form
- 16: Disk Storage, Basic File Structures, Hashing, and Modern Storage Architectures
- 17: Indexing Structures for Files and Physical Database Design
- 20: Introduction to Transaction Processing Concepts and Theory

- 21: Concurrency Control Techniques

 

Teaching material: more concepts on requirement elicitation and database application design and implementation, multidimensional analisys, datawarehouse, big data, big data management, database security, database administration, NoSQL, NewSQL, distributed databases.

R. Elmasri, S. Navathe, Fundamental of Database Systems, 7a edizione, Pearson ed.

Balamurugan Balusamy, Nandhini Abirami R, Amir H. Gandomi, Big Data: Concepts, Technology, and Architecture,John Wiley & Sons Inc; 1. edizione 

Semester
First Semester (dal 20/09/2021 al 17/12/2021)

Exam type
Optional

Type of assessment
Oral - Final grade

Course timetable
https://easyroom.unisalento.it/Orario

Download teaching card (Apre una nuova finestra)(Apre una nuova finestra)