Teaching GuideTerm Faculty of Computer Science |
Mestrado Universitario en Computación de Altas Prestacións / High Performance Computing (Mod. Virtual) |
Subjects |
Data Analytics with HPC |
Contents |
|
|
|
Identifying Data | 2023/24 | |||||||||||||
Subject | Data Analytics with HPC | Code | 614973108 | |||||||||||
Study programme |
|
|||||||||||||
Descriptors | Cycle | Period | Year | Type | Credits | |||||||||
Official Master's Degree | 2nd four-month period |
First | Optional | 6 | ||||||||||
|
Topic | Sub-topic |
1. Introduction to Data Engineering | 1.1 HPC vs Big Data: similarities and differences in data management. 1.2 Hardware and Software Technologies for High Performance Data Engineering 1.3 Data Engineering in HPC infrastructures vs. Cloud environments |
2. Introduction to Data Analytics | 2.1 Exploratory Data Analytics 2.2 Introduction to Machine Learning |
3. Data Engineering phases | 3.1 Modeling (Formats, Compression, Designing Schemas) 3.2 Intake (Periodicity, Transformations, Tools) 3.3 Storage (HDFS and NoSQL DBs, HBase, MongoDB, Cassandra) 3.4 Processing (Batch, Real-Time) 3.5 Orchestration 3.6 Analysis (SQL, Machine Learning, Graphs, UI) 3.7 Governance 3.8 Integration with BI (Visualization) |
4 Use cases | 4.1 Applications to Internet of Things (Smart environments and Industry 4.0) 4.2 Applications to sciences and engineering |
|