Information Processing and Data Visualization
Lesson Code Course Name Class Credit Lesson Time Weekly Lesson Hours (Theoretical) Weekly Lesson Hours (Practice) Weekly Class Hours (Laboratory)
UDOSZhA 4214 Development Of Big Data Processing And Storage Systems төртінші курс 5 150 1 2 2
Course Descriptions
Kazakh
PhD A.Abibullayeva

The subject is to teach students the skills of finding and interpreting big data sources, managing large amounts of data, combining data sources, ensuring the coherence of data sets and creating visualization that helps to understand data, building mathematical models using data, as well as practical work in the field of prerequisite: Post-Requisite: in particular, in the field of natural language processing.

---

Algorithms and data structures, database theory

Narrative, exchange of views, discussion, problem methods.

1Learns to analyze large amounts of information and organize data management.
2Implemented using the latest data processing and analysis Technologies.
3Will be able to create new models of the organization's information infrastructure, taking into account the capabilities of Big Data Technology.
4Intensively learns theoretical and practical aspects in the field of data analysis.
5Can develop and design various components of 5 - Distance databases and Information Systems.
6Uses and builds internet applications.
Haftalık KonuEvaluation Method
1Description of big data. The importance of data.
2Big data tools and their function.
3Basic data about distributions. Advantages and differences of Apache, Cloudera, Horton Works
4Hadoop architecture.
5Description of the platform. Components of the' HADOOP ' ecosystem.
6The basic principles of Hadoop. Hadoop components. Hadoop 2.0
7Ways MapReduce functions work with big data. MapReduce-algorithmization in the form of a graph.
8Technology of operation of Hive, Pig components. Execution of requests in components.
9Infrastructure and structural data generation via Hive.
10Framework for scaling Big Data Solutions: RDBMS, NoSQL and HBase
11Large-scale solutions using real data. MongoDB.
12Data analytics and visualization. Processing specific languages: Apache Kafka.
13Solutions in Apache Falcon and Ozie components. Technologies for the operation of Spark and Storm components.
14Machine learning library-Mahout's algorithm for working. Mahout clustering.
15Demonstration of the relationship between big data and artificial intelligence through the 'HADOOP' ecosystem.
Relationship between the Curriculum and Learning Outcomes
PÇ1PÇ2PÇ3PÇ4PÇ5PÇ6PÇ7PÇ8PÇ9PÇ10PÇ11PÇ12
Textbook / Material / Recommended Resources
1Исахметов Т.И., Шадиева А.А., Жаздыкбаева Д.П., Big Data технологиялар. Алматы -2022.
2A. K. Mukasheva, T. F. Umarov, I. A. Zimin, Big data analytics. Textbook, Almaty, 2022.
3Деректер қоры жүйелері Нур-Принт Алматы, 2012ж. Оқу-әдістемелік құрал
4Технологии и инфраструктура BIG DATA, И. А. Радченко, И. Н. Николаев, 2018, ИТМО, учебник, СПб.52
5Силен, Д. и др. Основы Data Science и Big Data. Python и наука о данных. / Д. Силен, А. Мейсман, М. Али.