Efficient data management tools for the heterogeneous big data warehouse / A. A. Alekseev [et al.]
Уровень набора: Physics of Particles and Nuclei Letters, Scientific JournalЯзык: английский.Страна: Россия.Резюме или реферат: The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics..Примечания о наличии в документе библиографии/указателя: [References: p. 692 (4 tit.)].Аудитория: .Тематика: электронный ресурс | труды учёных ТПУ | СУБД | система управления базами данных | Relational Database Management System | RDBMS | NoSQL | SQL | Big Data | Heterogeneous Data Warehouse | хранилища | данные | Apache Hadoop | Hive | MongoDB | Data Manipulation Language | DML Ресурсы он-лайн:Щелкните здесь для доступа в онлайнTitle screen
[References: p. 692 (4 tit.)]
The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics.
Для данного заглавия нет комментариев.