Department of Data Science & Analytics

The Department of Data Science at the Institute of Computer Science focuses on extracting knowledge from large and complex data sets. The starting point is the processing and storage of data, both in relational database systems and in newer, highly distributed technologies (NoSQL databases and Hadoop).

Data engineering converts the often heterogeneous data into a usable form, so that information and insights can be gained with advanced analytical methods (analytics). Alongside traditional ad hoc analyses, statistical methods and intelligent algorithms are applied (data mining). A particular challenge is that the large data volumes require distributed processing on a cluster with suitable software tools. At the most advanced level of data science, artificial intelligence methods (machine learning) are also used, for example to make predictions. This enables companies to make objective, data-driven decisions.

In the field of data science, professors and staff work with these technologies in various application areas: medicine, the energy industry and business intelligence.

Our key areas of expertise

Our teaching

The following courses, among others, are supported by the department

Our equipment

  • Seminar room with 20 workstations
  • Project room for data science projects with 12 workstations
  • Numerous software packages for data modeling and management in the PC pools
  • Data science cluster with 10 computing nodes for distributed databases and analytics (e.g. Hadoop)
Copyright: THU
Data Science & Analytics - Use of low-cost hardware to realize a Hadoop cluster for distributed storage and processing of large data sets

Contact us