Advances in Data Science: Symbolic, Complex, and Network Data

  • 3h 51m
  • Edwin Diday, Gilbert Saporta, Huiwen Wang (eds), Rong Guan
  • John Wiley & Sons (US)
  • 2020

Data science unifies statistics, data analysis and machine learning to achieve a better understanding of the masses of data which are produced today, and to improve prediction. Special kinds of data (symbolic, network, complex, compositional) are increasingly frequent in data science. These data require specific methodologies, but there is a lack of reference work in this field.

Advances in Data Science fills this gap. It presents a collection of up-to-date contributions by eminent scholars following two international workshops held in Beijing and Paris. The 10 chapters are organized into four parts: Symbolic Data, Complex Data, Network Data and Clustering. They include fundamental contributions, as well as applications to several domains, including business and the social sciences.

About the Authors

Edwin Diday is Emeritus Professor at Paris-Dauphine University-PSL. He helped to introduce the symbolic data analysis paradigm and the dynamic clustering method (opening the path to local models), as well as pyramidal clustering for spatial representation of overlapping clusters.

Rong Guan is Associate Professor at the School of Statistics and Mathematics, Central University of Finance and Economics, Beijing. Her research covers complex and symbolic data analysis and financial distress diagnosis.

Gilbert Saporta is Emeritus Professor at Conservatoire National des Arts et Métiers, France. His current research focuses on functional data analysis and clusterwise and sparse methods. He is Honorary President of the French Statistical Society.

Huiwen Wang is Professor at the School of Economics and Management, Beihang University, Beijing. Her research covers dimension reduction, PLS regression, symbolic data analysis, compositional data analysis, functional data analysis and statistical modeling methods for mixed data.

In this Book

  • Explanatory Tools for Machine Learning in the Symbolic Data Analysis Framework
  • Likelihood in the Symbolic Context
  • Dimension Reduction and Visualization of Symbolic Interval-Valued Data Using Sliced Inverse Regression
  • On the “Complexity” of Social Reality. Some Reflections about the Use of Symbolic Data Analysis in Social Sciences
  • A Spatial Dependence Measure and Prediction of Georeferenced Data Streams Summarized by Histograms
  • Incremental Calculation Framework for Complex Data
  • Recommender Systems and Attributed Networks
  • Attributed Networks Partitioning Based on Modularity Optimization
  • A Novel Clustering Method with Automatic Weighting of Tables and Variables
  • Clustering and Generalized ANOVA for Symbolic Data Constructed from Open Data