Big Data

 


The amount of data is growing exponentially.
To deal with this huge amount of information and to extract its enormous hidden value, the DBgroup conducts research in three areas: data management, data analysis and data accessibility.
 
1. Data Management, i.e., how to handle the huge amount of data: since the volume of the data to be analysed is extremely large, the DBgroup is adopting cutting-edge technologies to manage Big Data (e.g. Apache Hadoop, Apache Spark, NoSQL/NewSQL DBMS).
 
2. Data Analysis, i.e., how to get valuable insight from the data and how to extract information to drive the decision-making process: given the huge amount of data involved, traditional techniques for machine learning and, more generally, for data analysis on “small” data are no longer applicable. Hence, the DBgroup focuses on developing new approaches that work in this context and integrate with the systems for Data Management.
 
3. Data Accessibility, i.e., how to foster the exchange and integration of data: in the described context, effectively and efficiently retrieving useful datasets for analysis is very challenging, due to the volume and variety of the data involved. The DBgroup is developing solutions to enable this. In particular, the DBgroup focuses on how to integrate different data sources: easily accessible datasets have significantly more value if they can be easily (and automatically) integrated with each other. (This point is also related to the DBgroup research activities in the field of Linked Open Data.)


News

  • DBGroup research on Big Data, presentation - slides
     
  • DBgroup will hold a course on "Big Data Analytics" in collaboration with CINECA from September 19th to September 22nd 2016.
    News - Program
     
  • The paper "Blast: a Loosely schema-aware Meta-blocking Approach for Entity Resolution", by Giovanni Simonini, Sonia Bergamaschi and H.V. Jagadish, has been accepted at VLDB 2016. PDF
     
  • Big Data in Emilia. Article in "Il Sole 24 Ore" on the Big Data hub in Emilia-Romagna. link


Talks

 


  • Professor Sonia Bergamaschi was an invited speaker at BDAA 2014 - lecture title "Big Data Analysis: Trends & Challenges" [IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), pp. 303-304] - SLIDES


Ongoing Projects & Collaborations

  • Big Data Exploration with Faceted Browsing:

Publications

  • G. Simonini, S. Bergamaschi, H.V. Jagadish "Blast: a Loosely schema-aware Meta-blocking Approach for Entity Resolution", VLDB 2016 - pdf

  • S. Bergamaschi et al. "Big Data Research in Italy: A Perspective" Engineering (Journal) - pdf

  • G. Simonini and S. Bergamaschi: "Enhancing Entity Resolution Efficiency with Loosely Schema-aware Techniques", SEBD 2016

  • S. Bergamaschi, G. Simonini, S. Zhu: "Enhancing Big Data Exploration with Faceted Browsing", 10th Scientific Meeting of the Classification and Data Analysis Group (CLADAG 2015) - link
     
  • G. Simonini, S. Zhu: "Big data exploration with faceted browsing". In IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2015), Special Session on Big Data Principles, Architectures & Applications,  Amsterdam, 20-24 July 2015.

  • S. Bergamaschi, F. Guerra, G. Simonini: "Discovering the topics of a data source: a statistical approach" - SWSD Workshop @ISWC 2014 

  • M. Interlandi, G. Simonini: "Towards Declarative Imperative Data-parallel Systems" - 22nd Italian Symposium on Advanced Database Systems, SEBD 2014

  • G. Simonini, F. Guerra: "Using big data to support automatic Word Sense Disambiguation" - IEEE International Conference on High Performance Computing & Simulation, HPCS 2014  

Conference participation and other activities

 

2016

 


2015

  • November 2015
    • Sonia Bergamaschi will be a member of the review panel of the SFI Research Centre INSIGHT, Ireland's Big Data and Analytics Centre, at the National University of Ireland, Galway, on 25-27 November 2015.
  • October 2015
    • Sonia Bergamaschi is a speaker at the CLADAG 2015 conference (8-10 October).
    • Song Zhu teaches the course "Tools and techniques for massive data analysis", promoted by Cineca, on 14-16 October.
  • July 2015
    • Francesco Guerra and Sonia Bergamaschi are track organizers of the Second International Workshop "Big Data Principles, Architecture & Applications (BDAA) 2015" as part of the International Conference on High Performance Computing & Simulation (HPCS 2015): http://hpcs2015.cisedu.info/2-conference/hpsc-2015-symposia/bdaa
    • G. Simonini, S. Zhu: "Big data exploration with faceted browsing". In IEEE Proceedings of  the International Conference on High Performance Computing & Simulation (HPCS 2015), Special Session on Big Data Principles, Architectures & Applications,  Amsterdam, 20-24 July 2015.
  • April 2015
    • Sonia Bergamaschi and Giovanni Simonini teach the course "Emerging Tools and techniques for massive data analysis", promoted by Cineca, on 8-10 April.
       
2014

 


  • July 2014 
    • Professor Sonia Bergamaschi is Session Chair at the BDAA 2014 Special Session on Big Data Principles, Architectures & Applications, as part of the International Conference on High Performance Computing & Simulation (HPCS 2014), and panelist at the HPCS 2014 panel "New Opportunities in High Performance Data Analytics (HPDA) and High Performance Computing (HPC)"
      [IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), pp. lxiii-lxv]
    • Professor Sonia Bergamaschi is invited speaker at BDAA 2014 - lecture title "Big Data Analysis: Trends & Challenges" [IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), pp. 303-304] - SLIDES
    • G. Simonini, F. Guerra: "Using Big Data to Support Automatic Word Sense Disambiguation". In IEEE Proceedings of the International Conference on High Performance Computing & Simulation (HPCS 2014), Special Session on Big Data Principles, Architectures & Applications, Bologna, 21-25 July 2014.

 

2013
  • May 2013
    • The DBgroup contributed to the white paper “UNLEASHING THE POTENTIAL OF BIG DATA” (link).
      The white paper is based on the 2013 World Summit on Big Data and Organization Design, initiated by the Organizational Design Community (ODC) and co-sponsored by IBM.

Copyright © 2017 DataBase Group. For suggestions, write to the Webmaster.