blank

Data engineering beyond just managing data

2024-11-28T15:12:00+00:00

Data engineering beyond just managing data

Monday 02/12 at 10am there will be a talk by Andrea Gioia (CTO Quantica, Milan) in the context of the Foundation of Data Engineering Course at INSA

Abstract

Data, on its own, is a liability. It only becomes a valuable asset when it drives visible outcomes that align with business strategy. To achieve this, it must be enriched with the right contextual information to make it actionable. This talk explores building an information architecture that enables data product interoperability, not just syntactically, but semantically. We’ll delve into transforming data into contextualized information using metadata, and then into knowledge by connecting it to an ontology, ultimately creating a knowledge graph. Finally, we’ll discuss the benefits of this data-centric architecture for diverse AI and analysis use cases.

Bio

Andrea Gioia is a Partner and CTO at Quantyca, a consulting company specializing in data management. He is also a co-founder of blindata.io, a SaaS platform focused on data governance and compliance. With over two decades of experience in the field, Andrea has led cross-functional teams in the successful execution of complex data projects across diverse market sectors, ranging from banking and utilities to retail and industry. In his current role as CTO at Quantyca, Andrea primarily focuses on advisory, helping clients define and execute their data strategy with a strong emphasis on organizational and change management issues. Actively involved in the data community, Andrea is a regular speaker, writer, and author of ‘Managing Data as a Product,’ book. Currently, he is the main organizer of the Data Engineering Italian Meetup and leads the Open Data Mesh Initiative. Within this initiative, Andrea has published the data product descriptor open specification and is guiding the development of the open-source ODM Platform to support the automation of the data product lifecycle. Andrea is an active member of DAMA and, since 2023, has been part of the scientific committee of the DAMA Italian Chapter.

Info

The talk will be on zoom:

Zoom Link: https://insa-lyon-fr.zoom.us/j/94849567368?pwd=mwwLQSty0IUIfDzIxYLsjsd5cra6se.1

Web Excursions 2024-11-03

2024-11-03T15:12:00+00:00

This page includes some resources that I found online since the last time

Wide AI A (humble) Data Management Perspective

2022-06-06T15:12:00+00:00

Abstract

The discussion around General Artificial Intelligence is now mainstream. The recent achievements of inductive reasoning research, e.g., GPT-3 and Dall-e, have raised several questions in the academic community that span from ethics to sustainability, passing by the remaining problem of interpretability. Arguably, the issue lies in the fragmentation of different areas of AI, which trends like Neuro-Symbolic Reasoning and Knowledge-Infused Learning are trying to fix.

Stressing on the role of context, the research initiatives above are rediscovering the value of interconnected data. In these regards, the data management community is partaking the debate, supporting the development of data systems and technologies like knowledge representation, automated reasoning, and (recently) knowledge graphs. In this talk, I offer a humble data management perspective, which builds on the three pillars of data management: data (intuitively) to collect and model, questions (aka queries) to express and answer, and systems that allow storage of the former and answer the latter. I will illustrate my analysis throughout the Meme Analytics Project, an ongoing initiative that incarnates well the hardness of human-level intelligence.

Video

Recommended Resources for PhD Students

2022-06-06T15:12:00+00:00

This page includes some resources that I found during my PhD and right after. These resources helped me find my way through various obstacles that the PhD, as a journey, presents.

Books

So Good they Can’t Ignore You by Cal Newport
The Structure of Scientific Revolutions by Thomas S. Kuhn
The Elements of Style by William Strunk Jr.
Range by David Epstein
Deep Work by Cal Newport
The Biggest Bluff by Maria Konnikova

Articles

Music

For Thinking

For Coding

For Reading

Streaming All the Things

2021-02-17T17:39:00+00:00

Abstract

We organise PlayTech Talks, a series of knowledge-sharing talks on interesting topics, technological or otherwise, at Playtech for some time already. Now we have decided to broadcast already the fourth PlayTech Talk live so that everyone can join in! Playtech Talks: Streaming All the Things will be held by Riccardo Tommasini (PhD), Assistant Professor of Data Management at the University of Tartu. In recent years, the data landscape has changed. Big data are no longer a vision, and data systems evolve to support a new generation of data-intensive applications. Stream processing is playing a central role in this game where real-time decision making is a must. In this talk, Riccardo will walk you through 10 years of industrial and academic research in the area. Moreover, he will focus on *state-of-the-art data streaming platforms, i.e. Apache Kafka, Flink, and Spark *Stream Reasoning, i.e, when Stream Processing meets deductive and inductive Artificial Intelligence.

PhD, A guide for Enthusiasts

2020-12-06T15:12:00+00:00

This is the recording of a talk I gave in December 2020, during the PhD Introduction Evening at the University of Tartu. The slides of the presentation are also available here. While below your can find the list of named books, references, and resources. Some of which are also linked in this blog post.

References

Process Continuous Streams of Large Volumes of Data to Detect Conditions and Anomalies in an Instant

2020-07-04T17:39:00+00:00

Webinar on Stream Processing with Flux

DBTA report my discussion of Flux

2020-03-01T17:39:00+00:00

Webinar on Stream Processing with Flux

Brief history on Stream Processing

2020-02-07T17:39:00+00:00

Video

White Paper on Stream Processing at Influx Data

2019-08-17T17:39:00+00:00

Webinar on Stream Processing with Flux

blank

Data engineering beyond just managing data

Data engineering beyond just managing data

Abstract

Bio

Info

Web Excursions 2024-11-03

Wide AI A (humble) Data Management Perspective

Abstract

Video

Recommended Resources for PhD Students

Books

Articles

Movies

Music

For Thinking

For Coding

For Reading

Streaming All the Things

Abstract

PhD, A guide for Enthusiasts

References

Process Continuous Streams of Large Volumes of Data to Detect Conditions and Anomalies in an Instant

DBTA report my discussion of Flux

Brief history on Stream Processing

Video

White Paper on Stream Processing at Influx Data