publications
publications by categories in reversed chronological order.
2024
- Movie tag prediction: An extreme multi-label multi-modal transformer-based solution with explanationJournal of Intelligent Information Systems 2024
Providing rich and accurate metadata for indexing media content is a crucial problem for all the companies offering streaming entertainment services. These metadata are commonly employed to enhance search engine results and feed recommendation algorithms to improve the matching with user interests. However, the problem of labeling multimedia content with informative tags is challenging as the labeling procedure, manually performed by domain experts, is time-consuming and prone to error. Recently, the adoption of AI-based methods has been demonstrated to be an effective approach for automating this complex process. However, developing an effective solution requires coping with different challenging issues, such as data noise and the scarcity of labeled examples during the training phase. In this work, we address these challenges by introducing a Transformer-based framework for multi-modal multi-label classification enriched with model prediction explanation capabilities. These explanations can help the domain expert to understand the system’s predictions. Experimentation conducted on two real test cases demonstrates its effectiveness.
2023
- Exploiting Deep Learning and Explanation Methods for Movie Tag PredictionIn Proceedings of the 27th International Database Engineered Applications Symposium 2023
Indexing multimedia content with rich and accurate metadata allows for improving the quality of the search engines’ results and boosting the recommender systems performances, which can benefit from this information to yield more effective recommendation lists. Therefore, the adoption of tools able to automatically label multimedia content with informative tags represents an important task for all the companies offering streaming entertainment services. However, domain experts generally perform the tagging process manually, making it time-consuming and error-prone. In the last few years, Machine Learning techniques have been proposed as a promising solution to automate this type of task, but the lack of clean and labeled training data hinders the learning of robust classification models. To cope with the issues described above, in this work, we devised a Deep Learning based solution for semi-automatic multi-label classification integrating post-hoc explanation techniques. Specifically, model explanation methods are exploited to assist the operator in the labeling process by facilitating an understanding of the model predictions. The proposed approach has been validated on a real dataset, and the experimental results demonstrate its effectiveness.
2022
- Cascade-based echo chamber detectionIn Proceedings of the 31st ACM International Conference on Information & Knowledge Management 2022
Despite echo chambers in social media have been under considerable scrutiny, general models for their detection and analysis are missing. In this work, we aim to fill this gap by proposing a probabilistic generative model that explains social media footprints—i.e., social network structure and propagations of information—through a set of latent communities, characterized by a degree of echo-chamber behavior and by an opinion polarity. Specifically, echo chambers are modeled as communities that are permeable to pieces of information with similar ideological polarity, and impermeable to information of opposed leaning: this allows discriminating echo chambers from communities that lack a clear ideological alignment. To learn the model parameters we propose a scalable, stochastic adaptation of the Generalized Expectation Maximization algorithm, that optimizes the joint likelihood of observing social connections and information propagation. Experiments on synthetic data show that our algorithm is able to correctly reconstruct ground-truth latent communities with their degree of echo-chamber behavior and opinion polarity. Experiments on real-world data about polarized social and political debates, such as the Brexit referendum or the COVID-19 vaccine campaign, confirm the effectiveness of our proposal in detecting echo chambers. Finally, we show how our model can improve accuracy in auxiliary predictive tasks, such as stance detection and prediction of future propagations.
- The effect of people recommenders on echo chambers and polarizationIn Proceedings of the International AAAI Conference on Web and Social Media 2022
The effects of social media on critical issues, such as polarization and misinformation, are under scrutiny due to the disruptive consequences that these phenomena can have on our societies. Among the algorithms routinely used by social media platforms, people-recommender systems are of special interest, as they directly contribute to the evolution of the socialnetwork structure, affecting the information and the opinions users are exposed to. In this paper, we propose a framework to assess the effect of people recommenders on the evolution of opinions. Our proposal is based on Monte Carlo simulations combining link recommendation and opinion-dynamics models. In order to control initial conditions, we define a random network model to generate graphs with opinions, withtunable amounts of modularity and homophily. We join these elements into a methodology to study the effects of the recommender system on echo chambers and polarization. We also show how to use our framework to measure, by means of simulations, the impact of different intervention strategies. Our thorough experimentation shows that people recommenders can in fact lead to a significant increase in echo chambers. However, this happens only if there is considerable initial homophily in the network. Also, we find that if the network already contains echo chambers, the effect of the recommendation algorithm is negligible. Such findings are robust to two very different opinion dynamics models, a bounded confidence model and an epistemological model.
- Learning and Explanation of Extreme Multi-label Deep Classification Models for Media ContentIn International Symposium on Methodologies for Intelligent Systems 2022
Providing rich and accurate metadata for indexing media content is a crucial problem for all the companies offering streaming entertainment services. These metadata are typically used to improve the result of search engines and to feed recommendation algorithms in order to yield recommendation lists matching user interests. In particular, the problem of labeling multimedia content with informative tags (able to accurately describe the topics associated with such content) is a relevant issue. Indeed, the labeling procedure is time-consuming and susceptible to errors process as it is usually performed by domain experts in a fully manual fashion. Recently, the adoption of Machine Learning based techniques to tackle this problem has been investigated but the lack of clean and labeled training data leads to the yield of weak predictive models. To address all these issues, in this work we define a Deep Learning based framework for semi-automatic multi-label classification integrating model prediction explanation tools. In particular, Model Explanation techniques allow for supporting the operator to perform labeling of the contents. A preliminary experimentation conducted on a real dataset demonstrates the quality of the proposed solution.
- Towards Extreme Multi-Label Classification of Multimedia Content (Discussion Paper)In SEBD 2022
Providing rich and accurate metadata for indexing media content represents a major issue for enterprises offering streaming entertainment services. Metadata information are usually exploited to boost the search capabilities for relevant contents and as such it can be used by recommendation algorithms for yielding recommendation lists matching user interests. In this context, we investigate the problem of associating suitable labels (or tag) to multimedia contents, that can accurately describe the topics associated with such contents. This task is usually performed by domain experts in a fully manual fashion that makes the overall process time-consuming and susceptible to errors. In this work we propose a Deep Learning based framework for semi-automatic, multi-label and semi-supervised classification. By integrating different data types (e.g., text, images, etc.) the approach allows for tagging media contents with specific labels. A preliminary experimentation conducted on a real dataset demonstrates the quality of the approach in terms of predictive accuracy.
2021
- Unbiasing Collaborative Filtering for Popularity-Aware Recommendation (Discussion Paper)Luciano Caroprese, Giuseppe Manco, Marco Minici, and 2 more authorsIn SEBD 2021
We analyze the behavior of recommender systems relative to the popularity of the items to recommend. Our findings show that most popular ranking-based recommenders are biased towards popular items, thus affecting the quality of recommendation. Based on these observations, we propose a new deep learning architecture with an improved learning strategy that significantly improves the performance of such recommenders on low-popular items. The proposed technique is based on two main aspects: resampling of negatives and ensembling of multiple instances of the algorithm. Experimental results on traditional benchmark datasets show that the proposed approach substantially improves the recommendation ability by balancing accurate contributions almost independently from the popularity of the items to recommend.