Catálogo de publicaciones - revistas

Compartir en
redes sociales


Título de Acceso Abierto

Data Science and Engineering

Resumen/Descripción – provisto por la editorial en inglés
Data Science and Engineering (DSE) is an international, peer-reviewed, and open access journal published under the brand SpringerOpen. DSE is published in cooperation with the China Computer Federation (CCF). Focusing on the theoretical background and advanced engineering approaches, DSE aims to offer a prime forum for researchers, professionals, and industrial practitioners to share their knowledge in this rapidly growing area. It provides in-depth coverage of the latest advances in the closely related fields of data science and data engineering.
Palabras clave – provistas por la editorial

data collection; data management; big data; knowledge extraction

Disponibilidad
Institución detectada Período Navegá Descargá Solicitá
No requiere desde sep. 2024 / hasta sep. 2024 Directory of Open Access Journals acceso abierto
No requiere desde ene. 2016 / hasta sep. 2024 SpringerLink acceso abierto
open-access-logo  Esta publicación es de Acceso Abierto y no aplica cargos a los/as autores/as.

Información

Tipo de recurso:

revistas

ISSN impreso

2364-1185

ISSN electrónico

2364-1541

Editor responsable

Springer Nature

Idiomas de la publicación

  • inglés

País de edición

Alemania

Fecha de publicación

Información sobre licencias CC

https://creativecommons.org/licenses/by/4.0/

Tabla de contenidos

Explicit Behavior Interaction with Heterogeneous Graph for Multi-behavior Recommendation

Zhongping Zhang; Yin JiaORCID; Yuehan Hou; Xinlu Yu

<jats:title>Abstract</jats:title><jats:p>Multi-behavior recommendation systems exploit multi-type user–item interactions (e.g., clicking, adding to cart and collecting) as auxiliary behaviors for user modeling, which can alleviate the problem of data sparsity faced by traditional recommendation systems. The key point of multi-behavior recommendation systems is to make full use of the auxiliary behavior information for the learning of user preferences. However, there are two challenges in existing methods that need to be explored: (1) capturing personalized user preferences based on multiple auxiliary behaviors, especially for negative feedback signals; and (2) explicitly modeling the semantics between auxiliary and target behaviors, and learning the explicit interactions between multiple behaviors. To tackle the two problems described above, we propose a novel model, called explicit behavior interaction with heterogeneous graph for multi-behavior recommendation (MB-EBIH). In particular, we first construct a heterogeneous behavior graph, including both positive and negative behaviors. A pre-trained model based on graph neural network (GNN) is then used to generate explicit behavior interaction values as the edge weights for the heterogeneous behavior graph. These weights reflect the importance of each of the auxiliary behaviors in an explicit manner. Finally, the extracted explicit behavior interaction information is incorporated into the multi-behavior user–item bipartite graphs to learn better representations. Experimental results on four real-world datasets demonstrate the effectiveness of our model in terms of exploring multi-behavioral data; and ablation and analysis experiments further demonstrate the effectiveness of explicit behavior interaction information.</jats:p>

Palabras clave: Computer Science Applications; Artificial Intelligence; Information Systems; Software.

Pp. No disponible

Construct and Query A Fine-Grained Geospatial Knowledge Graph

Bo Wei; Xi Guo; Xiaodi Li; Ziyan Wu; Jing Zhao; Qiping Zou

<jats:title>Abstract</jats:title><jats:p>In this paper, we propose the fine-grained geospatial knowledge graph (FineGeoKG), which can capture the neighboring relations between geospatial objects. We call such neighboring relations strong geospatial relations (SGRs) and define six types of SGRs. In FineGeoKG, the vertices (or entities) are geospatial objects. The edges (or relations) can have “sgr” labels together with properties, which are used to quantify SGRs in both topological and directional aspects. FineGeoKG is different from WorldKG, Yago2Geo, and other existing geospatial knowledge graphs, since its edges can capture the spatial coherence among geospatial objects. To construct FineGeoKG efficiently, the crucial problem is to find out SGRs. We improve the existing geospatial interlinking algorithm in order to find out SGRs faster. To answer SGR queries efficiently, we design an index to organize the SGR edges and improve the binary join method for subgraph matching. We conduct experiments on the real datasets and the experimental results show that the proposed algorithm is more efficient than the baseline algorithms. We also demonstrate the usefulness of FineGeoKG by presenting the results of complicated spatial queries which focus on structural and semantic information. Such queries can help researchers (for example, ecologists) find groups of objects following specific spatial patterns.</jats:p>

Palabras clave: Computer Science Applications; Artificial Intelligence; Information Systems; Software.

Pp. No disponible

Where To Go at the Next Timestamp

Jiaqi DuanORCID; Xiangfu Meng; Guihong Liu

<jats:title>Abstract</jats:title><jats:p>The next Point of Interest (<jats:bold>POI</jats:bold>) recommendation is the core technology of smart city. Current state-of-the-art models attempt to improve the accuracy of the next POI recommendation by incorporating temporal and spatial intervals or by partitioning the POI coordinates into grids. However, they all overlook a detail that in real life, people always want to know where to go at an exact time point or after a specific time interval instead of aimlessly asking where to go next. Moreover, due to individual preferences, different users may visit different places at the same timestamp. Therefore, utilizing timestamp queries can enhance the personalized recommendation capability of the model and mitigate overfitting risks. These implies that using timestamp can achieve more precise recommendations. To the best of our knowledge, we are the first to use the next timestamp for next POI recommendation. In particular, we propose a <jats:bold>T</jats:bold>ime-<jats:bold>S</jats:bold>tamp <jats:bold>C</jats:bold>ross <jats:bold>A</jats:bold>ttention <jats:bold>N</jats:bold>etwork (<jats:bold>TSCAN</jats:bold>). TSCAN is a two-layer cross-attention network. The first layer, <jats:bold>T</jats:bold>ime <jats:bold>S</jats:bold>tamp <jats:bold>C</jats:bold>ross <jats:bold>A</jats:bold>ttention <jats:bold>B</jats:bold>lock (<jats:bold>TSCAB</jats:bold>), uses cross-attention between the next timestamp and historical timestamps, and multiplies the attention scores on corresponding POI to predict the next POI that is most related to the history. The other layer, <jats:bold>C</jats:bold>ross <jats:bold>T</jats:bold>ime <jats:bold>I</jats:bold>nterval <jats:bold>A</jats:bold>ware <jats:bold>B</jats:bold>lock (<jats:bold>CTIAB</jats:bold>), applies the time intervals between the next timestamp and historical timestamps to the POI obtained by TSCAB and historical POIs, allowing temporally adjacent POIs to have a greater similarity. Our model not only has a significant improvement in accuracy but also achieves the goal of personalized recommendation, effectively alleviating overfitting. We evaluate the proposed model with three real-world LBSN datasets, and show that TSCAN outperforms the state-of-the-art next POI recommendation models by 5~9%. TSCAN can not only recommend the next POI, but also recommend the possible POI to visit at any specific timestamp in the future.</jats:p>

Palabras clave: Computer Science Applications; Artificial Intelligence; Information Systems; Software.

Pp. No disponible

Efficient Top-k Frequent Itemset Mining on Massive Data

Xiaolong Wan; Xixian HanORCID

<jats:title>Abstract</jats:title><jats:p>Top-<jats:italic>k</jats:italic> frequent itemset mining (top-<jats:italic>k</jats:italic> FIM) plays an important role in many practical applications. It reports the <jats:italic>k</jats:italic> itemsets with the highest supports. Rather than the subtle minimum support threshold specified in FIM, top-<jats:italic>k</jats:italic> FIM only needs the more understandable parameter of the result number. The existing algorithms require at least two passes of scan on the table, and incur high execution cost on massive data. This paper develops a prefix-partitioning-based PTF algorithm to mine top-<jats:italic>k</jats:italic> frequent itemsets efficiently, where each prefix-based partition keeps the transactions sharing the same prefix item. PTF can skip most of the partitions directly which cannot generate any top-<jats:italic>k</jats:italic> frequent itemsets. Vertical mining is developed to process the partitions of vertical representation with the high-support-first principle, and only a small fraction of the items are involved in the processing of the partitions. Two improvements are proposed to reduce execution cost further. Hybrid vertical storage mode maintains the prefix-based partitions adaptively and the candidate pruning reduces the number of the explored candidates. The extensive experimental results show that, on massive data, PTF can achieve up to 1348.53 times speedup ratio and involve up to 355.31 times less I/O cost compared with the state-of-the-art algorithms.</jats:p>

Palabras clave: Computer Science Applications; Artificial Intelligence; Information Systems; Software.

Pp. No disponible

Special Issue Editorial on “The Innovative Use of Data Science to Transform How We Work and Live”

Yee Ling Boo; Manik Gupta; Weijia Zhang; Philippe Fournier-Viger

Palabras clave: Computer Science Applications; Artificial Intelligence; Information Systems; Software.

Pp. No disponible

A Meta-adversarial Framework for Cross-Domain Cold-Start Recommendation

Yufang Liu; Shaoqing WangORCID; Xueting Li; Fuzhen Sun

<jats:title>Abstract</jats:title><jats:p>The cold-start problem in recommender systems has been facing a great challenge. Cross-domain recommendation can improve the performance of cold-start user recommendations in the target domain by using the rich information of users in the source domain. In cross-domain cold-start recommendation, users in target domain lack sufficient historical behaviors. Existing meta-learning-based methods depend on the feature distribution of training data and limit the adaptability in new tasks. To address these issues, we propose a <jats:underline>m</jats:underline>eta-<jats:underline>a</jats:underline>dversarial <jats:underline>f</jats:underline>ramework for <jats:underline>c</jats:underline>ross-<jats:underline>d</jats:underline>omain cold-start <jats:underline>r</jats:underline>ecommendation (MAFCDR) . Specifically, we employ a multi-level feature attention mechanism for independently learning the weights of long-term and short-term features to construct preferences of users in source domain. To migrate user representations, we train a meta-adversarial network that utilizes feature embeddings in the source domain as input and enhances the robustness and stability of the model. Then, the personalized bridge function transfers the user preferences in the source domain to the target domain. We build three cross-domain tasks using Amazon dataset and conduct extensive experiments, which demonstrate the effectiveness of the proposed model in cold-start user recommendation.</jats:p>

Palabras clave: Computer Science Applications; Artificial Intelligence; Information Systems; Software.

Pp. No disponible

Decoupling Anomaly Discrimination and Representation Learning: Self-supervised Learning for Anomaly Detection on Attributed Graph

YanMing Hu; Chuan Chen; BoWen Deng; YuJing Lai; Hao Lin; ZiBin Zheng; Jing Bian

<jats:title>Abstract</jats:title><jats:p>Anomaly detection on attributed graphs is a crucial topic for practical applications. Existing methods suffer from semantic mixture and imbalance issue because they commonly optimize the model based on the loss function for anomaly discrimination, mainly focusing on anomaly discrimination and ignoring representation learning. Graph Neural networks based techniques usually tend to map adjacent nodes into close semantic space. However, anomalous nodes commonly connect with numerous normal nodes directly, conflicting with the assortativity assumption. Additionally, there are far fewer anomalous nodes than normal nodes, leading to the imbalance problem. To address these challenges, a unique algorithm, decoupled self-supervised learning for anomaly detection (DSLAD), is proposed in this paper. DSLAD is a self-supervised method with anomaly discrimination and representation learning decoupled for anomaly detection. DSLAD employs bilinear pooling and masked autoencoder as the anomaly discriminators. By decoupling anomaly discrimination and representation learning, a balanced feature space is constructed, in which nodes are more semantically discriminative, as well as imbalance issue can be resolved. Experiments conducted on various six benchmark datasets reveal the effectiveness of DSLAD.</jats:p>

Pp. No disponible

Erdos: A Novel Blockchain Consensus Algorithm with Equitable Node Selection and Deterministic Block Finalization

Buti SelloORCID; Jianming Yong; Xiaohui Tao

<jats:title>Abstract</jats:title><jats:p>The introduction of blockchain technology has brought about significant transformation in the realm of digital transactions, providing a secure and transparent platform for peer-to-peer interactions that cannot be tampered with. The decentralised and distributed nature of blockchains guarantees the integrity and authenticity of the data, eliminating the need for intermediaries. The applications of this technology are not limited to the financial sector, but extend to various areas, such as supply chain management, identity verification, and governance. At the core of these blockchains is the consensus mechanism, which plays a crucial role in ensuring the reliability and integrity of a system. Consensus mechanisms are essential for achieving an agreement amongst network participants regarding the validity of transactions and the order in which they are recorded on the blockchain. By incorporating consensus mechanisms, blockchains ensure that all honest nodes in the network reach a consensus on whether to accept or reject a block, based on predefined rules and criteria. The aim of this study is to introduce a novel consensus mechanism named Erdos, which seeks to address the shortcomings of existing consensus algorithms, such as the Proof of Work and Proof of Stake. Erdos emphasises security, decentralisation, and fairness. One notable feature of this mechanism is its equitable node-selection algorithm, which ensures equal opportunities for all nodes to engage in block creation and validation. In addition, Erdos implements a deterministic block finalisation process that guarantees the integrity and authenticity of the blockchain. The main contribution of this research lies in its innovative approach to deterministic block finalisation, which effectively mitigates the various security risks associated with blockchain systems.</jats:p>

Pp. No disponible