Self-supervised construction and recommendation techniques for e-commerce heterogeneous social graphs

ZHANG Ziqian, WANG Chaokun, FENG Hao, WU Cheng, NIU Fang

Journal of Tsinghua University(Science and Technology) ›› 2026, Vol. 66 ›› Issue (5) : 1036-1045.

PDF(1953 KB)
PDF(1953 KB)
Journal of Tsinghua University(Science and Technology) ›› 2026, Vol. 66 ›› Issue (5) : 1036-1045. DOI: 10.16511/j.cnki.qhdxxb.2026.28.009
DATABASE

Self-supervised construction and recommendation techniques for e-commerce heterogeneous social graphs

  • {{article.zuoZhe_EN}}
Author information +
History +

Abstract

[Objective] With the rapid growth of Internet e-commerce, recommendation systems have become key components for online platforms to provide efficient, accurate, and personalized user and product suggestions. These recommendations directly improve user experience, increase user retention, and drive sales growth. Incorporating social relationships into the traditional user-product interaction network has been widely proven to enhance recommendation quality and enable innovative scenarios such as friend and sharing-based recommendations. However, mainstream social e-commerce recommendation methods face significant limitations: they rely heavily on external social data from third-party platforms, which are often difficult to fully access due to privacy policies, platform restrictions, and data silo issues. Moreover, most existing solutions have only been tested on datasets with millions of users and struggle to scale to hundreds of millions due to high computational costs and limited user coverage—posing substantial barriers to their deployment on large-scale e-commerce platforms. To overcome these challenges, this study focuses on automatically building a user relationship network with clear real-world social meanings, using only internal behavioral data from e-commerce platforms without external social information. The main goals are to develop a scalable self-supervised method for inferring social relationships among large user bases, improve the efficiency of user relationship prediction at the scale of hundreds of millions, enrich the understanding of user preferences at the platform level, and expand the performance and application scope of e-commerce recommendation systems. [Methods] Based on the homophily principle from communication studies, the proposed framework includes four sequential and interrelated stages: pseudo-label network construction, user relationship inference, efficient candidate matching, and relationship type inference. First, two typical behavioral signals—co-purchase behavior and spatiotemporal co-occurrence—are extracted from e-commerce logs to build pseudo-label social networks that reflect family ties and geographic connections, respectively, serving as weak supervision signals. Next, a user relationship inference model based on multilayer perceptrons is designed to learn user representations from these networks; positive samples are obtained from observed pseudo-label edges, while negative samples are generated by random pairing of users, and the model is trained using binary cross-entropy loss. To address the high computational demand of examining all user pairs in billion-scale scenarios, an efficient candidate matching strategy based on multilevel clustering of user embeddings is proposed, significantly reducing the number of candidate pairs while maintaining high recall. Lastly, a multitask inference module is built to first predict whether a candidate pair has an actual social connection, then classify the relationship into five detailed types—senior-junior, spouse, neighbor, schoolmate, and colleague—using rules that combine pseudo-labels with user attributes such as age, gender, time, and location. [Results] Extensive experiments on real data from a large e-commerce platform (Company T) show that co-purchase relationship prediction achieves a precision of 71.70%, a recall of 87.44%, an accuracy of 76.49%, and an F1-score of 0.79. The multilevel clustering candidate matching strategy reduces computational load and supports stable online deployment at the scale of hundreds of millions of users. Relationship classification reaches high precision: 93.80% for spouses and 64.57% for senior-junior relations. The resulting heterogeneous social graph includes billions of edges across five relationship types, and online A/B tests confirm that incorporating social relationship information into recommendation models significantly improves accuracy, especially for category-sensitive items like medical products. [Conclusions] This research offers a practical solution for social e-commerce recommendations without relying on external social data, addressing privacy and platform restrictions. It enables the automatic construction of semantically rich social graphs using only internal behavioral data and supports large-scale applications through efficient clustering-based candidate matching. The proposed framework effectively incorporates social semantics into traditional recommendation systems, enhances user preference modeling, and boosts recommendation accuracy. This study not only demonstrates that the homophily principle applies to e-commerce behavior analysis but also provides scalable, interpretable methods for building large-scale social graphs and improving socially aware recommendations in real-world industry scenarios.

Key words

self-supervised learning / social network / social relationship prediction

Cite this article

Download Citations
ZHANG Ziqian, WANG Chaokun, FENG Hao, WU Cheng, NIU Fang. Self-supervised construction and recommendation techniques for e-commerce heterogeneous social graphs[J]. Journal of Tsinghua University(Science and Technology). 2026, 66(5): 1036-1045 https://doi.org/10.16511/j.cnki.qhdxxb.2026.28.009

References

[1] VAN DEN BERG R, KIPF T N, WELLING M. Graph convolutional matrix completion[EB/OL]. (2017-10-25)[2025-05-18]. https://arxiv.org/abs/1706.02263.
[2] ZHANG J N, SHI X J, ZHAO S L, et al. STAR-GCN: Stacked and reconstructed graph convolutional networks for recommender systems[C]// Proceedings of the 28th International Joint Conference on Artificial Intelligence. Macao, China: AAAI Press, 2019: 4264-4270.
[3] WANG X, HE X N, WANG M, et al. Neural graph collaborative filtering[C]// Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. Paris, France: ACM, 2019: 165-174.
[4] HE X N, DENG K, WANG X, et al. Lightgcn: Simplifying and powering graph convolution network for recommendation[C]// Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020: 639-648.
[5] YING R, HE R N, CHEN K F, et al. Graph convolutional neural networks for web-scale recommender systems[C]// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. London, United Kingdom: ACM, 2018: 974-983.
[6] HAMILTON W L, YING R, LESKOVEC J. Inductive representation learning on large graphs[C]// Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, USA: Curran Associates Inc., 2017: 1025-1035.
[7] VELI AČG KOVI AC'G P, CUCURULL G, CASANOVA A, et al. Graph attention networks[C]// Proceedings of the 6th International Conference on Learning Representations. Vancouver, Canada: OpenReview.net, 2018.
[8] BRODY S, ALON U, YAHAV E. How attentive are graph attention networks?[C]// International Conference on Learning Representations.
[9] KIM D, OH A. How to find your friendly neighborhood: Graph attention design with self-supervision[C]// International Conference on Learning Representations.
[10] JAMALI M, ESTER M. A matrix factorization technique with trust propagation for recommendation in social networks[C]// Proceedings of the Fourth ACM Conference on Recommender Systems. Barcelona, Spain: ACM, 2010: 135-142.
[11] MA H, LYU M R, KING I. Learning to recommend with trust and distrust relationships[C]// Proceedings of the Third ACM Conference on Recommender Systems. New York, USA: ACM, 2009: 189-196.
[12] TANG J L, HU X, LIU H. Social recommendation: A review[J]. Social Network Analysis and Mining, 2013, 3(4): 1113-1133.
[13] ZHU J K, MA H, CHEN C, et al. Social recommendation using low-rank semidefinite program[C]// Proceedings of the 25th AAAI Conference on Artificial Intelligence. San Francisco, USA: AAAI, 2011: 158-163.
[14] 徐上上, 孙福振, 王绍卿, 等. 基于图神经网络的异构信任推荐算法[J]. 计算机工程, 2022, 48(9): 89-95, 104. XU S S, SUN F Z, WANG S Q, et al. Heterogeneous trust recommendation algorithm based on graph neural networks[J]. Computer Engineering, 2022, 48(9): 89-95, 104. (in Chinese)
[15] 李邵莹, 孟丹, 孔超, 等. 面向社交推荐的自适应高阶隐式关系建模[J]. 软件学报, 2023, 34(10): 4851-4869. LI S Y, MENG D, KONG C, et al. Adaptive high-order implicit relations modeling for social recommendation[J]. Journal of Software, 2023, 34(10): 4851-4869. (in Chinese)
[16] 刘会, 张璇, 杨兵, 等. 用于社交推荐的增强影响扩散模型[J]. 计算机学报, 2023, 46(3): 626-642. LIU H, ZHANG X, YANG B, et al. An enhanced influence diffusion model for social recommendation[J]. Chinese Journal of Computers, 2023, 46(3): 626-642. (in Chinese)
[17] PEROZZI B, AL-RFOU R, SKIENA S. DeepWalk: Online learning of social representations[C]// Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM, 2014: 701-710.
[18] CROFT W B, LEMPEL R, MORAN S. SALSA: The stochastic approach for link-structure analysis[J]. ACM Transactions on Information Systems (TOIS), 2001, 19(2): 131-160.
[19] HU D J, HALL R, ATTENBERG J. Style in the long tail: Discovering unique interests with latent variable models in large scale social E-commerce[C]// Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM, 2014: 1640-1649.
[20] GUO D Y, XU J S, ZHANG J, et al. User relationship strength modeling for friend recommendation on Instagram[J]. Neurocomputing, 2017, 239: 9-18.
[21] CEN Y K, ZHANG J, WANG G F, et al. Trust relationship prediction in alibaba E-commerce platform[J]. IEEE Transactions on Knowledge and Data Engineering, 2020, 32(5): 1024-1035.
[22] JI H Y, ZHU J X, WANG X, et al. Who you would like to share with? a study of share recommendation in social e-commerce[C]// Proceedings of the AAAI conference on artificial intelligence. 2021, 35(1): 232-239.
[23] ZHANG J, GAO C, JIN D P, et al. Group-buying recommendation for social E-commerce[C]// Proceedings of 2021 IEEE 37th International Conference on Data Engineering (ICDE). Chania, Greece: IEEE, 2021: 1536-1547.
[24] WU J H, FAN W Q, CHEN J F, et al. Disentangled contrastive learning for social recommendation[C]// Proceedings of the 31st ACM International Conference on Information & Knowledge Management. Atlanta, USA: ACM, 2022: 4570-4574.
[25] ZHAO Z W, ZHU X, XU T, et al. Time-interval aware share recommendation via bi-directional continuous time dynamic graphs[C]// Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. Taipei, China: ACM, 2023: 822-831.
PDF(1953 KB)

Accesses

Citation

Detail

Sections
Recommended

/