Involuntary Information Leakage in Social Network Services

[1] ENISA: Enisa position paper no.1, security issues and recommendations for online social networks (October 2007) http://www.enisa.europa.eu/doc/pdf/deliverables/enisa_pp_social_networks.pdf.

[2] Gross, R., Acquisti, A., Heinz III, H.: Information revelation and privacy in online social networks. In: Proceedings of the 2005 ACM workshop on Privacy in the electronic society, ACM Press New York, NY, USA (2005) 71-80

[3] Ahn, Y., Han, S., Kwak, H., Moon, S., Jeong, H.: Analysis of topological characteristics of huge online social networking services. In: Proceedings of the 16th international conference on World Wide Web, ACM Press New York, NY, USA (2007) 835-844

[4] Mislove, A., Marcon, M., Gummadi, K., Druschel, P., Bhattacharjee, B.: Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, ACM New York, NY, USA (2007) 29-42

[5] Kumar, R., Novak, J., Tomkins, A.: Structure and evolution of online social networks. In: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM Press New York, NY, USA (2006) 611-617

[6] O��Murchu, I., Breslin, J., Decker, S.: Online social and business networking communities. In: Proceedings of ECAI 2004 Workshop on Application of Semantic Web Technologies to Web Communities. (2004)

[7] Boyd, D.: Friendster and publicly articulated social networks. Conference on Human Factors and Computing Systems (CHI 2004), Vienna, Austria, April (2004) 24-29

[8] Acquisti, A.: Privacy in electronic commerce and the economics of immediate gratification. In: Proceedings of the 5th ACM conference on Electronic commerce, ACM Press New York, NY, USA (2004) 21-29

[9] Jourard, S., Lasakow, P.: Some factors in self-disclosure. Journal of Abnormal and Social Psychology 56(1) (1958) 91-98

[10] Joinson, A.N., Paine (Schofield), C. Oxford Handbook of Internet Psychology. In: Self-Disclosure, Privacy and the Internet. Oxford University Press (2007) 237-252

[11] Farmer, R.: Instant messaging-collaborative tool or educator's nightmare. In: The North American Web-based Learning Conference (NAWeb 2003). (2003)

[12] Tsai, C.H.: Common chinese names http://technology.chtsai.org/namefreq/.

[13] Tsai, C.H.: A list of chinese names http://technology.chtsai.org/namelist/.

[14] Tsai, C.H.: A review of chinese word lists accessible on the internet http://technology.chtsai.org/wordlist/.

[15] Judge, P., Alperovitch, D., Yang, W.: Understanding and reversing the profit model of spam. In: Workshop on Economics of Information Security 2005. (WEIS 2005). (June 2005)

[16] Oscar, P., VWANI, R.: Personal Email Networks: An Effective Anti-Spam Tool. IEEE Computer 38(4) (2005) 61-68

[17] Seigneur, J., Dimmock, N., Bryce, C., Jensen, C.: Combating spam with TEA (trustworthy email addresses). In: Proceedings of the Second Annual Conference on Privacy, Security and Trust (PST��04). 47-58

[18] Garcia, F., Hoepman, J., van Nieuwenhuizen, J.: Spam Filter Analysis. In: Proceedings of 19th IFIP International Information Security Conference, WCC2004-SEC, Kluwer Academic Publishers (2004)

[19] Zhang, Y., Egelman, S., Cranor, L., Hong, J.: Phinding phish: Evaluating anti-phishing tools. In: Proceedings of the 14th Annual Network and Distributed System Security Symposium (NDSS 2007). (2007)

[20] Microsoft.com: Recognize phishing scams and fraudulent e-mails http://www.microsoft.com/athome/security/email/phishing.mspx.

[21] PayPal: Phishing guide part 2 https://www.paypal.com/us/cgi-bin/webscr?cmd=xpt/cps/securitycenter/general/RecognizePhishing-outside.

[22] Wu, M., Miller, R., Garfinkel, S.: Do security toolbars actually prevent phishing attacks? In: Proceedings of the SIGCHI conference on Human Factors in computing systems, ACM Press New York, NY, USA (2006) 601-610

[23] Florêncio, D.A.F., Herley, C.: Analysis and improvement of anti-phishing schemes. In: SEC 2006. (2006) 148-157

Footnotes:

1. This work was supported in part by Taiwan Information Security Center (TWISC), National Science Council under the grants NSC 97-2219-E-001-001 and NSC 97-2219-E-011-006. It was also supported in part by Taiwan E-learning and Digital Archives Programs (TELDAP) sponsored by the National Science Council of Taiwan under NSC Grants: NSC 96-3113-H-001-010, NSC 96-3113-H-001-011 and NSC 96-3113-H-001-012.

2. Many SNSs provide a recommendation/endorsement system in which a user can "recommend" another user to the public.

3. Some rare Chinese family names are comprised of two characters, e.g., Ouhyoung; thus, a four-character full name is possible. However, to avoid false identification of real names, we only consider two- or three-character names.

Sheng-Wei Chen (also known as Kuan-Ta Chen)
http://www.iis.sinica.edu.tw/~swc
Last Update September 28, 2019


Wretch Data


Number of users	766,972 (20%)
Number of Effective users	592,548 (15%)
Number of Connections	7,619,212
Avg Connections per user	11.5


Friend Annotations and Name Candidates


Avg In-Degree	7.10
Avg In-Degree with Annot.	6.81 (96%)
Avg In-Degree with Dup. Tokens	3.46 (49%)
Avg # Unique Name Candidates	3.81


Type of name	Ratio of Name Inference


Nickname	60%
Real name	30%
First name	72%
Real name or first name	78%


Method	Real Name	First Name


Common family name	3%	N/A
Relation of first name	11%	N/A
Common full/first name	9%	57%
Relation of nickname	2%	14%
Removal of common words	27%	69%

Involuntary Information Leakage in Social Network Services

Abstract

1 Introduction

2 Related Work

3 Data Description

3.1 Wretch

3.2 Data Collection

3.3 Self-Information Disclosure

4 Involuntary Name Leakage

4.1 Inference Methodology

4.1.1 Inference of Full Names

4.1.2 Inference of First Names

4.2 Inference Results

4.3 Validation

4.4 Demographic Analysis

4.5 Risk Analysis

5 Involuntary Leakage of Age and Education Records

5.1 Inference Methodology

5.1.1 Inferring Age

5.1.2 Inferring Education Records

5.2 Inference Results

5.3 Validation

6 Discussion

6.1 Threats Caused by Name Leakage

6.2 Potential Solutions

7 Conclusion

References

Footnotes: