URL inputted in URLShortener should go through Spam blacklist
Open, MediumPublic
Actions

Assigned To

None

Authored By

	Bugreporter
	Jul 23 2019, 9:22 PM

Description

In third-party install without proper configuration, it may be possible to circumvent Spam blacklist via this tool.

Even if in Wikimedia project where external website can not be linked, there's no technical means to prevent users from repeatly creating short URLs from e.g. en.wikipedia.org/wiki/Example's_real_name_is_John_Doe_123123 . This may be first blacklisted in Spam blacklist, and when such URL are inputted, not only creation are blocked, there will be records in spam blacklist log, so Meta sysops or stewards will block them.

I admitted that 1. this can not block such privacy-violating URLs completely, and 2. If hits are logged there will still be issues like T221072: URL shortener link creation should be logged (though this will only happen in extreme cases). Does anyone have better ideas?

Related Objects

Mentioned Here: T221072: URL shortener link creation should be logged

Event Timeline

Bugreporter created this task.Jul 23 2019, 9:22 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJul 23 2019, 9:22 PM

@Bugreporter: What exactly is a Security issue in this task?

"it may be possible to circumvent Spam blacklist via this tool", though not on WIkimedia wikis.

+1 to the general idea. Don't think there's a security or privacy issue here though.

In third-party install without proper configuration, it may be possible to circumvent Spam blacklist via this tool.

I'm not really worried about misconfigured wikis. In any case, the extension out of the box will automatically configure itself safely.

I admitted that 1. this can not block such privacy-violating URLs completely, and 2. If hits are logged there will still be issues like T221072 (though this will only happen in extreme cases). Does anyone have better ideas?

I think this is fine. Logging when it matches an abusive pattern isn't a privacy issue, because it's not correlating reader behavior, since the url isn't a page being read - it's abuse.

I think this is fine. Logging when it matches an abusive pattern isn't a privacy issue, because it's not correlating reader behavior, since the url isn't a page being read - it's abuse.

[This is a bit of a convoluted scenario, not sure how much weight we should give it]

Since there is no CSRF token associated with making a short url, we could have the following scenario for a malicious person (Malory) trying to find the IP address of a prominent user (Alice).

Malory sets up evil.com
Malory convinces Alice to go to evil.com via some sort of social engineering (Probably pretty easy)
evil.com records the IP address of all visits. Evil.com now has Alice's IP in it's access log, but doesn't know which hit corresponds to Alice
evil.com has javascript that makes a post request to `https://en.wikipedia.org/w/api.php?action=shortenurl&url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FSomething_triggering_spam_blacklist%2fCURRENT_TIMESTAMP_HERE
Malory now looks at https://en.wikipedia.org/wiki/Special:Log/spamblacklist for the triggering url with the unique timestamp identifier. They see Alice's name in the log beside that url. They correlate that with the timestamp from the access log for evil.com. Malory now knows Alice's IP address with reasonable certainty

URL inputted in URLShortener should go through Spam blacklistOpen, MediumPublicActions

Description

Related Objects

Event Timeline

URL inputted in URLShortener should go through Spam blacklist
Open, MediumPublic
Actions