WikiIndex:Spam control policy: Difference between revisions

m
Hoof Hearted moved page WikiIndex:Spam Control Policy to WikiIndex:Spam control policy without leaving a redirect: Text replacement - "Spam Control Policy" to "Spam control policy"
(→‎What happens if spam slips through automated systems: ==External links== *{{Mw|Manual: Combating spam}} — at MediaWiki.org)
m (Hoof Hearted moved page WikiIndex:Spam Control Policy to WikiIndex:Spam control policy without leaving a redirect: Text replacement - "Spam Control Policy" to "Spam control policy")
 
(4 intermediate revisions by the same user not shown)
Line 1: Line 1:
{{TOCright}}
{{TOC right}}
For some thoughts about [[spambot]] hunting, see [[WikiProject:Junking bots]] <small>...didn't know where to lonk it --[[Wolf Peuker|Wolf]] | <small>[[User talk:Peu|talk]]</small> 07:05, 13 October 2007 (EDT)</small>
For some thoughts about [[spambot]] hunting, see [[WikiProject:Junking bots]] <small>...didn't know where to lonk it --[[Wolf Peuker|Wolf]] | <small>[[User talk:Peu|talk]]</small> 07:05, 13 October 2007 (EDT)</small>


Line 25: Line 25:
We maintain a local blacklist at '''[[My spam blacklist]]'''.  This is protected page that [[:Category:Active administrators of this wiki|Sysops]] can use to block offending link spam not caught by Level 1 and Level 2.  There should be very few entries here, and NONE that contain the following:
We maintain a local blacklist at '''[[My spam blacklist]]'''.  This is protected page that [[:Category:Active administrators of this wiki|Sysops]] can use to block offending link spam not caught by Level 1 and Level 2.  There should be very few entries here, and NONE that contain the following:
*Periods (full stop) '.' — periods, aka the 'full stop' have a special meaning in the regex syntax, and can cause the list to malfunction;
*Periods (full stop) '.' — periods, aka the 'full stop' have a special meaning in the regex syntax, and can cause the list to malfunction;
*TLDs (top level domains) 'com, org, net' — these appear in virtually all [[:Category:United States|United States]] URLs, and are also extensively used in many (though not all) [[:Category:Countries|countries]] around the [[:Category:Earth|world]], so provide no value to the blocking mechanism;
*TLDs (top level domains) 'com, org, net' — these appear in virtually all [[:Category:United States of America|United States]] URLs, and are also extensively used in many (though not all) [[:Category:Country|countries]] around the [[:Category:Earth|world]], so provide no value to the blocking mechanism;
*'http://www.' — the regex only checks valid URLs, so this is not necessary.
*'http://www.' — the regex only checks valid URLs, so this is not necessary.


Line 52: Line 52:


==MediaWiki spam blacklist regex==
==MediaWiki spam blacklist regex==
According to the [https://phabricator.Wikimedia.org/diffusion/ESPB/browse/master/README readme] for the [[mw:Extension:SpamBlacklist|MediaWiki spam blacklist extension]], internally a single giant regular expression is formed using the lines from the blacklist file as follows:
According to the [https://phabricator.Wikimedia.org/diffusion/ESPB/browse/master/README readme] for the {{Mw|Extension:SpamBlacklist|MediaWiki spam blacklist extension}}, internally a single giant regular expression is formed using the lines from the blacklist file as follows:


In simple terms:
In simple terms: