Server admin log/Archive 20

June 30

16:12 mark: Temporarily added path 6939+ 14907+ to AVOID-PATHs on cr2-knams
02:53 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Sat Jun 30 02:53:46 UTC 2012
02:28 maplebed: corrected LVS pdns_recursor config error causing DNS queries to fail on LVS servers in gerrit r13554 and r13555.
02:27 logmsgbot: LocalisationUpdate completed (1.20wmf6) at Sat Jun 30 02:27:08 UTC 2012

June 29

19:49 hashar: restarting Jenkins to fix an issue with "parameterized builds" plugin. Updated git plugin as well.
19:35 RobH: dns update via authdns-update for vanadium ip
18:05 Jeff_Green: sync-apache and apache-graceful-all for http://donate.wikimedia.org-->https redirect
16:02 RobH: ms-be1001 and ms-be1002 powering down for ssd installation
15:59 RobH: authdns-update run
15:47 RobH: updating dns
15:16 mutante: dist-upgrading srv280,srv270,srv264
15:11 Jeff_Green: apache-graceful-all for redirect conf change
15:10 Jeff_Green: sync-apache to push out new foundation.conf
14:50 mark: Reinstalled chromium with precise
13:46 hashar: fixed interwiki on http://wikisource.org/ main page by hacking a script in production and refreshing cache
13:46 logmsgbot: hashar synchronized php-1.20wmf6/cache/interwiki.cdb 'Updating interwiki cache for 1.20wmf6'
13:32 logmsgbot: hashar synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
12:38 logmsgbot: dzahn synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
12:38 mutante: dumping interwiki and updating interwiki cache (to fix broken interwiki links, like wikisource.org -> wikipedia.org)
09:31 hashar: Jenkins: deployed gitsqlhaschanged patch ( d04f779 0f069c3 integration/jenkins.git )
07:56 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'send header from CS.php only for non CLI scripts 13435'
07:08 mutante: upgrading apt packages on brewster
02:51 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Fri Jun 29 02:51:13 UTC 2012
02:26 logmsgbot: LocalisationUpdate completed (1.20wmf6) at Fri Jun 29 02:26:25 UTC 2012

June 28

23:30 logmsgbot: reedy synchronized php-1.20wmf6/includes/resourceloader/ResourceLoader.php
23:15 binasher: completed aft offload_large_feedback migration on enwiki
23:03 logmsgbot: asher synchronized wmf-config/db.php 'returning db36'
21:50 logmsgbot: kaldari Finished syncing Wikimedia installation... :
21:39 logmsgbot: kaldari Started syncing Wikimedia installation... :
21:13 logmsgbot: asher synchronized wmf-config/db.php 'temp pulling db36'
20:38 binasher: ran aftv5 offload_large_feedback migrations on testwiki and en_labswikimedia
20:14 RobH: dns update for pc1-pc3
19:54 logmsgbot: kaldari Finished syncing Wikimedia installation... :
19:08 logmsgbot: kaldari Started syncing Wikimedia installation... :
18:47 hashar: the internal change to CommonSettings.php caused a lack of stylesheet for less than a minute on most wikis. I did test on test.wikipedia.org and beta project, but there must be a logic error somewhere that mess with the prod projects. Revert changes have been sent out in gerrit and merged in master.
18:35 hashar: so the nicely reviewed changes broke the enwiki stylesheets :/ reverted change :-(((
18:34 logmsgbot: hashar synchronized wmf-config/CommonSettings.php
18:33 hashar: srv190 and srv281 got ssh timeout
18:31 logmsgbot: hashar synchronized wmf-config/CommonSettings.php
18:30 hashar: did various tests using eval.php. Most important is $realm -> production. $cluster -> pmtpa. Syncing
18:25 hashar: updating mediawiki-config to grab a12545d edceb4c & eee97ad
18:23 RobHalsell: swapped bad psu out of ms1001-array3, redundant so no downtime
15:40 RobHalsell: pulling the following servers, relocating to payments rack: payments1001-1004, boron, beryllium, lithium
15:34 RobHalsell: dns updated
15:31 RobHalsell: boron appears to be unallocated, pulling IP allocation, rack allocation, moving to payments per 1227
14:40 mutante: svn server is rebooting.brb
14:38 mutante: dist-upgrading formey (svn/gerrit), rebooting soon
14:38 Jeff_Green: manganese rebooted for kernel update
14:37 RobH: allocating yttrium to payments rack per rt 1227
14:24 Jeff_Green: manganese dist-upgrade
14:15 Ryan_Lane: restarting apache on manganese
14:15 Ryan_Lane: restarting gerrit
05:08 Tim: srv266 was flooding the fatal error log, complaining about a missing file. Killed apache and ran sync-common.
05:03 Tim: fixed fatal.log on fenari, socat was writing to a deleted file
02:58 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Thu Jun 28 02:58:42 UTC 2012
02:30 logmsgbot: LocalisationUpdate completed (1.20wmf6) at Thu Jun 28 02:30:03 UTC 2012

June 27

23:50 K4-7131: sync'd payments cluster to 592e0a5ba195
23:43 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add rule for mediawiki'
22:37 K4-7131: sync'd payments cluster to 7e9072c2d571c
22:23 binasher: temporarily pulling srv211 from pybal
21:56 RobH: mw1102 has no nic0, rather than troubleshoot it for a long time, reinstall! (rt 3058)
21:01 logmsgbot: preilly synchronized php-1.20wmf6/extensions/MobileFrontend 'fix CSRF'
21:00 logmsgbot: preilly synchronized php-1.20wmf5/extensions/MobileFrontend 'fix CSRF'
20:55 RobH: db1003 back online, replaced mgmt cable and mgmt is working now as well
20:51 LeslieCarr: rebooting srv266 as it is unresponsive
20:44 RobH: db1003 mgmt issue due to bad cable, system booting back up, replacing mgmt cable
20:35 RobH: clean mysql shutdown, db1003 now offline
20:33 RobH: db1003 mgmt is not responsible, I need to remove power and reboot. confirmed iwth asher this is an s3 slave and can do a short downtime without issues
20:31 logmsgbot: maxsem synchronized php-1.20wmf5/extensions/MobileFrontend/ 'MF fixes and logging'
20:29 logmsgbot: maxsem synchronized php-1.20wmf6/extensions/MobileFrontend/ 'MF fixes and logging'
19:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'disable LastModified and LastModified/E3Experiment'
19:54 logmsgbot: reedy synchronized php-1.20wmf6/maintenance/runJobs.php
19:53 logmsgbot: reedy synchronized php-1.20wmf5/maintenance/runJobs.php
19:32 logmsgbot: reedy synchronized php-1.20wmf6/extensions/Translate/specials/SpecialLanguageStats.php
19:19 logmsgbot: reedy synchronized php-1.20wmf6/extensions/Translate/
19:06 logmsgbot: reedy synchronized php-1.20wmf6/extensions/Translate/
19:03 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: meta back to wmf6, not cause of translate issues
18:53 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: closed to 1.20wmf6
18:51 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wikimedia wikis to 1.20wmf6
18:48 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wiktionary and wikiversity to 1.20wmf6
18:47 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wikisource and wikiquote to 1.20wmf6
18:47 Jeff_Green: added several mobile hostnames to DNS for RT #2996
18:46 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wikinews and wikibooks to 1.20wmf6
18:44 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved metawiki back to 1.20wmf5
18:41 K4-713: synchronized payments cluster to fundraising/1.20 de0256084a
18:33 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved special wikis to php-1.20wmf6
18:29 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved en(wikibooks|wikinews|wikiquote|wikisource|wikiversity|wiktionary) to 1.20wmf6
18:05 RobH: cp1017 memory replaced
17:52 RobH: cp1017 is offline due to memory error. replacement memory on site, pulling system for swap
17:46 logmsgbot: preilly synchronized php-1.20wmf6/extensions/ZeroRatedMobileAccess 'update for landing page'
17:46 logmsgbot: asher synchronized wmf-config/mc.php 'disabling wgMemCachedPersistent; lowering wgMemCachedTimeout to 2x client default from 30x default'
17:45 logmsgbot: preilly synchronized php-1.20wmf5/extensions/ZeroRatedMobileAccess 'update for landing page'
17:44 logmsgbot: preilly synchronized php-1.20wmf4/extensions/ZeroRatedMobileAccess 'update for landing page'
17:19 LeslieCarr: restarting apache2 on srv258
17:11 maplebed: powercycled srv270
17:11 mutante: powercycling srv277 (had to, frozen console)
17:06 LeslieCarr: rebooting srv287
17:05 Ryan_Lane: rebooting srv280
17:03 mutante: powercycling srv280
17:01 paravoid: rebooting srv264, swapdeath
16:52 mark: Rebooting srv279, swapdeath
16:52 paravoid: rebooting srv275, swapdeath
16:50 paravoid: rebooting srv258, swapdeath
16:43 logmsgbot: reedy synchronized php-1.20wmf5/extensions/ArticleFeedbackv5/
15:08 Reedy: ExtensionDistributor now works from git on mediawiki.org
15:08 logmsgbot: reedy synchronized php-1.20wmf6/extensions/ExtensionDistributor/
15:06 logmsgbot: reedy synchronized php-1.20wmf5/extensions/ExtensionDistributor/
15:01 logmsgbot: reedy synchronized php-1.20wmf6/extensions/ExtensionDistributor/
14:51 logmsgbot: reedy synchronized php-1.20wmf6/extensions/ExtensionDistributor/ 'ED to trunk'
14:46 mark: Rebooting lvs1005 (after dist-upgrade)
13:28 logmsgbot: reedy Finished syncing Wikimedia installation... : Rebuild message cache for WikimediaShopLink
13:24 mark: Added IPv6 LVS service IPs to the LVS_import policy on cr2-eqiad, for testing with lvs1005
13:22 Tim: installing apache2.2-bin-dbgsym on mw1
13:04 logmsgbot: reedy Started syncing Wikimedia installation... : Rebuild message cache for WikimediaShopLink
13:04 mark: Started PyBal 1.02 snapshot build on lvs1005
12:39 Reedy: WikimediaShopLink is deployed to testwiki/test2wiki
11:59 apergos: kicked morebots
13.56.44 (CEST) <logmsgbot> !log reedy synchronized wmf-config/ 'WikimediaShopLink'
13.54.28 (CEST) <logmsgbot> !log reedy synchronized php-1.20wmf6/extensions/WikimediaShopLink/
13.44.37 (CEST) <logmsgbot> !log reedy synchronized php-1.20wmf6/extensions/WikimediaShopLink/
11.36.23 (CEST) <mutante> !log starting swift-container-auditor on ms-be3
10.29.51 (CEST) <mutante> !log apt-get upgrade on gallium, installs newer jenkins
10.26.41 (CEST) <mutante> !log importing jenkins_1.472_all.deb into lucid-wikimedia using reprepro
08.04.26 (CEST) <logmsgbot> !log tstarling synchronized wmf-config/CommonSettings.php
08.00.45 (CEST) <logmsgbot> !log tstarling synchronized wmf-config/CommonSettings.php
04.48.48 (CEST) <logmsgbot> !log LocalisationUpdate completed (1.20wmf6) at Wed Jun 27 02:48:51 UTC 2012
02:25 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Wed Jun 27 02:24:59 UTC 2012
01:23 Tim: on manganese: restarting gerrit
01:02 Tim: on manganese: killing all gitweb.cgi processes

June 26

23:49 Tim: on fenari: doing git and 1.19 checkouts for ExtensionDistributor
22:17 JeLuF: Slowly starting to import 100,000 images from the Deutsche Fotothek into Commons using importImages.php on fenari as user jeluf.
14:07 mutante: shutting down unused cp1037-cp1040 per RT-3189
10:48 mark: Moving all API traffic back to API apaches
10:41 mark: Restarted gmetad on nickel
10:36 apergos: powercycling srv261
02:47 logmsgbot: LocalisationUpdate completed (1.20wmf6) at Tue Jun 26 02:47:56 UTC 2012
02:25 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Tue Jun 26 02:25:02 UTC 2012
01:05 logmsgbot: preilly synchronized php-1.20wmf6/extensions/MobileFrontend 'fix css'
01:04 logmsgbot: preilly synchronized php-1.20wmf5/extensions/MobileFrontend 'fix css'
01:03 logmsgbot: preilly synchronized php-1.20wmf4/extensions/MobileFrontend 'fix css'
00:59 LeslieCarr: ignoring cp1037 to cp1040 alarms for now as they are unused
00:45 LeslieCarr: rebooting cp1040
00:45 LeslieCarr: rebooting cp1039
00:43 LeslieCarr: rebooting cp1037

June 25

22:49 logmsgbot: maxsem synchronized php-1.20wmf5/extensions/MobileFrontend/ 'Weekly MF deployment'
22:40 logmsgbot: maxsem synchronized php-1.20wmf6/extensions/MobileFrontend/ 'Weekly MF deployment'
22:36 maplebed: powercycling ms1002 - it's unresponsive to ssh and on the console though it does respond to a ping.
21:31 Jeff_Green: manual apache restart on srv265, srv277
21:21 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 37345 - Request: Enable Ext:Collection on mk.wiki'
21:07 LeslieCarr: rebooting neon
21:01 hashar: Triggered several jobs on Jenkins to run tests on change that did not received their blame stick token
21:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 37507 - Babel configuration for tl.wikipedia'
20:58 Jeff_Green: pushing out new redirects.conf adjusted for RT #3138
20:52 logmsgbot: reedy synchronized wmf-config/ 'Various site config bugs'
20:13 logmsgbot: reedy synchronized php-1.20wmf6/extensions/Math/ 'Updating math to master'
20:12 logmsgbot: reedy synchronized php-1.20wmf5/extensions/Math/ 'Updating math to master'
20:07 logmsgbot: reedy synchronized php-1.20wmf6/extensions/WikiEditor/
20:06 logmsgbot: reedy synchronized php-1.20wmf5/extensions/WikiEditor/
19:17 LeslieCarr: rebooting unresponsive gallium
18:04 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki and mediawikiwiki to 1.20wmf6
17:27 logmsgbot: reedy Finished syncing Wikimedia installation... : Take 2
16:43 logmsgbot: reedy Started syncing Wikimedia installation... : Take 2
16:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable EducationProgram on enwiki per request'
16:30 logmsgbot: reedy synchronized php-1.20wmf6/extensions/EducationProgram/ 'sync education programf iles'
16:28 logmsgbot: reedy Finished syncing Wikimedia installation... : Scapping to rebuild message cache for 1.20wmf6
16:08 logmsgbot: reedy Started syncing Wikimedia installation... : Scapping to rebuild message cache for 1.20wmf6
16:03 logmsgbot: reedy synchronized php-1.20wmf6 'Syncing php-1.20wmf6'
15:43 Reedy: Copying php-1.20wmf6 from /tmp to NFS /home on fenari
13:52 Reedy: Killed php-1.20wmf4/cache/l10n from mediawiki-installation hosts
12:26 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37699) Chage logo on uzwiki'
09:38 mutante: so the several redirects for education->outreach requested to work by today look good now. RT-3138
09:30 mutante: apache-graceful-all to push out needed redirects for education
09:23 mutante: looking good. running sync-apache
09:20 mutante: creating dsh group "testwikipedia" with just srv193, creating sync-apache-test to just sync there...testing sync
02:49 Tim: configured mediawiki-commits to discard mails from gerrit pending resolution of the "implicit destination" issue
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Mon Jun 25 02:24:46 UTC 2012

June 24

18:21 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
02:25 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Sun Jun 24 02:25:23 UTC 2012

June 23

13:28 apergos: powrcycling srv288, swap death etc, some message to mgmt console but only the timestamp so couldn't see the issue, also couldn't get past the login prompt
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Sat Jun 23 02:24:11 UTC 2012

June 22

23:11 LeslieCarr: restarted apache on srv278
22:23 binasher: stopping mysql on es3, reseeding slave via innodb hotbackup of es1004
18:57 logmsgbot: preilly synchronized docroot/bits
18:36 LeslieCarr: removing 28790 bounce messages from exim queue on mchenry
16:50 Ryan_Lane: added a database account on db9/10 for read-only access to the gerrit database
12:06 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37700) update stewardwiki logo & favicon'
11:44 mutante: installing upgrades and kernel on pdf1, can reboot? (also needs puppetizing and precise reinstall)
10:46 mutante: installing security upgrades and kernel on bast1001 (still needs reboot, but dont break user sessions)
10:42 mutante: fenari upgrade - this included replace wikimedia-lvs-realserver 0.04 (using .../wikimedia-lvs-realserver_0.08
10:41 mutante: installing security upgrades on fenari
10:31 mutante: installing security upgrades on formey (gerrit)
08:49 logmsgbot: hashar synchronized wmf-config/CommonSettings.php '12569 Load transcode conf on -e /etc/wikimedia-transcoding (wmflabs change)'
08:40 logmsgbot: hashar synchronized wmf-config/CommonSettings.php '12568 Disable wgNoticeInfrastructure on beta cluster'
08:30 logmsgbot: hashar synchronized wmf-config/CommonSettings.php '12566 labs use the same wgCentralDBname on all wiki'
07:59 apergos: powercycled lvs1001, not pingable, nothing good from mgmt console, etc.
04:11 logmsgbot: aaron synchronized php-1.20wmf5/includes/WikiPage.php
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Fri Jun 22 02:24:15 UTC 2012
01:00 logmsgbot: catrope synchronized php-1.20wmf5/extensions/VisualEditor 'VisualEditor updates'
00:37 LeslieCarr: restarting exim4 on mchenry with split_spool_directory = true
00:30 logmsgbot: aaron synchronized wmf-config/PrivateSettings.php 'Updated swift user config.'

June 21

23:11 logmsgbot: catrope synchronized php-1.20wmf5/extensions/VisualEditor/modules/ve/ce/nodes/ve.ce.TextNode.js
23:07 logmsgbot: catrope synchronized php-1.20wmf5/extensions/VisualEditor/VisualEditor.php
22:49 logmsgbot: catrope synchronized php-1.20wmf5/extensions/VisualEditor/modules/ve/ce/nodes/ve.ce.TextNode.js
22:26 logmsgbot: catrope synchronized php-1.20wmf5/extensions/VisualEditor/modules/ve/ce/nodes/ve.ce.TextNode.js
21:28 notpeter: restarting all lucene instances to direct logs to oxygen
21:16 binasher: deploying new mobile redirector to esams text squids
21:00 logmsgbot: catrope synchronized php-1.20wmf5/extensions/VisualEditor 'VisualEditor bugfixes'
20:34 logmsgbot: reedy synchronized wmf-config/
20:29 binasher: deployed new squid mobile redirector, now covers additional projects
20:27 logmsgbot: reedy synchronized wmf-config/
20:10 logmsgbot: hashar: on gallium, cloning mediawiki/extensions.git to /var/lib/jenkins/jobs/MediaWiki-Extensions-Fetching/workspace
19:25 mark: Restarted 3 queue runners as exim -qff &
19:15 logmsgbot: catrope Finished syncing Wikimedia installation... : VisualEditor updates
18:57 logmsgbot: catrope Started syncing Wikimedia installation... : VisualEditor updates
18:50 mark: Started 5 exim queue runners on mchenry with exim -qqff &
18:49 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Point $wgVisualEditorParsoidURL to cadmium'
18:36 logmsgbot: catrope synchronized php-1.20wmf5/includes/OutputPage.php 'Core patches for VisualEditor deploy'
18:36 logmsgbot: catrope synchronized php-1.20wmf5/resources/Resources.php 'Core patches for VisualEditor deploy'
18:36 logmsgbot: catrope synchronized php-1.20wmf5/resources/mediawiki.page/mediawiki.page.watch.ajax.js 'Core patches for VisualEditor deploy'
17:51 notpeter: restarting puppet on brewster
15:25 notpeter: stopping puppet on brewster
14:11 paravoid: powercycling srv272, unreachable due to load spike
05:30 binasher: clearing mobile varnish cache - my friend can't expand some article categories on his iphone after rebooting and clearing cache
04:41 logmsgbot: catrope synchronized php-1.20wmf4/extensions/LastModified/modules/lastmodified.js
04:40 logmsgbot: catrope synchronized php-1.20wmf5/extensions/LastModified/modules/lastmodified.js
02:25 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Thu Jun 21 02:25:40 UTC 2012
00:10 binasher: stopped puppet on cp1020 until tomorrow - testing new build of the squid mobile redirector on one server until tomorrow

June 20

23:29 logmsgbot: kaldari synchronized php-1.20wmf5/extensions/LastModified/E3Experiments/js/ext.E3Experiments.Timestamp.js 'updating clicktracking for LastModified and E3Experiments exts'
21:52 Reedy: pointed /usr/local/apache/common/php at /usr/local/apache/common/php-1.20wmf5 on mediawiki-installation
21:49 LeslieCarr: see RT3170 for more details on above change and mchenry pain
21:48 LeslieCarr: freezing many bounce messages on mchenry (all older than 2400 minutes)
21:04 LeslieCarr: replaced srv268 with srv245 in memcached list
21:04 logmsgbot: lcarr synchronized wmf-config/mc.php 'removed broken srv268'
21:03 paravoid: powercycling srv268; unreachable due to load spike
18:31 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
18:23 LeslieCarr: reloading mr1-pmtpa for sw upgrade (fixing a cpu bug)
18:18 logmsgbot: reedy synchronized wikiversions.dat
18:17 logmsgbot: reedy synchronized wikiversions.cdb
18:16 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Rest of pedias to 1.20wmf5
18:03 logmsgbot: preilly synchronized php-1.20wmf4/extensions/MobileFrontend 'test with disable caching on'
18:02 logmsgbot: preilly synchronized php-1.20wmf5/extensions/MobileFrontend 'test with disable caching on'
17:59 logmsgbot: preilly synchronized php-1.20wmf4/extensions/MobileFrontend 'testing with disable caching off'
17:58 logmsgbot: preilly synchronized php-1.20wmf5/extensions/MobileFrontend 'testing with disable caching off'
17:42 logmsgbot: preilly synchronized docroot/bits
17:11 logmsgbot: preilly synchronized wmf-config/mobile.php 'add Grameenphone Bangladesh'
17:08 logmsgbot: preilly synchronized wmf-config/mobile.php 'add telenor'
14:00 logmsgbot: reedy synchronized php-1.20wmf5/extensions/EducationProgram/
13:45 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable EducationProgram on enwiki *gulp*'
13:41 logmsgbot: reedy synchronized php-1.20wmf5/extensions/EducationProgram/ 'Push out master EP'
13:37 logmsgbot: reedy synchronized php-1.20wmf5/extensions/EducationProgram/ 'Push out master EP'
13:19 Reedy: Created EducationProgram database tables on enwiki
12:40 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php 'touching InitialiseSettings.php to refresh cache'
12:39 logmsgbot: hashar synchronized wmf-config/throttle.php '(bug 37740) raise account throttle for an edit marathon'
09:43 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37457) viwikibooks can import from fr/it wikibooks'
09:40 logmsgbot: hashar synchronized wmf-config/mobile.php '$wgMobileResourceVersion does not exist anymore'
09:25 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37327) Configure chr.wikipedia site logo'
04:35 Tim: on nickel: there were data sources for both "Apaches 8 CPU" and "Application servers", these were getting the same cluster name from the remote gmonds, and so different threads in gmetad were trying to write to the same summary files. Fixed temporarily, will fix in puppet shortly
04:26 Tim: on nickel: ran gmetad with -d3, it spews errors when trying to write to the faulty summary info files
04:20 Tim: on nickel: restarting gmetad
04:19 Tim: on srv258: started gmond
04:12 Tim: experimentally stopping gmond on srv258 to check for effects on oscillating appserver stats
03:23 Tim: on fenari, queueing refreshLinks jobs for some 2.8M commons image description pages that use location templates
02:48 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Wed Jun 20 02:48:41 UTC 2012
02:26 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Wed Jun 20 02:26:26 UTC 2012
02:24 Tim: started socat for /var/log/mw/fatal.log on fenari
01:23 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php

June 19

23:37 paravoid: temporarily adding wikimedia.org, wikipedia.org etc. to sodium's /etc/exim4/defer_domains
23:09 logmsgbot: maxsem synchronized wmf-config/InitialiseSettings.php 'bug 37611, plan B'
22:55 logmsgbot: maxsem synchronized wmf-config/InitialiseSettings.php 'bug 37611'
21:07 Jeff_Green: deployed a hacked up exim conf on sodium to block a mail ddos, puppet disabled there too
20:37 logmsgbot: mlitn synchronized php-1.20wmf5/extensions/ArticleFeedbackv5 'desc'
19:45 logmsgbot: mlitn Finished syncing Wikimedia installation... : Update ArticleFeedbackv5 to master
19:28 logmsgbot: mlitn Started syncing Wikimedia installation... : Update ArticleFeedbackv5 to master
19:13 logmsgbot: mlitn synchronized wmf-config/InitialiseSettings.php 'Enable AFTv4 on testwiki'
18:06 maplebed: failed out ms-be5 after failed ssd test
14:50 RobH: updating dns
14:40 RobH: db14 is out of rotation, shutting down to make room for new es servers in rack
14:03 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php 'Syncing misc changes
11:03 Ryan_Lane: adding IPs for virt6-8
10:11 logmsgbot: nikerabbit Finished syncing Wikimedia installation... : Updating TranslationNotifications
10:05 hashar: TranslationNotifications extension updated by Nikerabbit!
09:56 logmsgbot: nikerabbit Started syncing Wikimedia installation... : Updating TranslationNotifications
09:41 hashar: updating TranslationNotifications extension with NikeRabbit
09:04 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Merge I8572d5f4 to fix workflowstates in Translate'
07:27 apergos: reboot snapshot1, package and kernel updates
07:14 apergos: reboot snapshot2, package and kernel updates
02:47 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Tue Jun 19 02:47:42 UTC 2012
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Tue Jun 19 02:24:46 UTC 2012
01:34 logmsgbot: tstarling Finished syncing Wikimedia installation... :
00:53 logmsgbot: tstarling Started syncing Wikimedia installation... :
00:43 Tim: put DolphinBrowser files in docroot/bits (from preilly's Ie3fefec6) and now running scap

June 18

22:31 K4-713: updated production civicrm to r1814
22:26 logmsgbot: aaron synchronized php-1.20wmf5/extensions/FlaggedRevs 'deployed e53310f548cf3f3e4f1ddfa10f5efd0eff06eeec'
22:17 maplebed: rebooting es1002 to look at the raid setup
20:43 logmsgbot: preilly synchronized php-1.20wmf4/extensions/MobileFrontend 'update to remove bad code'
20:43 logmsgbot: preilly synchronized php-1.20wmf5/extensions/MobileFrontend 'update to remove bad code'
19:11 hashar: updating several Jenkins plugins
19:06 RobH: updating dns
18:52 logmsgbot: maxsem synchronized php-1.20wmf5/extensions/MobileFrontend/
18:51 logmsgbot: maxsem synchronized php-1.20wmf4/extensions/MobileFrontend/
18:23 logmsgbot: reedy synchronized wikiversions.dat
18:15 logmsgbot: reedy synchronized wikiversions.cdb 'sync using sync-file'
18:08 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki to 1.20wmf5
16:30 mutante: installing package upgrades on sodium
16:29 mutante: restarting lighttpd on sodium - redirecting mediawiki-cvs list page
16:05 binasher: rebooting es1001
15:59 mutante: there have been no archives, so that should be it. there may be another issue in BZ 37690 but should be unchanged by renaming
15:58 mutante: copied full config/users/passes from mediawiki-cvs to mediawiki-commits, merged redirects, added old list name to acceptable_aliases in recipient filters
15:52 mutante: making the mailing list switch. mediawiki-cvs -> mediawiki-commits
15:49 Ryan_Lane: assigned service IPs for labs-ns0/labs-ns1
15:25 binasher: rebooting es1002 and es1003
15:11 Ryan_Lane: added virt1000 as a secondary ldap server for labsconsole
15:08 Ryan_Lane: testing gerrit config with multiple ldap servers
14:52 hashar: hume is out of disk space again. Probably the wmf branches taking toooo much space
14:52 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37662) change wgUploadNavigationUrl @ dawiki'
14:21 mutante: creating new list MediaWiki-commits, not in use yet, but will replace outdated -cvs list soon
14:18 apergos: reboot snapshot3, package and kerne updates
14:13 apergos: rebooting snapshot4, kernel and other updates
09:31 Ryan_Lane: added virt1000 to dns, using titanium misc server
08:17 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php ' (bug 37672) Use odf on collection for ml projects '
06:03 apergos: powercycling db1047
02:47 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Mon Jun 18 02:47:49 UTC 2012
02:25 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Mon Jun 18 02:25:09 UTC 2012

June 17

02:45 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Sun Jun 17 02:45:15 UTC 2012
02:23 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Sun Jun 17 02:23:01 UTC 2012

June 16

02:49 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Sat Jun 16 02:49:55 UTC 2012
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Sat Jun 16 02:24:56 UTC 2012
00:34 paravoid: esams SSL should be back up
00:25 paravoid: SSL ipv6 access logs disabled; force running puppet and rm'ing access.logs on esams
00:08 paravoid: esams SSL outage, working on it

June 15

20:56 LeslieCarr: attaching asw-c1-pmtpa to asw-d-pmtpa ring
20:41 RobHalsell: updating dns for mc1-mc16 mgmt
20:07 RobH: mobing asw and msw-d3-sdtpa from single to dual power again, got sidetracked
19:59 RobH: mobing asw and msw-d3-sdtpa from single to dual power
19:24 RobH: updating dns for ms-be12 mgmt
19:18 logmsgbot: aaron synchronized php-1.20wmf5 'deployed 2755f255e45b53a083207d69c3e2d9fca62a3a1c'
19:15 paravoid: virt0: modify pdns.conf to listen on the old IP; temporarily disable puppet
19:15 paravoid: adding pre-renumbering virt0's IP back on eth1; doing policy routing to work out multihoming
18:54 RobH: updating dns for educacao redirect
18:53 LeslieCarr: deactivated rpf-filter on cr1-sdtpa and cr2-pmtpa temporarily for virt0
18:53 Ryan_Lane: doing a git pull for OpenStackManager on virt0
18:33 RobH: morebots, dont leave me again!
15:50 RobH: updating dns for new cisco machines
14:34 hashar: hume: 5.0G 5.0G 68K 100% /usr/local/apache
14:33 hashar: hume is out of disk space
14:32 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '(bug 34866) Change wgLanguageCode of several wikis to be renamed'
14:21 logmsgbot: hashar synchronized phpunit.xml
14:09 logmsgbot: hashar synchronized tests
13:39 Ryan_Lane: adding labs-ns0 and labs-ns1 dns entries
11:46 mark: csw1-esams.wikimedia.org line card 2 in trouble, power cycled it
02:48 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Fri Jun 15 02:48:46 UTC 2012
02:26 logmsgbot: LocalisationUpdate completed (1.20wmf5) at Fri Jun 15 02:26:17 UTC 2012
00:00 binasher: rebooting / upgrading kernel on es1003 first

June 14

23:58 binasher: stopping mysql on es1003 and disabled notifications. going to convert to innodb via hotbackup of es1004 for testing
23:23 logmsgbot: kaldari synchronized wmf-config/InitialiseSettings.php 'turning LastModified on for en.wiki'
23:10 logmsgbot: kaldari Finished syncing Wikimedia installation... :
23:00 logmsgbot: kaldari Started syncing Wikimedia installation... :
22:36 logmsgbot: kaldari synchronized php-1.20wmf5/extensions/LastModified/E3Experiments/js/ext.E3Experiments.Timestamp.js 'sycing E3Experiment js for wmf5'
22:34 logmsgbot: kaldari Started syncing Wikimedia installation... :
22:07 logmsgbot: kaldari Finished syncing Wikimedia installation... :
21:57 logmsgbot: kaldari Started syncing Wikimedia installation... :
21:16 RobH: updating dns for new mgmt ips and move of scs
20:37 logmsgbot: kaldari Finished syncing Wikimedia installation... :
20:33 logmsgbot: kaldari Started syncing Wikimedia installation... :
20:29 logmsgbot: bsitu Finished syncing Wikimedia installation... : Update to MoodBar
20:07 logmsgbot: bsitu Started syncing Wikimedia installation... : Update to MoodBar
19:51 logmsgbot: py synchronized wmf-config/db.php 're-add db43 to s6 pool after kern upgrade'
19:28 logmsgbot: bsitu Finished syncing Wikimedia installation... : Update to PageTriage
19:10 notpeter: rebooting db43 for kernel upgrading
19:08 logmsgbot: asher synchronized wmf-config/db.php 'returning db22'
18:58 logmsgbot: bsitu Started syncing Wikimedia installation... : Update to PageTriage
18:52 RobH: unracking and decommissioning db21 and db23
18:46 RobH: db22 relocated, powering up
18:43 RobH: db22 relocating
18:41 logmsgbot: asher synchronized wmf-config/db.php 'temp pulling db22 for hw move'
18:39 pgehres: re-enabled donation queue consumption after all updates
18:36 notpeter: pushing new dns zone file (minor change)
18:33 logmsgbot: preilly synchronized php-1.20wmf5/extensions/ZeroRatedMobileAccess 'update for landing page'
18:33 logmsgbot: preilly synchronized php-1.20wmf4/extensions/ZeroRatedMobileAccess 'update for landing page'
18:32 notpeter: new master log and pos for s6 MASTER_LOG_FILE='db47-bin.000230', MASTER_LOG_POS=876357616
18:27 logmsgbot: py synchronized wmf-config/db.php 'completed master switch for s6'
18:24 logmsgbot: preilly synchronized php-1.20wmf5/extensions/ZeroRatedMobileAccess 'update for landing page'
18:23 logmsgbot: preilly synchronized php-1.20wmf4/extensions/ZeroRatedMobileAccess 'update for landing page'
18:23 logmsgbot: py synchronized wmf-config/db.php 'switching master for s6 to db50'
18:11 Jeff_Green: erzurumi dist-upgrade & reboot [up 633 days]
18:02 logmsgbot: preilly synchronized php-1.20wmf4/extensions/MobileFrontend 'update for ie7'
18:01 logmsgbot: preilly synchronized php-1.20wmf5/extensions/MobileFrontend 'update for ie7'
18:00 Jeff_Green: aluminium/db1008 dist-upgrade & reboot
17:57 pgehres: disabled queue consumption on aluminum for dist-upgrade
17:03 mark: Copied udp-filter package from lucid-wikimedia to precise-wikimedia (but do as I say and rebuild, not as I do...)
16:40 binasher: running pagetriage_page schemea changes on enwiki and testwiki via osc (https://gerrit.wikimedia.org/r/#/c/11014/1/sql/PageTriagePagePatch.sql)
16:22 Jeff_Green: hume dist-upgrade & reboot
16:08 Jeff_Green: loudon dist-upgrade & reboot
16:04 mark: Manually disabled GRO on amslvs3/4 eth0
16:03 logmsgbot: reedy synchronized wmf-config/ 'Enable EP on test2wiki'
16:01 Ryan_Lane: restarting nova-compute on virt5
15:58 logmsgbot: reedy synchronized php-1.20wmf5/extensions/EducationProgram/
15:54 logmsgbot: reedy Finished syncing Wikimedia installation... : Rebuild localisationcache for EP
15:49 Jeff_Green: grosley dist-upgrade & reboot
15:40 Jeff_Green: silicon dist-upgrade and reboot
15:29 logmsgbot: reedy Started syncing Wikimedia installation... : Rebuild localisationcache for EP
15:28 mark: Starting dist-upgrade of manutius to Precise
15:27 Ryan_Lane: to satisfy mark's pedantry that's lucid-wikimedia
15:27 mark: Ryan needs coffee
15:26 Ryan_Lane: specifically wikimedia-lucid repo
15:26 Ryan_Lane: added adminbot 1.2 to repo
15:00 logmsgbot: reedy synchronized php-1.20wmf5/extensions/EducationProgram/
14:30 mark: Reinstalled stat1 with Ubuntu Precise
11:21 pp-pdf1: updated mwlib.rl to 0.12.12
11:21 pp-pdf2: updated mwlib.rl to 0.12.12
11:21 pp-pdf3: updated mwlib.rl to 0.12.12
02:46 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Thu Jun 14 02:46:00 UTC 2012
02:23 logmsgbot_: LocalisationUpdate completed (1.20wmf5) at Thu Jun 14 02:23:26 UTC 2012
00:01 logmsgbot_: tstarling synchronized php-1.20wmf4/includes/DefaultSettings.php

June 13

23:39 logmsgbot_: maxsem synchronized php-1.20wmf5/extensions/MobileFrontend/ 'Deploying MF fix'
23:37 logmsgbot_: maxsem synchronized php-1.20wmf4/extensions/MobileFrontend/ 'Deploying MF fix'
21:49 logmsgbot_: reedy synchronized wmf-config/ 'More changes from gerrit'
21:35 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Bringing in numerous changes merged via gerrit'
21:32 logmsgbot_: lcarr synchronized wmf-config/mc.php 'replacing broken srv203 with working srv250'
21:31 LeslieCarr: replacing srv203 with srv250 in memcache rotation since srv203 is broken
21:20 logmsgbot_: aaron synchronized php-1.20wmf5/includes/WikiPage.php 'deployed 82742bccf3b5f2da0d5df05630eb31978afbbce1'
19:57 logmsgbot_: aaron synchronized php-1.20wmf4/includes/WikiPage.php
19:51 Jeff_Green: payments cluster dist-upgrades & reboots
19:43 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Added debug log.'
19:37 logmsgbot_: aaron synchronized php-1.20wmf4/includes/WikiPage.php
19:29 binasher: drop table enwiki.trackbacks
19:28 binasher: converted enwiki.interwiki to innodb
19:27 logmsgbot_: aaron synchronized php-1.20wmf4/includes/WikiPage.php 'temporary logging code.'
19:26 binasher: drop table enwiki.exlogging
18:35 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Everything non wikipedia to 1.20wmf5
18:33 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wikiquote to 1.20wmf5
18:31 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wiktionary to 1.20wmf5
18:30 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: wikiversity to 1.20wmf5
18:22 logmsgbot_: reedy synchronized php-1.20wmf5/extensions/Vector/
18:18 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: special.dblist wikis to 1.20wmf5
18:14 logmsgbot_: aaron synchronized php-1.20wmf5/extensions/FlaggedRevs 'deployed 537bb248bb93948844f195014227512f169a439b'
18:06 Jeff_Green: db1025 dist-upgrade & reboot
17:37 logmsgbot_: preilly synchronized php-1.20wmf5/extensions/ZeroRatedMobileAccess 'changes for zero needed for carrier testing'
17:36 logmsgbot_: preilly synchronized php-1.20wmf4/extensions/ZeroRatedMobileAccess 'changes for zero needed for carrier testing'
17:35 Ryan_Lane: restarting opendj on virt0 again...
17:29 Ryan_Lane: restarting opendj again on virt0
17:24 Ryan_Lane: restarting gerrit
17:23 LeslieCarr: rebooting stat1 for wipe and reinstall into precise
16:59 Ryan_Lane: restarting opendj on virt0
16:49 Ryan_Lane: restarting mysql on virt0 with correct bind address
16:46 LeslieCarr: changing virt0's ip address and vlan
16:38 cmjohnson1: shutting down search32 to replace main board
16:24 cmjohnson1: sq48 powercycled
16:19 mutante: shut down sq33
16:18 cmjohnson1: performing hard reset on sq33
16:12 mutante: adding gerrit@wikimedia.org to accepted nonmembers of mediawiki-cvs list
15:29 mark: Unstuck torrus
14:46 Ryan_Lane: lowering ttl for virt0
14:17 Jeff_Green: storage3 dist-upgrade and reboot
12:31 mutante: backing up wikitech dir locally on linode instance
09:24 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37482) Adding Proofread Page ext. namespaces on nl.wikisource'
09:08 hashar: finished deploying my wmflabs related change. mediawiki-config is now at commit c0baf3e
09:07 logmsgbot_: hashar synchronized wmf-config
09:06 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings-wmflabs.php
09:05 logmsgbot_: hashar synchronized wmf-config/throttle.php
09:05 logmsgbot_: hashar synchronized wmf-config/mobile-wmflabs.php
09:05 logmsgbot_: hashar synchronized wmf-config/mobile.php
08:47 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php 'bug 37006 - fawiki: add Book namespace + aliases'
08:45 hashar: reverted '(bug 37482) Adding Proofread Page ext. namespaces on nl.wikisource' --> used the wrong configuration setting.
08:37 mutante: installing samba-common-bin, smbclient package upgrades on tridge
08:33 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37482) Adding Proofread Page ext. namespaces on nl.wikisource'
08:28 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php '(bug 37482) Adding Proofread Page ext. namespaces on nl.wikisource'
08:24 hashar: deploying several changes made to mediawiki-config gerrit changes 11034 11035 9131 11036 11037 9132 9136 and 9237
02:52 logmsgbot_: LocalisationUpdate completed (1.20wmf5) at Wed Jun 13 02:52:38 UTC 2012
02:27 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Wed Jun 13 02:27:37 UTC 2012

June 12

23:58 maplebed: started swift container listing loop to compare purge timing when listings are fresh
23:16 logmsgbot_: preilly synchronized php-1.20wmf5/extensions/MobileFrontend/ 'try again'
23:15 logmsgbot_: preilly synchronized php-1.20wmf4/extensions/MobileFrontend/ 'try again'
22:58 logmsgbot_: maxsem synchronized php-1.20wmf4/extensions/MobileFrontend/ 'Updating MobileFrontend'
22:53 logmsgbot_: maxsem synchronized php-1.20wmf4/extensions/ZeroRatedMobileAccess/ 'Updating ZeroRatedMobileAccess'
22:52 logmsgbot_: maxsem synchronized php-1.20wmf4/extensions/MobileFrontend/ 'Updating MobileFrontend'
22:50 logmsgbot_: maxsem synchronized php-1.20wmf5/extensions/ZeroRatedMobileAccess/ 'Updating ZeroRatedMobileAccess'
22:49 logmsgbot_: maxsem synchronized php-1.20wmf5/extensions/MobileFrontend/ 'Updating MobileFrontend'
20:28 RoanKattouw: Correction: the /usr/local/apache filesystem is full on hume, the root fs is not
20:27 RoanKattouw: hume has a full disk
20:23 RoanKattouw: Fixed ownership of php-1.20wmf{4,5}/cache/l10n , should be l10nupdate:wikidev . The wmf4 copy had wrong ownership causing rebuildLocalisationCache.php to fail for shell users (e.g. from scap)
19:40 logmsgbot_: mlitn Finished syncing Wikimedia installation... :
19:13 logmsgbot_: mlitn Started syncing Wikimedia installation... :
18:09 logmsgbot_: py synchronized wmf-config/db.php 're-adding db25 to pool after kern upgares'
18:02 notpeter: halting owa3 for repairs
16:44 logmsgbot_: py synchronized wmf-config/db.php 'removing db25 from pools for kern upgares'
16:27 logmsgbot_: py synchronized wmf-config/db.php 're-adding dbs 33, 34, 36, 50, 55, 56 to pools after kern upgares'
15:55 RobH: virt1001 and virt1002 rebooting, disregard
15:48 cmjohnson1_: shutting down search32 to run a diagnostic test
15:45 binasher: migrating enwiki.bv2009_edits (?) to innodb
15:41 binasher: migrating enwiki.moodbar_feedback to innodb
15:39 binasher: migrating enwiki.aft_article_filter_count to innodb
15:33 logmsgbot_: py synchronized wmf-config/db.php 'removing dbs 33, 34, 36, 50, 55, 56 from pools for kern upgares'
15:31 notpeter: doing another round of DB kernel upgrades
15:25 logmsgbot_: py synchronized wmf-config/db.php
15:14 logmsgbot_: asher synchronized wmf-config/CommonSettings.php 'reenabling mysql pcache'
15:00 binasher: rebooting db40
14:52 binasher: set innodb_max_dirty_pages_pct = 0 on db40 in prep for shutdown
14:48 logmsgbot_: asher synchronized wmf-config/CommonSettings.php 'disabling mysql parsercache (db40) in order to perform maintenance'
14:45 notpeter: putting kern-upgraded DBs back into pools
14:44 binasher: resumed replication on es3, es1002 after cluster23 sync completed
13:41 Jeff_Green: added awight to fr-tech@wikimedia.org email alias
12:06 mutante: gerrit create-project --name=mediawiki/extensions/UniversalLanguageSelector --parent=mediawiki/extensions
11:28 mutante: powercycling downed srv232 (also cause for check_all_memcached crit)
11:08 mutante: powercycled mw1042 to check for hardware issues and fscked. appears to be just unused (though down since ~3d like mw1071 per nagios)
10:37 mutante: test to show linking from !log via SAL to RT: RT:3100 (before/without template)
10:31 hashar: incubatorwiki.translate_messageindex on db39 uses MyISAM engine. See RT #3100
10:25 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php 'bug 37391 , take 2 - Install Translate extension on be.wikimedia.org'
10:24 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php 'bug 37391 , take 2 - Install Translate extension on be.wikimedia.org'
10:23 hashar: Compared translate% tables schema on bewikimedia with incubatorwiki. diff prove they are the same so the schema changes made early are successful.
10:17 hashar: bewikimedia (db39) : dropped tables translate_tmf , translate_tms and translate_tmt I have incorrectly added
09:59 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php 'revert translate extension on be.wikimedia.org, need DB update'
09:58 logmsgbot_: hashar synchronized wmf-config/InitialiseSettings.php 'bug 37391 - Install Translate extension on be.wikimedia.org'
06:24 apergos: db1047 looks like the aft_article_filter_count is missing a few rows compared to the master (after replication caught up), presumably this is a side effect of the repair, have pinged binasher for help, leaving everything running and hope it's tolerable error for a day
02:34 logmsgbot_: LocalisationUpdate completed (1.20wmf5) at Tue Jun 12 02:34:49 UTC 2012
02:25 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Tue Jun 12 02:25:21 UTC 2012
01:35 binasher: passes the dba mantel to notpeter
01:22 notpeter: removing one slave from each db shard to upgrade/restart
00:24 binasher: shutdown mysql on es3. stopped slaving on es1002, rsyncing cluster23 tables to es3
00:09 binasher: pointed es3 to MASTER_LOG_FILE='es1-bin.000788', MASTER_LOG_POS=453509865
00:05 binasher: es3:~# rm -rf /usr/local/mysql*

June 11

23:54 logmsgbot_: asher synchronized wmf-config/db.php 'fully commenting out es3'
23:52 logmsgbot_: asher synchronized wmf-config/db.php 'making es1 the master for blobs cluster 23'
23:51 binasher: es1 is the new master, now switching mw conf
23:48 binasher: preparing to switch es master to es1
23:31 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 27706 - enable RSS extension on uawikimedia'
23:12 logmsgbot_: reedy Finished syncing Wikimedia installation... :
21:49 Reedy: Applied PageTriage schema updates to testwiki and enwiki
20:54 logmsgbot_: reedy Started syncing Wikimedia installation... :
20:07 logmsgbot_: reedy synchronized php-1.20wmf5/
19:20 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mediawikiwiki to 1.20wmf5
18:54 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki to 1.20wmf5 also
18:52 logmsgbot_: reedy synchronized php-1.20wmf5/ 'Now we have some more space...'
18:50 logmsgbot_: reedy synchronized php-1.20wmf3/cache/l10n/ 'Kill l10ncache for php-1.20wmf3 as its not needed'
18:45 logmsgbot_: reedy synchronized php-1.20wmf2/cache/l10n/ 'Kill l10ncache for php-1.20wmf2 as its not needed'
18:44 logmsgbot_: reedy synchronized php-1.20wmf5/ 'Scap is taking an age, just ensure deployment files are in sync'
18:42 binasher: resuming coversion of es1004 to innodb, using compact row format after testing dynamic and compressed
18:03 logmsgbot_: reedy Started syncing Wikimedia installation... : Consistency
17:56 Ryan_Lane: enabling TitleBlacklist on labsconsole
17:28 notpeter: moving /usr/local/apache to /a/apche with symbolic link on searchidx1001 as a temp measure until it can be reimaged
16:58 logmsgbot_: reedy Started syncing Wikimedia installation... : Running scap to ensure consistency
16:53 logmsgbot_: reedy synchronized php-1.20wmf5/cache/l10n/
16:28 mutante: installing security upgrades on sodium
16:19 logmsgbot_: reedy Started syncing Wikimedia installation... : Rebuild messagecache for 1.20wmf5
16:17 logmsgbot_: reedy synchronized wmf-config/ExtensionMessages-1.20wmf5.php
16:02 logmsgbot_: reedy Started syncing Wikimedia installation... : Rebuild messagecache for 1.20wmf5
15:55 logmsgbot_: reedy Started syncing Wikimedia installation... : Rebuild messagecache for 1.20wmf5
15:54 logmsgbot_: reedy Finished syncing Wikimedia installation... : Rebuild messagecache for 1.20wmf5
15:46 mutante: hume /usr/local/apache is out of disk (just 5GB but more branches now). (LVM vg "tank" lv "tank-apache" ) but no free extents. could take from /archive but unsure about shrinking the xfs.
15:35 logmsgbot_: reedy Started syncing Wikimedia installation... : Rebuild messagecache for 1.20wmf5
15:30 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: test2wiki to 1.20wmf5
15:28 logmsgbot_: reedy synchronized php-1.20wmf5/
15:20 Reedy: running sync-dir php-1.20wmf5
15:13 Reedy: Copying checkout of 1.20wmf5 onto NFS
14:53 mutante: running puppet on stat1. installs plotting packages
09:56 apergos: shut down mysqld on db1047, reparing tables
06:54 pp-pdf2: upgraded mwlib to 0.13.8
06:54 pp-pdf3: upgraded mwlib to 0.13.8
06:54 pp-pdf1: upgraded mwlib to 0.13.8
02:24 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Mon Jun 11 02:24:06 UTC 2012

June 10

02:22 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Sun Jun 10 02:22:33 UTC 2012

June 9

07:45 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Sat Jun 9 07:45:17 UTC 2012
02:35 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Sat Jun 9 02:35:29 UTC 2012
02:25 Reedy: Running LU manually
02:14 Reedy: Cleared a bit of space on fenari by deleting checkouts from /tmp
02:00 logmsgbot_: LocalisationUpdate failed: git pull of core failed
01:51 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files:
01:50 logmsgbot_: reedy synchronized wikimedia.dblist
01:49 Reedy: fenari has a full /
01:49 logmsgbot_: reedy synchronized all.dblist

June 8

20:32 Reedy: Updated php to point to php-1.20wmf4 rather than php-1.20wmf3
20:09 logmsgbot_: reedy synchronized wikimedia.dblist
20:08 logmsgbot_: reedy synchronized all.dblist
20:04 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: 2 wikimedia wikis to 1.20wmf4
19:57 logmsgbot_: reedy Finished syncing Wikimedia installation... : Rebuilding localisation cache for message updates
19:39 logmsgbot_: reedy Started syncing Wikimedia installation... : Rebuilding localisation cache for message updates
18:47 logmsgbot_: reedy synchronized php-1.20wmf4/languages/messages/ 'Pushing out updated files upon siebrands request'
16:30 cmjohnson1: shutting down search32 to swap DIMM around
10:19 notpeter: ganglia down, restarting apache on nickel.
09:32 notpeter: stopping indexing on searchidx1001 to re-copy to searchidx2
08:45 notpeter: reimaging searchidx2 with correct partitioning
06:50 Tim: on cp1001: disabled HTCP plugin in gmond for testing, seems to work so I will disable it properly
03:55 Tim: disabled LastModified extension due to overload on cp1005
03:51 logmsgbot_: tstarling synchronized wmf-config/InitialiseSettings.php
03:18 Tim: restarting squid on cp1005, maybe out of FDs or something, cachemgr shows exactly 1000 open connections to 10.2.1.1
03:08 Tim: stopped gmond on cp1001 with kill -STOP for memory leak debugging
03:02 Tim: on cp1002: killed gmond again, it was leaking memory again, already up to 27GB in the few minutes since I restarted it
03:01 Tim: on fenari: copied *.text and *.upload from /home/wikipedia/conf/squid/generated/clusters to /etc/dsh/group
02:56 Tim: cp1001: same as on cp1002, restarted gmond
02:53 Tim: on cp1002: killed gmond, which was using 100% CPU and 23GB RSS. Restarting squid which had died
02:23 logmsgbot_: LocalisationUpdate completed (1.20wmf3) at Fri Jun 8 02:23:09 UTC 2012
02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Fri Jun 8 02:14:55 UTC 2012

June 7

23:32 Tim: deploying varnish configuration change https://gerrit.wikimedia.org/r/#/c/10672/ on cp1041, cp1042, cp1043, cp1044
20:46 logmsgbot_: kaldari synchronized php-1.20wmf4/extensions/LastModified 'syncing LastModified extension'
20:32 logmsgbot_: kaldari synchronized wmf-config/InitialiseSettings.php 'turning on LastModified and E3Experiemnts for en.wiki'
19:54 logmsgbot_: kaldari synchronized php-1.20wmf4/extensions/LastModified/E3Experiments/js/ext.E3Experiments.Timestamp.js 'syncing js file for E3Experiments'
18:58 cmjohnson1_: shutting down search32 for testing
16:52 Ryan_Lane: added ldap automount entries for /public/datasets and /public/keys
02:19 logmsgbot_: LocalisationUpdate completed (1.20wmf3) at Thu Jun 7 02:19:52 UTC 2012
02:11 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Thu Jun 7 02:11:49 UTC 2012

June 6

22:29 logmsgbot_: aaron synchronized wmf-config/swift.php 'deployed 7dc77e431310580da0dbd368b8b290a293e3ee21'
19:55 cmjohnson1: shutting down search32 to reseat DIMM B2
19:18 cmjohnson1: shutting down storage3
18:05 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved remaining wikis to 1.20wmf4
17:02 Jeff_Green: mailman 'site' password changed per RT 3039
11:12 mark: Added AAAA record to mobile
10:38 mark: Wikipedia is IPv6-enabled.
10:37 mark: Added AAAA records to all non-mobile wiki projects
10:19 mark: Added AAAA record to bits.wikimedia.org
10:02 Ryan_Lane: repooling ssl3001
10:00 mark: Added AAAA record to upload.wikimedia.org
09:51 Ryan_Lane: depooling ssl3001
08:41 mark: Converted bits.wikimedia.org into a direct geodns record, removed the old bits -> bits-geo CNAME
07:53 mark: Converted geoiplookup.wikimedia.org into a separate, IPv4-only geodns record
07:37 pp-pdf1: installed tmpreaper cronjob for /home/pp/mathcache directory
07:37 pp-pdf2: installed tmpreaper cronjob for /home/pp/mathcache directory
07:37 pp-pdf3: installed tmpreaper cronjob for /home/pp/mathcache directory
02:35 logmsgbot_: LocalisationUpdate completed (1.20wmf3) at Wed Jun 6 02:35:47 UTC 2012
02:11 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Wed Jun 6 02:10:58 UTC 2012

June 5

22:37 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Undo temporary woff whitelisting'
22:34 logmsgbot_: aaron synchronized php-1.20wmf4/includes/filerepo/file/LocalFile.php 'deployed 4791e3d25aebe9643a7cea91f2eb49e6b54593c5'
22:33 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Temporarily allow uploading woff files on slwikisource'
22:13 logmsgbot_: kaldari synchronized php-1.20wmf4/extensions/PageTriage/modules/ext.pageTriage.models/ext.pageTriage.article.js 'updating default PageTriage filters'
20:56 logmsgbot_: mmullie Finished syncing Wikimedia installation... :
20:16 logmsgbot_: mmullie Started syncing Wikimedia installation... :
19:49 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Fix notice in MobileFrontend config'
19:45 pp-pdf2: updated mwlib to 0.13.7-1-g827780b
19:45 pp-pdf1: updated mwlib to 0.13.7-1-g827780b
19:45 pp-pdf3: updated mwlib to 0.13.7-1-g827780b
19:45 pp-pdf2: updated mwlib to 0.13.7-1-g827780b
18:45 notpeter: starting innobackupex dump from blondel to bellin
18:44 notpeter: starting indexing on new searchidx2
18:39 mark: Added static routes 2002::/16 and 2001::/32 for 6to4 and teredo on the Tampa routers; these are redistributed in OSPF to eqiad
18:29 notpeter: restart indexing on searchidex1001
18:20 mark: Replaced static LVS IPv6 routes with correct next-hops on cr1-eqiad and cr2-eqiad
18:09 mark: Redistributing static routes in OSPF on cr1-eqiad and cr2-eqiad
18:04 paravoid: rebooting capella to make sure things work after a reboot
17:53 mark: Redistributed statics in OSPF3 on csw1-esams
17:14 logmsgbot_: aaron synchronized php-1.20wmf4/includes/logging/LogEventsList.php 'deployed d9f146ac42f2884e76390d6bc979eb10032adf7f'
17:09 mark: Added uRPF exception for 6to4 traffic on all routers
17:05 jeremyb: (UTC) 23:42:14 <binasher> !log re-enabled es4 monitoring. its currently our only es server without any tables marked as crashed / needing recovery, myisam recovery has been absent for all systems since the ms servers were migrated off of in nov 2011. (Sum of human knowledge * Renyi entropy = ES)
16:52 mark: Pooled ssl1001
15:44 logmsgbot_: asher synchronized wmf-config/db.php 'returning es2 to service'
15:25 paravoid: rebooting lvs1004 and reinstalling with precise
15:17 binasher: rebooting es2 for kernel + mysql upgrade
15:16 logmsgbot_: asher synchronized wmf-config/db.php 'pulling es2 for kernel+mysql upgrades'
14:56 paravoid: rebooting amslvs3 & amslvs4 to reinstall with precise
14:20 paravoid: rebooting lvs1006 to reinstall with precise
13:51 cmjohnson1: shutting down bellin to replace main board
13:49 notpeter: reimaging db1042
13:40 paravoid: rebooting lvs1005 to reinstall with precise
12:59 paravoid: rebooting lvs2 to reinstall with precise
12:47 Ryan_Lane: changing capella's subnet in DNS
12:10 Ryan_Lane: rebuilding capella as precise
10:01 logmsgbot_: asher synchronized wmf-config/db.php 'putting es1 in production'
09:53 notpeter: cancel that, it's mid-cron. will do later
09:52 notpeter: stopping indexing on searchidx1001 to rsync to searchidx2
09:35 binasher: rebooting es1 for kernel+mysql upgrade. dont need to pull from db.php because it was never correctly added or queried?
09:14 mark: Built PyBal 1.01 for precise, and included it in the precise-wikimedia APT repository
08:45 binasher: restarted mysql on es1004 with default innodb file format as barracuda
08:31 notpeter: reimaging searchidx2
02:36 logmsgbot_: LocalisationUpdate completed (1.20wmf3) at Tue Jun 5 02:36:44 UTC 2012
02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf4) at Tue Jun 5 02:14:17 UTC 2012

June 4

23:23 logmsgbot_: asher synchronized wmf-config/db.php 'returning es4'
22:49 binasher: started an experiment on es1004 - altering all es tables from myisam to innodb one at a time with file_per_table enabled
22:39 binasher: stopping mysql on es4. all tables marked as having repair fails are in cluster22, resyncing just those from es1002
21:52 logmsgbot_: asher synchronized wmf-config/db.php 'pulling es4 again'
21:43 logmsgbot_: asher synchronized wmf-config/db.php 'returning es4 to service'
21:16 Ryan_Lane: restarting nginx on all ssl boxes again
21:07 Ryan_Lane: force running puppet on all ssl hosts again
21:04 Ryan_Lane: repooling ssl1, ssl1001, ssl3001
21:03 Ryan_Lane: restarting nginx on all ssl hosts
20:23 binasher: rebooted es4
20:18 logmsgbot_: asher synchronized wmf-config/db.php 'pulling es4 for post-crash upgrade'
19:55 logmsgbot_: kaldari synchronized php-1.20wmf4/extensions/PageTriage/PageTriage.hooks.php 'syncing PageTriage.hooks.php'
19:32 Ryan_Lane: force running puppet on ssl servers
18:22 Reedy: Nuked php-1.20wmf4 on mw64 then ran sync-common. Seems to have dealt with the permission errors
18:11 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki to 1.20wmf4
16:48 binasher: upgraded kernel on db1047 / analytics
16:09 Ryan_Lane: restarting ircecho on manganese
15:21 paravoid: reinstalling lvs1 with precise
15:13 mark: Added new IPv6 LVS prefixes to all routers for uRPF filters; BGP import filters still need adjusting for dual-family sessions
15:08 cmjohnson1: physically power cycling lvs1
15:02 Ryan_Lane: depooling ssl1001 and ssl3001
14:55 Ryan_Lane: disabling puppet on all ssl hosts
13:27 mark: Changed upload.esams.wikimedia.org CNAME to upload-lb.esams, effectively disabling the IPv6 selective answer script
12:23 mark: Upgrading wikimedia-lvs-realserver to version 0.08 across the cluster (by Puppet)
12:18 Ryan_Lane: depooling ssl1
11:32 mark: Copied wikimedia-lvs-realserver 0.08 from APT distribution precise-wikimedia to lucid-wikimedia
02:38 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Mon Jun 4 02:38:15 UTC 2012
02:14 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Mon Jun 4 02:14:45 UTC 2012

June 3

15:45 paravoid: aborting lvs1 install, partition map is not ready; putting it back to production as-is
15:31 paravoid: reinstalling lvs1 with precise
15:10 RobH: torrus failed to refresh via puppet (failed refresh takes too long) so manually running the refresh/rebuild command as puppet copied the updates to the system
14:55 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 37237 - Change Wikisource namespace for Tamil wikisource'
14:49 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 37211 - Set $wgUseCombinedLoginLink = false'
14:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 37294 - Add English Wikibooks as import source at Vietnamese Wikibooks'
14:07 logmsgbot: midom synchronized wmf-config/db.php
13:10 logmsgbot: hashar synchronized wmf-config/InitialiseSettings.php '59753a9 (allow bureaucrats on frwiki to add+remove accountcreator group'
13:09 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'Commits: 6bef518 (wgHTCPMulticast only used on production cluster) and 882dd69 (wgLoadScript only used on production) -- was not correctly deployed earlier'
13:02 logmsgbot: midom synchronized wmf-config/db.php
10:18 hashar: mw64: rsync: write failed on "/apache/common-local/wmf-config/CommonSettings.php": No space left on device (28)
10:17 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'Commits: 6bef518 (wgHTCPMulticast only used on production cluster) and 882dd69 (wgLoadScript only used on production)'
09:16 notpeter: pushing new zone files. only minor changes
02:35 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Sun Jun 3 02:35:31 UTC 2012
02:13 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Sun Jun 3 02:13:39 UTC 2012

June 2

15:37 hashar: We ran out of beer, see bug 37307
15:06 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Temporarily enable $wmgReduceStartupExpiry on testwiki for Berlin tutorial'
14:12 logmsgbot: hashar synchronized wmf-config/CommonSettings.php
14:10 logmsgbot: hashar synchronized wmf-config/ext-wmflabs.php
14:10 hashar: deploying some nasty configuration changes in wmf-config
14:10 logmsgbot: hashar synchronized wmf-config/ext-pmtpa.php
14:04 logmsgbot: reedy synchronized wmf-config/proofreadpage.php 'Default proofreadpage-showheaders to 1 on enwikisource and svnwikisource'
13:46 mutante: rebuilding archives for fd-advisorygroup mailing list
13:21 RobHalsell: updated quotas on labstore1 for publicdata-proect
08:57 logmsgbot: reedy synchronized wmf-config/ 'https://gerrit.wikimedia.org/r/#/c/9717/'
02:36 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Sat Jun 2 02:36:20 UTC 2012
02:14 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Sat Jun 2 02:14:25 UTC 2012

June 1

20:54 logmsgbot: reedy synchronized php-1.20wmf4/includes/specials/SpecialRecentchanges.php
20:53 logmsgbot: reedy synchronized php-1.20wmf4/includes/SpecialPage.php
20:28 logmsgbot: reedy synchronized php-1.20wmf4/extensions/ExtensionDistributor/
20:27 logmsgbot: reedy synchronized php-1.20wmf3/extensions/ExtensionDistributor/
19:37 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36805 - Enable NewUserMessage extension on mrwiki and mrwikisource'
19:34 logmsgbot: reedy synchronized wmf-config/ 'Fix some typos'
19:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36965 - Please setup Collection extension on Telugu Wikipedia'
19:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 37027 - Install Collection extension in Hebrew Wiktionary'
19:15 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Enabling collection on tawikis'
19:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enabling collection on tawikis'
17:56 logmsgbot: reedy synchronized wmf-config/
16:50 logmsgbot: reedy Finished syncing Wikimedia installation... : Testing
16:23 logmsgbot: reedy Started syncing Wikimedia installation... : Testing
15:33 Reedy: pointing php to php-1.20wmf3
13:22 logmsgbot: maxsem synchronized php-1.20wmf3/extensions/MobileFrontend/ 'Deploying MF fixes'
13:21 logmsgbot: maxsem synchronized php-1.20wmf4/extensions/MobileFrontend/ 'Deploying MF fixes'
12:19 Reedy: Purged www.mediawiki.org/xml/export-0.7.xsd
12:17 logmsgbot: reedy synchronized docroot/mediawiki/xml/export-0.7.xsd 'Push out updated version of export-0.7.xsd'
06:45 apergos: reboot dataset2, kernel update and security updates
02:33 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Fri Jun 1 02:33:36 UTC 2012
02:11 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Fri Jun 1 02:11:08 UTC 2012

May 31

23:40 logmsgbot: kaldari Finished syncing Wikimedia installation... : scapping for new LastModified and E3Experiments extensions
23:14 logmsgbot: kaldari Started syncing Wikimedia installation... : scapping for new LastModified and E3Experiments extensions
23:09 logmsgbot: aaron synchronized multiversion/ 'Updating multiversion code to head.'
22:25 logmsgbot: kaldari Started syncing Wikimedia installation... :
22:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Re-enable randomrootpage again'
22:04 logmsgbot: aaron synchronized wmf-config/swift.php 'Purge from squid all thumbs in Swift on purge.'
21:50 logmsgbot: reedy synchronized php-1.20wmf4/includes/specials/SpecialLog.php
21:50 logmsgbot: reedy synchronized php-1.20wmf4/includes/logging/LogEventsList.php
20:46 pp-pdf2: changed tmpreaper params in crontab, delete files after 16 hours instead of 48
20:46 pp-pdf3: changed tmpreaper params in crontab, delete files after 16 hours instead of 48
20:46 pp-pdf1: changed tmpreaper params in crontab, delete files after 16 hours instead of 48
20:38 logmsgbot: lcarr synchronized wmf-config/mc.php
20:00 LeslieCarr: powering down mw32 for maintenance
19:59 LeslieCarr: powering down mw30 for maintenance
19:57 LeslieCarr: powering off mw31
17:32 LeslieCarr: rebooting mw1135 for kernel upgrade
17:24 cmjohnson1: running memtet on mw64
17:14 LeslieCarr: rebooting unresponsive mw1143
17:13 LeslieCarr: rebooting unresponsive mw1135
17:10 LeslieCarr: rebooted mw1091 for kernel upgrade
17:09 LeslieCarr: rebooted ms1004 for kernel upgrade
17:08 LeslieCarr: rebooted mw1102 because it thinks it has no eth0
17:05 LeslieCarr: rebooted mw1091 due to being unresponsive
17:01 LeslieCarr: rebooted ms1004 due to it being unresponsive
17:01 LeslieCarr: rebooted ms1004
14:28 binasher: pulling srv199 from lvs again for further experimentation
14:03 binasher: returning srv199 to lvs (dialed back to slow / no longer running php 5.4)
13:40 maplebed: upgrading and rebooting eqiad es hosts due to 210 day kernel bug thingy.
13:04 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bring in a few shell requests'
12:46 logmsgbot: reedy synchronized php-1.20wmf4/languages/Language.php '(bug 36839) Use mb_check_encoding() if available'
12:45 logmsgbot: reedy synchronized php-1.20wmf3/languages/Language.php '(bug 36839) Use mb_check_encoding() if available'
10:47 binasher: temporarily pulled srv199 from lvs for php testing
10:44 Reedy: reedy synchronized php-1.20wmf4/extensions/UploadWizard/ 'Push trunk UW to cluster'
10:43 Reedy: reedy synchronized php-1.20wmf3/extensions/UploadWizard/ 'Push trunk UW to cluster'
02:32 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Thu May 31 02:32:48 UTC 2012
02:17 logmsgbot: kaldari synchronized wmf-config/InitialiseSettings.php 'syncing InitialiseSettings to disable PageTriage on test2'
02:10 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Thu May 31 02:10:43 UTC 2012
00:26 logmsgbot: awjrichards synchronizing Wikimedia installation... :

May 30

23:01 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php
22:22 logmsgbot: awjrichards synchronizing Wikimedia installation... : Weekly MobileFrontend deployment and picking up ZeroRatedMobileAccess i18n changes
21:50 logmsgbot: kaldari synchronized php-1.20wmf4/extensions/CentralNotice 'deploying 98db6a177df977a699576da9688588c77bf81b04'
21:30 K4-713: Synchronized payments cluster to DonationInterface 43a457e56d
21:16 LeslieCarr: rebooted db1044 (unresponsive server)
21:10 LeslieCarr: rebooted db1031 (unresponsive server)
21:08 LeslieCarr: rebooted db1029 (unresponsive server)
21:07 LeslieCarr: restarted db1026 (unresponsive server)
21:03 LeslieCarr: restarted db1012 (unresponsive server)
20:57 LeslieCarr: restarted networking on cp1036
19:34 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable randomrootpage on wikibooks and wikisources'
19:19 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: 27 more of the misc wikis to 1.20wmf4
19:15 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Some more of the misc wikis to 1.20wmf4
19:11 LeslieCarr: rebooting neon
18:55 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All special wikis to wmf4
18:47 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikisource, wikiversity, wiktionary to wmf4
18:45 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikibooks, wikinews, wikiquote to wmf4
18:29 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Commonswiki to wmf4
18:16 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved EN, non-wikipedia, non-special, sites to wmf4
18:02 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Fix capitalisation'
18:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable random root page on testwiki'
17:58 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Add enabling code for randomrootpage'
17:53 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add setting for random root page'
17:47 LeslieCarr: cleared mobile varnish cache
17:45 ssmollett: ganglia uploaded backported ganglia 3.3.5 deb package to precise-wikimedia repository
17:45 ssmollett: ganglia uploaded backported ganglia 3.3.5 deb package to precise-wikimedia repo
17:45 ssmollett: ganglia uploaded backported ganglia 3.3.5 deb package to precise-wikimedia repo
17:40 logmsgbot: aaron synchronized php-1.20wmf4/extensions/PageTriage 'Switched to wmf4 extension branch to get 0be1787634613a36439b760d6d5f0639724f8a7b'
16:06 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'subpages for frwikibooks'
12:00 mutante: restarting pdns on ns2
11:41 mutante: running authdns-update to push analytics1011 to 1022 entries
06:05 logmsgbot: hashar synchronized docroot/mediawiki/xml 'bug 37111 deploying export-0.7.xsd'
02:37 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Wed May 30 02:37:00 UTC 2012
02:23 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Wed May 30 02:23:47 UTC 2012

May 29

21:09 logmsgbot: aaron synchronized wmf-config/swift.php 'Enabled new thumb purge hook on remaining wikis'
20:49 LeslieCarr: renaming analytics1001.eqiad.wmnet to analytics1001.wikimedia.org
20:25 pp-pdf2: restarted services
20:25 pp-pdf3: restarted services
20:25 pp-pdf1: restarted services
20:25 pp-pdf2: cleaned /tmp and sandbox/cache/
20:25 pp-pdf1: cleaned /tmp and sandbox/cache/
20:25 pp-pdf3: cleaned /tmp and sandbox/cache/
20:15 Thehelpfulone: "Site requests" was renamed to "Site configuration" under the Wikimedia product in Bugzilla, don't know who did it though
20:12 LeslieCarr: reloading analytics1001
19:50 pp-pdf2: restarted all services
19:50 pp-pdf3: restarted all services
19:50 pp-pdf1: restarted all services
19:50 pp-pdf3: add libtidy.so
19:50 pp-pdf1: add libtidy.so
19:50 pp-pdf2: add libtidy.so
19:49 pp-pdf3: install mwlib.epub
19:49 pp-pdf2: install mwlib.epub
19:49 pp-pdf1: install mwlib.epub
19:49 pp-pdf1: update simplejson to 2.5.2
19:49 pp-pdf3: update simplejson to 2.5.2
19:49 pp-pdf2: update simplejson to 2.5.2
19:49 pp-pdf1: update mwlib.rl to 0.12.11
19:49 pp-pdf2: update mwlib.rl to 0.12.11
19:49 pp-pdf3: update mwlib.rl to 0.12.11
19:48 pp-pdf1: update pip to 1.1
19:48 pp-pdf3: update pip to 1.1
19:48 pp-pdf2: update pip to 1.1
18:59 LeslieCarr: flushed mobile varnish caches after push
16:54 maplebed: kicking pdns on dobson to try and make it happy again.
16:40 notpeter: decom of all srv lower than 190
16:27 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'https://gerrit.wikimedia.org/r/#/c/9204/ - use protocol-relative url for nostalgiawiki wgSiteNotice'
16:21 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'cleanup wgNoticeBanner_Harvard2011 https://gerrit.wikimedia.org/r/#/c/9205/'
16:17 hashar: /usr/local/apache/common-local is 4G where as / is 7G on srv187. Looks like deploying wmf2 + wmf3 + wmf4 will require partitions to be resized.
16:10 hashar: srv187 and srv188 are out of disk space
16:10 logmsgbot: hashar synchronized search-redirect.php 'https://gerrit.wikimedia.org/r/9206 - cleanup search-redirect.php'
14:37 cmjohnson1: removing disk4 on virt1 for replacement
12:18 mutante: killing / restarting morebots
02:37 logmsgbot: LocalisationUpdate completed (1.20wmf4) at Tue May 29 02:37:44 UTC 2012
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Tue May 29 02:24:21 UTC 2012

May 28

23:15 logmsgbot: reedy synchronized php-1.20wmf4/extensions/UploadWizard/ 'Master UW per Kaldari'
18:25 logmsgbot: reedy synchronized php-1.20wmf4/extensions/Translate/tag/PageTranslationHooks.php 'Fix Catchable fatal error'
18:08 logmsgbot: reedy synchronized php-1.20wmf4/LocalSettings.php
18:05 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki and mediawikiwiki to 1.20wmf4
16:13 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild l10n for php-1.20wmf4
15:44 logmsgbot: reedy synchronizing Wikimedia installation... : test2wiki to 1.20wmf4 to build localisation cache
14:39 logmsgbot: reedy synchronizing Wikimedia installation... : Does running scap on its own with no wikis on that version build l10n for it? I suspect not...
14:22 logmsgbot: reedy synchronized php-1.20wmf4/ 'Staging php-1.20wmf4'
14:15 Reedy: sync-dir'ing php-1.20wmf4
14:12 Reedy: Copying php-1.20wmf4 from /tmp to /h/w/c on Fenari
13:12 apergos: doing security updates for a batch of mws in eqiad
11:22 apergos: updated kernel etc on mw1133, reboot
11:08 apergos: powercycled mw1133
09:05 apergos: rebooted snapshot4, 3 for security updates
08:43 apergos: rebooted snapshot1002, security updates (will do the same for 1003, 1004 shortly)
08:37 apergos: rebooted snapshot1001, security updates
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Mon May 28 02:24:05 UTC 2012

May 27

22:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 37134 - s:cs: site settings'
14:45 logmsgbot: asher synchronized wmf-config/db.php 'returning db51 to prod as an s4 slave'
14:34 maplebed: dns and puppet changes for s4 master rotation done.
14:18 binasher: rebooted db51, reslaved
14:09 maplebed: new s4 master position post-rotation is master_log_file="db31-bin.000334", master_log_pos=583315125
14:02 logmsgbot: ben synchronized wmf-config/db.php 's4 master switch complete; db31 is the new master. turning off read only on s4'
13:25 Tim: on db31 set global read_only=1
13:17 logmsgbot: tstarling synchronized wmf-config/db.php 's4 read-only and taking out db51'
04:27 binasher: kaulen - temporarily disabled swap and set oom_adj score to 15 for apache
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Sun May 27 02:24:13 UTC 2012

May 26

19:54 apergos: restarting apache2 on kaulen
19:34 Nemo_bis: bugzilla down again
14:19 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Settting wgBlacklistSettings'
11:05 apergos: stopping and restarting apache on kaulen blah blah blah
09:12 Reedy: Bugzilla is down, Kaulen looks to be in swap death again
06:46 apergos: powercycling kaulen
05:53 hashar: kaulen dead :-[
05:47 hashar: Bugzilla on Kaulen being super slow again
02:22 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Sat May 26 02:22:37 UTC 2012
00:40 K4-713: synchronized payments cluster to DonationInterface 04809f1cf0d

May 25

22:57 maplebed: powercycled kaulen on the mgmt interface
19:09 maplebed: disabled the outdated /etc/init.d/gmond on spence. use ganglia-monitor instead.
18:57 RobH: bugzilla appears back online
18:57 RobH: kaulen is rebooted, it may have had a runaway process or a memory leak, not sure yet, but it was locked up from access
18:55 RobH: kaulen serial console unresponsive, rebooting
17:42 paravoid: rebooting gurvin & yvon with new kernel
17:25 paravoid: resetting gurvin, load spiking at 370+, SSH unreachable, 214 days of uptime
15:34 mark: Power cycled kaulen
15:23 hashar: kaulen (bugzilla) unreacheable :-(
13:56 RobH: palladium disk replaced
13:52 cmjohnson1: replacing ps2 on mw1017
13:50 RobH: palladium has a bad disk, goign to replace it
13:37 RobH: updating drac on search18, shouldnt cause system reboot.
10:42 apergos: restarted apache on kaulen, was seeing page.cgi segfaults in dmesg and he logs, huge cpu wait spikes (why?)

May 24

23:34 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php
23:32 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Updating technical feedback email address for mobile feedback'
23:17 logmsgbot: awjrichards synchronizing Wikimedia installation... : Picking up changes to hide feedback form to prevent spamming of mobile feedback page - f6ed8ba
23:12 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Enabling 'technical feedback' link on mobile feedback form to disable feedback form'
23:00 binasher: stopped replication on es1002 in order to rsync cluster23 to es1003
21:53 logmsgbot: aaron synchronized php-1.20wmf3/extensions/UploadWizard 'deployed 144b58854e38d910210ccd23402225e5b1d2d62d'
21:52 RoanKattouw: Restarted morebots

May 23

21:46 binasher: shutting down mysql on db12 in able to restart with binlogging disabled
21:45 logmsgbot: asher synchronized wmf-config/db.php 'pulling db12 from enwiki, moving watchlist / special queries to db60'
21:34 ottomata1: upgraded udp-filter to 0.2.4 on oxygen, emery, and locke (with maplebed's help)
18:11 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Last 286 wikis over to 1.20wmf3
17:34 maplebed: deployed change to varnish configs for preilly; adding more carriers
17:13 logmsgbot: ben synchronized wmf-config/CommonSettings.php 'changing the URL for the mobile feedback page to the Project namespace'
17:12 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/ 'RL hack'
17:00 logmsgbot: reedy synchronized wmf-config/ 'Tidying unpushed changes'
15:35 RobH: ns1 died on update, restarting pdns
15:34 RobH: updated dns for analytics mgmt
02:45 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Wed May 23 02:45:44 UTC 2012
02:23 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Wed May 23 02:23:33 UTC 2012
01:10 K4-713: synchronized payments cluster to DonationInterface 4be175e43f
00:16 Ryan_Lane: flushed the varnish cache for mobile again
00:13 logmsgbot: preilly synchronized php-1.20wmf3/extensions/MobileFrontend/ 'try again'
00:11 logmsgbot: preilly synchronized php-1.20wmf3/extensions/MobileFrontend/ 'try again'
00:06 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/ 'try again'
00:00 logmsgbot: preilly synchronized php-1.20wmf3/extensions/MobileFrontend/ 'try again'

May 22

23:49 Ryan_Lane: flushed the varnish cache for mobile again
23:31 Ryan_Lane: flushed the varnish cache for mobile
23:21 logmsgbot: preilly synchronizing Wikimedia installation... : MobileFrontend Weekly Deployment
21:34 RobH: updating dns for mgmt of new servers in eqiad
20:52 logmsgbot: reedy synchronized php-1.20wmf3/extensions/MoodBar/ 'updating to master'
20:24 notpeter: powering up db1003
20:05 notpeter: starting xtrabackup dump from db1004 to db1020 for new eqiad s4 slave
19:55 notpeter: starting xtrabackup dump from db1033 to db1001 for new eqiad s1 slave
19:31 notpeter: reimaging db1001 and db1020
19:18 RobH: dns update for new servers mgmt ips
19:12 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilding message files for interwiki extension
18:44 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/file/LocalFile.php 'deployed 826f82eaccdf2a017a8ddb27829156f7c474db84'
18:44 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php
18:16 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php 'deployed 178a8597e32122feeb593219452f26864639d9ad'
17:56 maplebed: done with deploy to swift to make mediawiki write thumbnails for all wikis
17:47 logmsgbot: aaron synchronized wmf-config/swift.php 'Switched all wikis to new Swift thumb copy hook.'
17:44 maplebed: starting deploy to make mediawiki write thumbnails to swift for all wikis
17:35 logmsgbot: reedy synchronized php-1.20wmf3/extensions/TranslationNotifications/ 'Pushing new version of translationnotification out'
17:32 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'pushing interwiki loading code'
17:32 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'pushing interwiki variables out'
17:30 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Interwiki/ 'pushing interwiki code to cluster'
03:57 hashar: GlusterFS receiving 30Mbytes/sec of input traffic. Killing labs again :-D
02:50 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Tue May 22 02:50:09 UTC 2012
02:26 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Tue May 22 02:26:50 UTC 2012
01:40 K4-713: updated payments cluster to Donation Interface 67b40c9307b
00:17 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/file/LocalFile.php 'deployed dfa7120f1bcd2c172096caf0ca65a06119e592c3'

May 21

22:31 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove header fail logging'
22:30 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php
22:20 logmsgbot: preilly synchronized wmf-config/PrivateSettings.php
22:19 logmsgbot: preilly synchronized wmf-config/CommonSettings.php
21:52 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'Don't do content length checking if it's a head request'
21:47 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'Don't do content length checking if it's a head request'
20:50 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Re-enable PageTriage on enwiki'
20:47 logmsgbot: catrope synchronized php-1.20wmf3/extensions/PageTriage 'Deploying raw RL module change, AFTv5 bugfix and PageTriage bugfix'
20:47 logmsgbot: catrope synchronized php-1.20wmf3/extensions/ArticleFeedbackv5 'Deploying raw RL module change, AFTv5 bugfix and PageTriage bugfix'
20:46 logmsgbot: catrope synchronized php-1.20wmf3/includes/resourceloader 'Deploying raw RL module change, AFTv5 bugfix and PageTriage bugfix'
20:46 logmsgbot: catrope synchronized php-1.20wmf2/extensions/PageTriage 'Deploying raw RL module change, AFTv5 bugfix and PageTriage bugfix'
20:46 logmsgbot: catrope synchronized php-1.20wmf2/extensions/ArticleFeedbackv5 'Deploying raw RL module change, AFTv5 bugfix and PageTriage bugfix'
20:45 logmsgbot: catrope synchronized php-1.20wmf2/includes/resourceloader 'Deploying raw RL module change, AFTv5 bugfix and PageTriage bugfix'
18:14 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki to 1.20wmf3
18:04 Jeff_Green: dist-upgrade and reboot loudon
17:48 andrewbogott_: ran authdns-update on dobson to pick up virt1002-1008 changes
17:13 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/LocalRepo.php 'deployed dfa7120f1bcd2c172096caf0ca65a06119e592c3'
16:22 mutante: analytics1001 to 1010 installed and up in puppet
12:45 mark: Started ircecho on manganese
02:46 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Mon May 21 02:46:37 UTC 2012
02:40 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'There's no helping metawiki now!'
02:23 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Mon May 21 02:23:51 UTC 2012

May 20

19:29 logmsgbot: aaron synchronized wmf-config/swift.php 'more profiling'
17:03 logmsgbot: demon synchronized wmf-config/CommonSettings.php 'Syncing I6b0e91cd/bug 36931: tweaking account creation whitelist for ptwiki event'
02:43 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Sun May 20 02:43:08 UTC 2012
02:21 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Sun May 20 02:21:20 UTC 2012

May 19

21:07 cmjohnson1: shutting down storage3 to replace RAID controller card
18:37 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling PageTriage extension on enwiki per request from Kaldari, due to bug 36968'
02:43 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Sat May 19 02:43:56 UTC 2012
02:22 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Sat May 19 02:22:21 UTC 2012

May 18

23:27 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Set $wgSiteStatsAsyncFactor=1 on commonswiki.'
20:18 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'Tighten debugging'
20:12 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'Widen debugging'
20:06 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/FileBackendStore.php 'deployed 0624af8f2e9666fbe0820c0caca6d7ea3c6eeb7b'
20:02 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 're-enable debugging (fatal disabled)'
19:29 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Put wikibooks back on 1.20wmf3
19:27 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'Disable debugging and content length checking for now'
19:26 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'better debugging'
19:24 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'better debugging'
19:21 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'RE-enable header fail stuff with debug logs'
19:21 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Add debug log group for headerfail'
19:00 logmsgbot: reedy synchronized php-1.20wmf3/includes/HttpFunctions.php 'Trying older version'
18:36 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Push wikibooks back to 1.20wmf2 due to collection being broken
18:31 logmsgbot: reedy synchronized php-1.20wmf2/extensions/Collection
18:28 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Collection '1.20wmf2 collection for testing'
18:12 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki back to 1.20wmf3
18:11 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki back to 1.20wmf2
18:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable collection on test2wiki'
18:02 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Collection/Collection.session.php
18:00 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Collection/Collection.templates.php 'Testing partial revert'
15:45 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'Applying various changes made this afternoon: 718fb59..811dbd8'
14:25 hashar: setup a Jenkins job to lint PHP files in operations/mediawiki-config.git:/wmf-config/
14:00 mutante: authdns-update - pushing fix for reverse lookup in eqiad subnets
12:04 logmsgbot: hashar synchronized wmf-config/CommonSettings.php 'Syncing https://gerrit.wikimedia.org/r/7931 & https://gerrit.wikimedia.org/r/7934 : minor pmtpa/wmflabs switches'
10:39 logmsgbot: hashar synchronized wmf-config/wgConf.php 'https://gerrit.wikimedia.org/r/#/c/7933/ change cluster name "beta" to "wmflabs"'
08:59 logmsgbot: nikerabbit synchronized php-1.20wmf3/extensions/Translate/specials/SpecialAggregateGroups.php 'Temp fix for bug 36944'
03:01 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable UploadWizard on donatewiki and foundationwiki'
02:43 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Fri May 18 02:43:38 UTC 2012
02:22 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Fri May 18 02:22:11 UTC 2012
01:42 Tim: on cp1004: set net.ipv4.tcp_tw_recycle=0 and net.ipv4.tcp_tw_reuse=1
01:39 binasher: filejournal migration complete
00:50 binasher: migrating fliejournal to innodb on all wikis
00:22 Tim: on cp1005: set tcp_tw_recycle=0

May 17

22:18 binasher: migrated centralauth.wikiset to innodb
22:01 binasher: migrating centralauth.spoofuser to innodb via osc (13.5mil rows)
22:00 binasher: migrated centralauth.global_group to innodb
21:53 maplebed: reverted mobile change from this morning - testing completed.
21:42 binasher: es1004 is replicating again
21:39 binasher: resumed replication to es1002
21:21 logmsgbot: catrope synchronized php-1.20wmf3/extensions/ArticleFeedbackv5/ArticleFeedbackv5.php 'Deploy 24ddcdf507e615b1942147654ccde1bdc4ea4bfa'
21:20 logmsgbot: catrope synchronized php-1.20wmf2/extensions/ArticleFeedbackv5/ArticleFeedbackv5.php 'Deploy 24ddcdf507e615b1942147654ccde1bdc4ea4bfa'
21:09 logmsgbot: aaron synchronized wmf-config/swift.php '+commonswiki'
20:58 logmsgbot: aaron synchronized wmf-config/swift.php 'revert change to itwiki'
20:57 Jeff_Green: several package updates on payments* and silicon
20:56 logmsgbot: aaron synchronized wmf-config/swift.php '+itwiki'
20:40 logmsgbot: aaron synchronized wmf-config/swift.php
20:34 maplebed: deployed change to swift and mediawiki for MW to write thumbnails to swift instead of rewrite.py with aaron
20:32 maplebed: deployed parallel thumbnail purging for test, test2, and mediawiki with aaron
20:25 logmsgbot: aaron synchronized wmf-config/swift.php 'Enabled thumb copy hook for testwikis and mw.org'
20:14 binasher: completed securepoll_votes.vote_ip and all ipv6 schema migration
20:10 logmsgbot: aaron synchronized wmf-config/CommonSettings.php
20:09 logmsgbot: aaron synchronized wmf-config/swift.php
20:03 logmsgbot: aaron synchronized wmf-config/swift.php 'Enabling new purge hook on testwikis again.'
20:03 binasher: running securepoll_votes.vote_ip schema migration on s1
20:01 binasher: running securepoll_votes.vote_ip schema migration on all s2 dbs
19:19 binasher: running securepoll_votes.vote_ip schema migration on all s4 + s3 dbs
19:17 binasher: running securepoll_votes.vote_ip schema migration on all s5 dbs
19:16 binasher: running securepoll_votes.vote_ip schema migration on all s6 dbs
19:02 binasher: running securepoll_votes.vote_ip schema migration on all s7 dbs
18:49 binasher: syncing cluster23 tables from es1002 to es1004
18:46 binasher: stopped replication on es1002
18:44 notpeter: restarting puppet on brewster
18:39 logmsgbot: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
18:13 logmsgbot: aaron synchronized wmf-config/swift.php
18:08 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo 'deployed 103efda39dd57bc22898bd0e69932982c1cfd588'
18:00 Jeff_Green: shutting down grosley for disk and RAM upgrades
17:42 notpeter: temporarily turning off puppet on brewster for preseed hackz
17:20 maplebed: flushing the mobile cache post-deploy
17:17 maplebed: deploying config change to mobile - more zero IP addresses. gerrit r7867
15:31 logmsgbot: dzahn synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
15:30 mutante: sync-common-file interwiki.cdb
15:30 mutante: creating fresh interwiki.cdb from dumpInterwiki.php
15:30 Jeff_Green: adding DNS records to wikimedia.org for RT #2960
14:22 mutante: adding gerrit project analytics/udplog parent analytics
13:44 cmjohnson1: shutting down bellin for troubleshooting
09:04 hashar: Site outage was due to our custom wfLogXFF() which uses wfErrorLog(). $wmfUdp2logDest not being global there, caused exception to be shown.
08:59 hashar: Broken the cluster by having an invalid global set
08:58 logmsgbot: hashar synchronized wmf-config/CommonSettings.php
08:47 logmsgbot: hashar synchronizing Wikimedia installation... :
08:44 hashar: running scap to apply https://gerrit.wikimedia.org/r/7702
08:41 hashar: Deploying https://gerrit.wikimedia.org/r/7702 which abstract out the udp2log destination
08:15 hashar: WMFLabs seems to have recovered now
06:50 hashar: WMFLabs dieing out, I/O latency raised constantly over the last 2 hours and eventually lead to situation where system (via ssh) is not usable anymore
03:41 logmsgbot: asher synchronized wmf-config/db.php 'returning db12 and db46'
02:48 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Thu May 17 02:48:02 UTC 2012
02:22 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Thu May 17 02:22:02 UTC 2012
02:18 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Enable SpecialCite everywhere'
01:40 Tim: on cp1004: reverted after TIME_WAIT client connections reached 38k with no sign of a plateau
01:37 Tim: on cp1004: trying tcp_tw_reuse=1 instead of tcp_tw_recycle
01:00 Tim: reverted after client-side TIME_WAIT connections rose rapidly from 367 to 9000
00:59 Tim: experimentally setting net.ipv4.tcp_tw_recycle=0 on cp1004

May 16

23:50 logmsgbot: aaron synchronized php-1.20wmf3/includes/upload/UploadBase.php 'deployed 4b0a61227fce37202da2b62b7dc2474bd227873f'
22:47 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Made mediawikiwiki use $wgSiteStatsAsyncFactor=1.'
22:32 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Set $wgSiteStatsAsyncFactor=1 on testwikis.'
21:45 maplebed: reverted this morning's mobile push - tests completed
21:12 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36913 - Enable Collection on kkwiki'
21:09 logmsgbot: reedy synchronized php-1.20wmf3/cache/interwiki.cdb 'Updating interwiki cache'
21:09 logmsgbot: reedy synchronized php-1.20wmf2/cache/interwiki.cdb 'Updating interwiki cache'
20:05 binasher: ran ipv6 migrations on globalblocks
20:05 binasher: converted centralauth.globalblocks from myisam to innodb
19:57 binasher: rebooting db12 for kernel upgrade
19:51 binasher: stopping mysql on db12
19:50 binasher: recentchanges.rc_ip migration completed
19:47 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Last bits of tidying up
19:44 binasher: rebooted db46
19:38 binasher: shutting down mysql on db46, preparing to reboot for kernel upgrade
19:32 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'touch'
19:28 logmsgbot: asher synchronized wmf-config/db.php 'pulling db12 from enwiki, moving watchlist / special queries to db59'
19:25 binasher: running recentchanges.rc_ip (ipv6) schema migration on enwiki master (5.2mil rows) via os��c - batten down the hatches!
19:17 binasher: running recentchanges.rc_ip (ipv6) schema migration on s2 dbs via os��c
19:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'touch'
19:14 Reedy: manually ran ddsh -cM -g mediawiki-installation -o -oSetupTimeout=30 -F30 "sudo -u mwdeploy rsync -a 10.0.5.8::common/*.dblist /usr/local/apache/common-local" because sync-dblist is woefully out of date..
19:13 notpeter: restarting ganglia on nickel
19:09 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: 12 more misc/wikimedia wikis to 1.20wmf3
18:59 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All closed wikis to 1.20wmf3
18:55 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All special wikis to 1.20wmf3
18:54 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wikimedia wikis to 1.20wmf3
18:52 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wikisource to 1.20wmf3
18:50 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wikiquote to 1.20wmf3
18:48 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wikiversity to 1.20wmf3
18:45 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wiktibooks to 1.20wmf3
18:42 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wiktionaries to 1.20wmf3
18:40 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All wikinews to 1.20wmf3
18:31 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: All non wikipedia en projects to 1.20wmf3
18:27 binasher: running recentchanges.rc_ip (ipv6) schema migration on s3 dbs via os��c (s4 already completed during prior testing)
18:25 mutante: synced wikiversions.* files from NFS to spence local to prevent death of check_job_queue monitoring
18:21 binasher: running recentchanges.rc_ip (ipv6) schema migration on s5 dbs via os��c
18:19 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: commonswiki to 1.20wmf3, again
18:17 logmsgbot: aaron synchronized php-1.20wmf3/includes/ImagePage.php 'deployed 86e2372772e618c5d1238ae480d9f632789bbe50'
18:13 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: commonswiki back to 1.20wmf2
18:10 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: commonswiki to 1.20wmf3
18:10 binasher: running recentchanges.rc_ip (ipv6) schema migration on all s6 dbs via os��c
18:03 binasher: running recentchanges.rc_ip (ipv6) schema migration on all s7 dbs via os��c
17:43 binasher: ipblocks migration completed for all wikis
17:38 binasher: running ipblocks schema migration on all s2 dbs via os��c
17:35 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend 'Picking up fix for fatal in api in MobileFrontend at 9936e7a'
17:34 logmsgbot: awjrichards synchronized php-1.20wmf3/extensions/MobileFrontend 'Picking up fix for fatal in api in MobileFrontend at 9936e7a'
17:16 maplebed: deploying change to swift to make which containers write thumbs configurable
17:11 logmsgbot: preilly synchronized php-1.20wmf3/extensions/MobileFrontend/ 'zero and mobile changes'
17:10 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/ 'zero and mobile changes'
17:08 RobH: aluminum back online
17:00 binasher: running ipblocks schema migration on all s3 (819) dbs via os��c
16:59 binasher: running ipblocks schema migration on all s4 dbs via os��c
16:58 binasher: running ipblocks schema migration on s5/dewiki via osc
16:57 RobH: aluminum shut down for hard disk additions
16:56 binasher: running ipblocks schema migration on all s6 dbs via osc
16:51 RobH: udpating dns for osm web servers
16:50 binasher: running ipblocks schema migration on all s7 dbs via osc
16:49 logmsgbot: preilly synchronized php-1.20wmf2/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
16:41 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php 'deployed 634c3be2bba6a46e28aa997d7ab388ebf90b36a6'
16:31 maplebed: clearing the mobile varnish cache
16:29 maplebed: deploying gerrit change 7798 to the mobile varnish servers
07:41 logmsgbot: raindrift synchronized php-1.20wmf3/extensions/PageTriage/api/ApiPageTriageTemplate.php 'fixing exception bug that makes lots of logspam'
07:41 logmsgbot: raindrift synchronized php-1.20wmf2/extensions/PageTriage/api/ApiPageTriageTemplate.php 'fixing exception bug that makes lots of logspam'
06:20 Ryan_Lane: restarted lucene on search1015
05:58 Tim: setting net.ipv4.tcp_tw_recycle=1 on cp1005 seems to have fixed it, doing it on cp1004 as well now
05:52 Tim: on cp1005 setting tcp_tw_recycle=1
05:29 Tim: experimentally started squid on cp1004
04:05 hashar: updating a few plugins on Jenkins (host: gallium )
03:34 Ryan_Lane: stopped the squid process on cp1004 and stopped puppet to avoid it being restarted. it's having issues and I can't debug it right now.
03:22 Ryan_Lane: repooling squid frontend on cp1004
03:14 Ryan_Lane: depooling cp1004 and stopping the squid backend service to let some connections close
02:43 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Wed May 16 02:43:51 UTC 2012
01:10 logmsgbot: reedy synchronized php-1.20wmf3/extensions/RandomRootPage 'dark deploy randomrootpage extension (I'll enable it later)'

May 15

23:53 logmsgbot: aaron synchronized php-1.20wmf3/includes/Block.php 'deployed 7694faf68f975ea9c4888d575b33dabb84e90083'
23:42 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php
23:27 ssmollett: upgraded ganglia-monitor and gmetad from 3.1.2-2.1 to 3.3.5-2
23:26 logmsgbot: awjrichards synchronizing Wikimedia installation... :
23:24 K4-713: upgraded minfraud version on the payments account
23:22 K4-713: updated and synchronized the payments cluster to DonationInterface d997e7ea1c
23:06 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
22:50 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Disable CentralAuth logging to file'
22:50 logmsgbot: awjrichards synchronizing Wikimedia installation... : Updating MobileFrontend to 0880467
22:34 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Enable CentralAuth logging to file'
22:13 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php
21:09 logmsgbot: raindrift synchronizing Wikimedia installation... : PageTriage update
21:05 logmsgbot: raindrift synchronizing Wikimedia installation... : PageTriage update
19:40 logmsgbot: aaron synchronized wmf-config/swift.php 'disabled new hook for now.'
19:35 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Add wfDebugLog call'
19:31 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Add wfDebugLog call'
19:30 logmsgbot: reedy synchronized wmf-config/CommonSettings.php '$wgDebugLogGroups[updateTranstagOnNullRevisions] = udp://10.0.5.8:8420/updateTranstagOnNullRevisions'
19:26 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Only conditionally disable updateTranstagOnNullRevisions hook. Debugging to come'
17:45 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php
17:41 logmsgbot: aaron synchronized php-1.20wmf3/includes/filerepo/backend/SwiftFileBackend.php
17:35 logmsgbot: aaron synchronized wmf-config/swift.php 'Use new thumb purge hook for testwikis'
16:54 RobHalsell: updated apache config for wiki-pedia.org, seems the bot doesnt spam that anymore =[
16:36 mutante: srv app servers max. uptime with older kernel down to ~120 days after another bunch of upgrades
16:34 RobHalsell: updating dns for wiki-pedia.org
12:20 hashar: deployment-prep replaced most occurrences of /mnt/upload to /mnt/upload6
10:37 apergos: on db39 dropped triggers pt_osc_elwiki_recentchanges ins, del, upd, they were preventing all elwiki edits except bot edits with the complaint Table 'elwiki._recentchanges_new' doesn't exist ... binasher, doublecheck me please?
09:24 mutante: srv278 - still has issues as in reopnened RT #24 - upgrading kernel anyways
03:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'update wgUploadNavigationUrl on all cs wikis'
02:35 logmsgbot: LocalisationUpdate completed (1.20wmf3) at Tue May 15 02:35:53 UTC 2012
02:23 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Tue May 15 02:23:47 UTC 2012
01:09 logmsgbot: asher synchronized wmf-config/db.php 'returning db31 as an s4 slave'
01:05 logmsgbot: aaron synchronized php-1.20wmf3/extensions/SwiftCloudFiles/php-cloudfiles-wmf/cloudfiles.php 'deployed f20e752630575f8384083f0ad0401e250c8babf5'
01:00 binasher: shutting down mysql on db31, then rebooting
00:59 logmsgbot: asher synchronized wmf-config/db.php 'pulling db31 from s4 for kernel upgrade'
00:58 binasher: new s4 master position - MASTER_LOG_FILE='db51-bin.000114', MASTER_LOG_POS=1772578
00:57 logmsgbot: asher synchronized wmf-config/db.php 'new s4 master'
00:55 logmsgbot: asher synchronized wmf-config/db.php 'setting s4 to read-only, switching master to db51'
00:54 binasher: preparing to rotate s4 master from db31 to db51
00:48 logmsgbot: reedy synchronized wmf-config/CommonSettings.php
00:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bring in numerous shell requests from gerrit'
00:48 binasher: rebooting db51 for kernel upgrade, prior to promoting to s4 master
00:47 logmsgbot: asher synchronized wmf-config/db.php 'pulling db51 from s4 for kernel upgrade'
00:01 binasher: just completed an online schema change for commonswiki.recentchanges in prod. woo!

May 14

22:02 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
21:08 logmsgbot: reedy synchronized php-1.20wmf3/extensions/Translate/tag/PageTranslationHooks.php 'Live hack out updateTranstagOnNullRevisions'
20:08 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mediawikiwiki to 1.20wmf3
19:38 logmsgbot: reedy synchronizing Wikimedia installation... : Running scap to make sure everything is ok...
19:36 logmsgbot: reedy synchronized php-1.20wmf3/cache/l10n/ 'Resync localisation cache'
19:26 logmsgbot: reedy synchronized live-1.5/ 'Push live-1.5 new symlinks'
19:24 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki to 1.20wmf3
19:16 logmsgbot: reedy synchronized php-1.20wmf3/cache/trusted-xff.cdb
19:14 logmsgbot: reedy synchronized wmf-config/ExtensionMessages-1.20wmf3.php
19:13 logmsgbot: reedy synchronized php-1.20wmf3/extensions/ 'Push extensions out properly'
19:11 binasher: resyncing cluster22 from es1002 to es1004
19:02 logmsgbot: reedy synchronized php-1.20wmf3/LocalSettings.php 'Use newer version'
19:01 logmsgbot: reedy synchronized php-1.20wmf2/LocalSettings.php 'Use newer version'
18:58 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: test2wiki to 1.20wmf3
18:57 logmsgbot: reedy synchronized php-1.20wmf3/cache/l10n/ 'Syncing localisation cache files'
18:49 Ryan_Lane: added OATHAuth to components list for MediaWiki Extensions product in bugzilla
18:43 Ryan_Lane: switching sessions back to memcached for labsconsole
18:42 Ryan_Lane: adding OATHAuth to labsconsole
18:40 Ryan_Lane: completed upgrade to 1.20wmf2 on labsconsole
18:30 Ryan_Lane: upgrading labsconsole to 1.20wmf2
18:26 logmsgbot: reedy synchronized php-1.20wmf3 'Initial pushing of php-1.20wmf3 files to apaches'
18:12 Reedy: Killing old php-1.20wmf1 directories from apaches to save full disks
13:48 mutante: copying outdated wikiversions.dat/.cdb files from /home to /usr/local on spence, which fixes check_job_queue (thanks jeremyb)
13:07 mutante: opening a bz bug for check_job_queue issue related to CommonSettings.php BZ:36835
07:43 mutante: still upgrading/rebooting a couple srv (API) application servers with long uptime
06:22 apergos: restarted lucene search on search1016 it had stopped doing anything useful (see ganglia graphs, also nothinig wtitten to logs)
02:22 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Mon May 14 02:22:09 UTC 2012

May 13

02:24 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Sun May 13 02:24:51 UTC 2012

May 12

02:22 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Sat May 12 02:22:18 UTC 2012

May 11

22:10 logmsgbot: preilly synchronized wmf-config 'add wikimedia to zero image disable list fix header'
21:52 logmsgbot: preilly synchronized wmf-config 'add wikimedia to zero image disable list'
19:49 Reedy: ran apache-graceful-all
19:42 RobH: apache restarted by puppet run on srv286
19:31 RobH: shutting down srv286 and srv286 for power rebalancing
19:23 RobH: srv260 and srv261 back in business
19:10 RobH: srv261 & srv261 shutting down for power rebalancing within the rack
18:33 notpeter: shutting down search 13-20 for hd upgrades
18:05 maplebed: swift: deleting the unsharded version of all sharded containers
18:03 logmsgbot: catrope synchronized php-1.20wmf2/extensions/UploadWizard/ 'Deploy 4b5df1a1151ac80e309d396102e5e2a8d0c27ccb'
17:46 maplebed: deleted wikipedia-de-local-thumb container from swift. the sharded version is currently being used.
15:33 mutante: adding DNS entries for analytics hosts in new vlan 1121 (10.64.21.0/24), hosts starting at .101 to match names analytics1001 = .101 and ++
15:03 mutante: mw62 -unless somebody was on that right now it died. mgmt also just Create Instance Error
14:06 mutante: kernel upgrading / rebooting srv servers where uptime > 200 d order by uptime desc limit 1
13:12 mutante: installing package upgrades on pdf1-3 (and installed requested indic fonts via new puppet role class)
11:39 mutante: starting ms-be swift-container-auditors every once in a while
11:35 mutante: stat1 - installed new kernel, but waiting to reboot. schedule with aotto
11:24 mutante: upgrading packages/kernel on hooper, rebooting (Blog,Etherpad,Racktables)
09:21 mutante: ekrem was close running out of disk again. logrotated apache logs, changed config to: size 512M,rotate 3
08:58 mutante: package upgrades on ekrem (IRC server, WAP, Apple dict...)
08:51 mutante: rebooting marmontel (blog)
08:48 mutante: upgrading apache/mysql/kernel on marmontel (blog)
02:20 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Fri May 11 02:20:39 UTC 2012
02:00 RoanKattouw: Started Apache back up on srv200, done debugging
01:58 logmsgbot: catrope synchronized php-1.20wmf2/extensions/UserDailyContribs/UserDailyContribs.hooks.php 'Deploy 3c45831ffe1817f3dc18f06644db46b1b74173e7'
01:17 RoanKattouw: Stopping Apache on srv200 so I can use it as my guinea pig for segfault debugging
00:56 logmsgbot: tstarling synchronized php-1.20wmf2/includes/User.php 'header log'
00:49 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
00:48 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
00:40 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
00:30 Tim: restarted socat on fenari so that fatal.log is reopened
00:29 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php 'removed logging hack tweaks.'
00:29 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php 'logging hack tweaks.'
00:27 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php 'removed some temp logging'
00:27 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
00:16 logmsgbot: aaron synchronized php-1.20wmf2/includes/User.php
00:00 binasher: pulling cp1044 from lvs for testing

May 10

23:38 logmsgbot: reedy synchronized php-1.20wmf2/extensions/LiquidThreads/classes/Hooks.php 'Updating to master'
22:38 logmsgbot: catrope synchronized php-1.20wmf1/.git 'Make Special:Version show the correct commit now that I have fixed the weird repo state'
22:37 logmsgbot: catrope synchronized php-1.20wmf2/.git 'Make Special:Version show the correct commit now that I have fixed the weird repo state'
22:36 RoanKattouw: Cleaned up weird git repo states on fenari in php-1.20wmf1 and php-1.20wmf2
22:04 maplebed: swift: deleting the unsharded wikipedia-de thumb container contents (the sharded version is currently serving traffic)
19:51 notpeter: rebooting db29 for do a test install of precise
19:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36420 - Wikipedia namespace alias for sr.wp'
19:02 logmsgbot: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
18:57 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36694 - Set wgSitename on srwikisource'
18:43 LeslieCarr: restarting mobile varnish
18:33 LeslieCarr: reloaded and purged cache of mobile varnish
18:03 notpeter: starting innobackupex from db10 to blondel
17:39 notpeter: pushing out new zone files. only minor changes
16:47 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'showupdatemarker on enwiki tooooo'
03:04 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable show update markers on dewiki'
02:14 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Thu May 10 02:14:03 UTC 2012
01:07 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'https://gerrit.wikimedia.org/r/#/c/7133/'
00:11 logmsgbot: catrope synchronized php-1.20wmf2/extensions/UploadWizard 'Deploy b45437b6e09018dacfc78c8e4fa822a917858b2d / 62631485ba36f973c0d4a850ef494a8f84c4c86b'
00:11 logmsgbot: catrope synchronized php-1.20wmf1/extensions/UploadWizard 'Deploy b45437b6e09018dacfc78c8e4fa822a917858b2d / 62631485ba36f973c0d4a850ef494a8f84c4c86b'
00:06 logmsgbot: preilly synchronized wmf-config 'remove MF passwords'
00:01 logmsgbot: preilly synchronized wmf-config 'remove MF passwords'

May 9

23:33 notpeter: taking down search20 to do precise test-install
23:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable Translate on outreachwiki'
23:25 Reedy: Created Translate tables on outreachwiki
22:49 Reedy: ExtensionDistributor fixed
22:32 Reedy: Debugging ExtensionDistributor being broken. Likely to show more debug output on mw.org if you attempt to use it (though, it wouldn't give you what you wanted anyway)
22:15 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
21:53 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/file/LocalFile.php 'deployed b9ac85cbf304a65d900cda00fafe53bf82d7a227'
21:53 logmsgbot: aaron synchronized php-1.20wmf2/includes/SiteStats.php 'deployed b9ac85cbf304a65d900cda00fafe53bf82d7a227'
20:52 LeslieCarr: done
20:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bump memory limit to 128MB'
19:39 Ryan_Lane: updating OpenStackManager on virt0 to master again
19:16 Ryan_Lane: updating OpenStackManager on virt0 to master
18:54 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/file/LocalFile.php 'deployed fa1a8d5119e1174f7458eb9516287f4867c46484'
18:50 RobH: dns update for db61 and db62
18:25 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: 295 other wikipedias over to 1.20wmf2
18:20 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: jawiki to 1.20wmf2
18:16 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: ruwiki to 1.20wmf2
18:12 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: frwiki to 1.20wmf2
18:11 notpeter: turning db30 back on
18:07 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: dewiki to 1.20wmf2
17:51 cmjohnson1: to shutting down storage3
16:58 LeslieCarr: restarted mobile varnish instances
16:58 LeslieCarr: flushed mobile varnish cache
16:54 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Make sure Swift backend will have journaling too.'
16:31 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Removed backend config conditional now that everything was switched over.'
14:06 mutante: started container-auditor on ms-be1
09:24 mutante: started container-auditor on ms-be3 and 4
02:37 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Wed May 9 02:37:02 UTC 2012
02:19 Reedy: Running cleanupUploadStash.php over all wikis
02:13 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Wed May 9 02:13:10 UTC 2012
01:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36506 - Site logo for Tsonga Wikipedia -- ts.wikipedia.org'
01:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36522 - Upload link should lead to UploadWizard instead of commons:Special:Upload'
01:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36663 - Please allow bureaucrats to add and remove autoreviewer status on pt.wiki'
01:30 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wgShowUpdatedMarker enabled on anything that isn't enwiki or dewiki'
01:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36533 - Set sitename to Telugu Wiktionary'
01:25 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36595 - Please enable Extention:NewUserMessage on ml.wikipedia'
01:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36595 - Please enable Extention:NewUserMessage on ml.wikipedia'
01:21 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36571 - Please lock wikimania2011 wiki'
01:21 logmsgbot: reedy synchronized closed.dblist 'Closing wikimania2011wiki'
00:11 maplebed: started process to delete objects that don't exist in the container listings on all swift backends

May 8

23:44 K4-713: synchronized payments cluster to r115155, DonationInterface ccfbb304
23:34 LeslieCarr: purged varnish mobile cache
23:25 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobilefrontend resource version again'
23:25 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/ 'd43f5f19ff3599f16200d247b6838cfb04ef1473'
23:25 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/ 'd43f5f19ff3599f16200d247b6838cfb04ef1473'
23:22 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobilefrontend resource version'
23:11 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
23:11 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/ '2b1e8573fdbcab0feb3a2481167b68fb96abf663'
23:10 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/ '2b1e8573fdbcab0feb3a2481167b68fb96abf663'
22:53 RoanKattouw: Actually fixed it now with chmod -R g+w /h/w/conf/httpd
22:47 RoanKattouw: Fixed permissions in /h/w/conf/httpd by running find -group wikidev -not -perm 020 -exec chmod g+w \{\} \;
22:38 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/stylesheets/sections.css 'Live hack to live test broken interface on ICS devices on very large articles'
22:34 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Enable mobile url transformation on testwiki'
22:13 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'bumping MobileFrontend resource version number'
22:13 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/ 'd828a8196d8bc877afdbd1559e8e6d639b51cef7'
22:12 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/ 'd828a8196d8bc877afdbd1559e8e6d639b51cef7'
21:53 binasher: rebooting db1018 one more time
21:47 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/file/LocalFile.php 'deployed 43aa35016b03935b27d439afe9a6b3f1aad1aa8b'
21:45 Ryan_Lane: adding adminbot to the repo
21:32 binasher: rebooting eqiad core db slaves for kernel upgrade
21:29 logmsgbot: aaron synchronized wmf-config/swift.php 'Added new thumbnail purge/import hooks handlers that use the swift backend class; unused atm.'
21:23 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Added swift backend config; unused atm.'
21:15 logmsgbot: asher synchronized wmf-config/db.php 'returning db45 to service'
21:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Matched wikimania2013wiki configuration to that of wikimania2012wiki'
21:13 maplebed: delpoyed container sharding for thumbnails to swift for 'dewiki', 'fiwiki', 'frwiki', 'hewiki', 'huwiki', 'idwiki', 'itwiki', 'jawiki', 'rowiki', 'ruwiki', 'thwiki', 'trwiki', 'ukwiki', 'zhwiki' (in addition to existing sharding for commons and enwiki)
21:13 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move wikimania2013wiki to php-1.20wmf2
21:12 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Matched wikimania2013wiki configuration to that of wikimania2012wiki'
21:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Matched wikimania2013wiki configuration to that of wikimania2012wiki'
21:10 binasher: shutting down mysql across all eqiad core db slaves
20:59 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'logo for wikimania2013wiki'
20:57 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'remove w'
20:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable translate on wikimania2013wiki'
20:56 logmsgbot: aaron synchronized wmf-config/swift.php 'Switching purge hook to use new sharding scheme.'
20:54 Reedy: Created translate related tables for wikimania2013wiki
20:31 logmsgbot: reedy synchronized php-1.20wmf1/extensions/AntiBot/
20:30 logmsgbot: reedy synchronized php-1.20wmf2/extensions/AntiBot/
20:14 maplebed: creating sharded containers for swift for 'dewiki','fiwiki', 'frwiki', 'hewiki', 'huwiki', 'idwiki', 'itwiki', 'jawiki', 'rowiki', 'ruwiki', 'thwiki', 'trwiki', 'ukwiki', 'zhwiki'
19:54 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Moved remaining wikis over to new backend config'
19:34 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/
19:12 LeslieCarr: flushed mobile varnish cache
19:11 logmsgbot: awjrichards synchronized php-1.20wmf2/extensions/MobileFrontend/
19:10 logmsgbot: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/
18:37 LeslieCarr: reenabled services on fpc5 of cr1-eqiad
18:16 cmjohnson1: updating md1000 controller card firmware on storage3
18:14 LeslieCarr: turned off fpc5 on cr1-eqiad to swap
18:05 LeslieCarr: powering on fpc 5 on cr1-eqiad
18:03 LeslieCarr: powering off fpc5 on cr1-eqiad in order for RobH to physically reseat the card
17:48 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Enable TranslationNotifications on meta, incubator and wikimania2012'
17:44 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Enable TranslationNotifications on mediawikiwiki'
17:42 LeslieCarr: switching all masterships over to cr2-eqiad in preparation to reseat cr1 linecard
17:25 LeslieCarr: flushed the mobile cache
17:24 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilding localisation cache files before TranslationNotification deploy
17:18 logmsgbot: reedy synchronized php-1.20wmf2/extensions/MobileFrontend/ 'Pushing out head'
17:16 logmsgbot: reedy synchronized php-1.20wmf1/extensions/MobileFrontend/ 'Pushing out head'
17:14 RobH: asw-c1-eqiad connected to both cr1 and cr2
15:16 cmjohnson1: shutting down storage3 to replace raid card
12:40 pp-pdf1: updated mwlib to 0.13.7
12:39 pp-pdf2: updated mwlib to 0.13.7
12:36 pp-pdf3: updated mwlib to 0.13.7
11:59 mutante: merging CSS fix for broken mobile site table layout
02:18 RoanKattouw: Removed and recloned /var/lib/l10nupdate/mediawiki/extensions , it was in a weird state because magic extension submodules work now but my hacky workaround for them not working was still in place
02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed
01:12 logmsgbot: tstarling synchronized php-1.20wmf2/includes/api/ApiMain.php
01:10 logmsgbot: tstarling synchronized php-1.20wmf1/includes/api/ApiMain.php
00:44 binasher: rebooted db1034
00:42 logmsgbot: tstarling synchronized php-1.20wmf2/includes/Exception.php
00:42 logmsgbot: tstarling synchronized php-1.20wmf2/includes/DefaultSettings.php
00:37 logmsgbot: tstarling synchronized php-1.20wmf1/includes/Exception.php
00:36 logmsgbot: tstarling synchronized php-1.20wmf1/includes/DefaultSettings.php
00:20 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Switched enwiki to new backend config.'

May 7

23:52 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'bumping mobilefrontend resource version #'
23:44 logmsgbot: reedy synchronized php-1.20wmf2/extensions/PageTriage/
23:43 logmsgbot: reedy synchronized php-1.20wmf1/extensions/PageTriage/
23:35 logmsgbot: reedy synchronized php-1.20wmf2/extensions/PageTriage/includes/PageTriageUtil.php
23:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wgShowExceptionDetails to true for testwiki and test2wiki'
23:25 logmsgbot: awjrichards synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07, take 3
23:15 logmsgbot: reedy synchronized php-1.20wmf1/extensions/PageTriage/includes/PageTriageUtil.php
23:15 logmsgbot: reedy synchronized php-1.20wmf2/extensions/PageTriage/includes/PageTriageUtil.php
23:00 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'wgShowExceptionDetails = false'
22:57 Ryan_Lane: restarting glusterd processes on virt1-5
22:56 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobile resource version'
22:54 Ryan_Lane: upgrading glusterfs on virt1-5
22:49 Ryan_Lane: upgrading glusterfs on labstore1-4
22:48 binasher: running an osc against plwiktionary.recentchanges on master
22:40 paravoid: deleting 14k tmp files from spence's /home/nagios
22:35 logmsgbot: awjrichards synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07
22:34 logmsgbot: awjrichards synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07
22:24 RoanKattouw: chmod 775 /usr/local/apache/common-local/php-1.20wmf2/extensions/PageTriage with dsh as root
22:19 logmsgbot: raindrift synchronized php-1.20wmf1/resources/startup.js 'touch'
22:18 binasher: rebooting nfs2 to new kernel
22:16 logmsgbot: raindrift synchronized wmf-config/InitialiseSettings.php 'enabling PageTriage on enwp'
22:14 logmsgbot: raindrift synchronized php-1.20wmf2/extensions/PageTriage 'Syncing PageTriage to enwp, a la carte'
22:14 logmsgbot: raindrift synchronized php-1.20wmf1/extensions/PageTriage 'Syncing PageTriage to enwp, a la carte'
21:59 mutante: was still upgrading/rebooting amssq* and knsq* hosts on the side (slow,b/c upload squids). expect temp. nagios squid reports tomorrow as well. out for now.
21:44 binasher: moved default resolution for upload from eqiad to pmtpa
21:29 cmjohnson1: shutting down storage3 for troubleshooting
20:37 binasher: attempting a live online schema change for zuwikitionary.recentchanges on the prod master
20:22 LeslieCarr: (above) restarted nagios-wm on spence
20:20 LeslieCarr: restarted irc bot
20:15 binasher: rebooting db45
20:11 binasher: rebooting db1019
18:46 logmsgbot: reedy synchronized php-1.20wmf1/extensions/Collection/Collection.session.php 'head'
18:45 logmsgbot: reedy synchronized php-1.20wmf2/extensions/Collection/Collection.session.php 'head'
18:25 logmsgbot: reedy synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php
18:24 logmsgbot: reedy synchronized php-1.20wmf1/extensions/GlobalBlocking/GlobalBlocking.class.php
18:07 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki to 1.20wmf2
16:16 cmjohnson1: shutting down storage3 to reseat RAID card
15:58 cmjohnson1: Going to power cycling storage3 several times to troubleshoot hardware issue
15:15 RobH: updating firmware on storgae3
14:20 Jeff_Green: stopped cron jobs on storage3 because of RAID failure
12:49 mutante: pushing out virtual host for wikimania2013 wiki. sync / apache-graceful/all
11:18 mutante: continuing with upgrades/reboots in amssq* on the side during the day
11:09 mutante: squids - sq* done. all latest kernel and 0 pending upgrades.
09:27 mutante: rebooting bits varnish sq68-70 one by one..
08:00 mutante: upgrading/rebooting the last couple sq* servers
07:20 binasher: power cycled db45 (crashed dewiki slave)
07:05 logmsgbot: asher synchronized wmf-config/db.php 'db45 is down'
02:25 Tim: on locke: introduced 1/100 sampling for banner impressions, changed filename to bannerImpressions-sampled100.log
02:12 Tim: on locke: moved fundraising logs back where they were
02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed
01:38 Tim: on locke: compressing bannerImpressions.log
01:35 Tim: on locke: moved bannerImpressions.log to archive and restarted udp2log
01:26 Tim: on locke: moved fundraising logs from /a/squid/fundraising/logs to /a/squid so that they will be processed by logrotate

May 6

07:03 apergos: manually rotates udplogs on locke, copying destined_for_storage3 off to hume:/archive/emergencyfromlocke/ (jeff, this note's for you in particular)
06:36 apergos: bringing up storage3 with neither /a nor /archive mounted, saw "The disk drive for /archive is not ready yet or not present" etc on boot, waited a long time, finally skipped them
06:12 apergos: and powercycling the box instead. grrrr
06:05 apergos: rebooting storage3: we have messages like May 6 05:45:12 storage3 kernel: [465081.410025] Filesystem "dm-0": xfs_log_force: error 5 returned. in the log, and the raid is unaccessible, megacli doesn't run either
02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed

May 5

09:37 mutante: squids - upgrading in the sq5x range (upload)
08:53 apergos: disabling modcompress temporarily for lightty on dataset2 (live hack), let's see what that does as far as it dying. could be issue similar to http://redmine.lighttpd.net/issues/2391
06:45 mutante: squids - upgrading sq44,48 (upload)
05:23 mutante: squids - finishing a couple reboots in the sq7x range
03:04 binasher: rebooting db1006 as well
03:04 binasher: rebooting db1038, kernel uptime scheduler chaos
02:00 logmsgbot: LocalisationUpdate failed: git pull of extensions failed
00:21 logmsgbot: reedy synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php

May 4

23:46 logmsgbot: reedy synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php
23:45 logmsgbot: reedy synchronized php-1.20wmf1/extensions/GlobalBlocking/GlobalBlocking.class.php
22:35 logmsgbot: aaron synchronized php-1.20wmf2/includes/filerepo/backend/FSFileBackend.php 'deployed a807624'
22:34 LeslieCarr: clearing varnish cache and reloading varnish on mobile
21:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php
21:13 logmsgbot: reedy ran sync-common-all
20:18 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Fix typo (cswikquote vs cswikiquote)'
20:06 logmsgbot: asher synchronized wmf-config/db.php 'setting s2 writable'
20:05 binasher: performing mysql replication steps for s2 master switch to db52
20:04 logmsgbot: asher synchronized wmf-config/db.php 'setting s2 read-only, db52 (still ro) as master, db13 removed'
19:49 logmsgbot: asher synchronized wmf-config/db.php 'setting db52 weight to 0 in prep for making new s2 master'
19:32 binasher: powering off db24
18:08 LeslieCarr: reloaded mobile varnish caches and purged them
18:02 Ryan_Lane: gerrit upgrade is done
17:55 Ryan_Lane: starting gerrit
17:32 Ryan_Lane: installing gerrit package on manganese
17:28 Ryan_Lane: adding gerrit 2.3 package to the repo
17:25 Ryan_Lane: shutting down gerrit so that everything can be backed up
16:45 apergos: lighty on dataset2 is running under gdb in screen session as root, if it dies please leave that alone (or look at it if you want to investigate)
16:26 notpeter: turning off db30 (former s2 db, still on hardy, will ask asher what to do with it) to test noise in DC
15:50 mutante: rebooting sq67 (bits)
15:42 mutante: going through sq7x servers (text), full upgrades
15:32 notpeter: removing srv281 from rending pool until we figure out what's going on with it
15:23 notpeter: putting srv224 back into pybal pool
15:09 notpeter: removing srv224 from pybal pool for repartitioning
14:56 notpeter: putting srv223 back into pybal pool
14:50 mutante: going through sq6x (text), full upgrades
14:08 notpeter: removing srv223 from pybal pool for repartitioning
14:02 notpeter: putting srv222 back into pybal pool
13:50 notpeter: removing srv222 from pybal pool for repartitioning
13:43 notpeter: putting srv221 back into pybal pool
13:30 notpeter: removing srv221 from pybal pool for repartitioning
13:16 mutante: going through sq80 to sq86 (upload), full upgrade & reboot
12:56 mutante: maximum uptime in the sq* group down to 171 days, so we have like a month now for the rest. stopping upgrades for the moment being.
12:54 notpeter: starting script to move /usr/local/apache to /a partition on all remaing non-imagescaler apaches
12:47 mutante: (just) new kernels & reboot - sq45,sq49 (upload)
12:30 mark: Sending ALL non-european upload traffic to eqiad
12:23 mutante: (just) new kernels & reboot - sq63 to sq66 (209 days up)
12:06 mutante: dist-upgrade & kernel & reboot - sq42,sq43 - rebooting upload squids one by one
11:48 mutante: powercycling srv266 one more time, but now creating RT for it, once already showed CPU issue before it was reinstalled recently
11:13 apergos: restarted lighty on dataset2 ... about ... half an hour ago. stupid case sensitivity
10:02 apergos: tossed knsq1 through 7 from squid_knams dsh nodegroups file, prolly lots more cleanup where that came from
09:34 mutante: dist-upgrade/kernel/reboot: sq37, sq41. rebooting upload squid sq41
08:49 mutante: dist-upgrade & new kernel & reboot: sq33, sq36
07:47 mutante: preemptive rebooting of sq* servers identified as having > 200 days of uptime
02:22 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Fri May 4 02:22:42 UTC 2012
02:13 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Fri May 4 02:13:58 UTC 2012
00:20 logmsgbot: raindrift synchronizing Wikimedia installation... :
00:18 logmsgbot: raindrift synchronizing Wikimedia installation... : Syncing the PageTriage extension, but only enabling on testwiki
00:08 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Adding 'fr' to language codes for mobile feedback'
00:06 maplebed: moved ms1-3 from the production cluster to the test cluster

May 3

23:29 LeslieCarr: restarting networking on sq55
23:29 logmsgbot: tstarling synchronizing Wikimedia installation... :
23:27 LeslieCarr: restarting networking on sq54
23:24 LeslieCarr: restarting networking on sq53
23:21 LeslieCarr: restarting networking on sq52
23:16 LeslieCarr: restarting networking on sq51
21:30 notpeter: removing srv220 from pybal pool for repartitioning
21:29 LeslieCarr: switching asw-a4-sdtpa from single uplink to lag
21:19 notpeter: putting srv219 back into pybal pool
21:14 logmsgbot: asher synchronized wmf-config/db.php 'setting wgDefaultExternalStore to cluster23'
21:09 logmsgbot: asher synchronized wmf-config/db.php 'reverting cluster23 change'
21:05 logmsgbot: asher synchronized wmf-config/db.php 'setting wgDefaultExternalStore to cluster23'
21:02 binasher: about to move ES writes to cluster23
20:47 notpeter: removing srv219 from pybal pool for repartitioning
20:37 logmsgbot: reedy synchronized php-1.20wmf2/extensions/Collection/Collection.templates.php
20:37 logmsgbot: reedy synchronized php-1.20wmf1/extensions/Collection/Collection.templates.php
19:50 binasher: restarted profiling collector post parser.php livehack and stats.db removal
19:45 notpeter: starting script to move /usr/local/apache to /a partition on all non-imagescaler, non-jobrunner apaches
19:42 logmsgbot: aaron synchronized php-1.20wmf2/includes/parser/Parser.php 'live-hack out template profiling...again.'
19:40 logmsgbot: aaron synchronized php-1.20wmf1/includes/parser/Parser.php 'live-hack out template profiling...again.'
19:31 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Revert $wgDefaultUserOptions[enotifwatchlistpages] = 1'
19:20 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 36316 - Set Add pages I edit to my watchlist to true by default for new users'
19:15 logmsgbot: reedy synchronized wmf-config/CommonSettings.php '$wgDefaultUserOptions[enotifwatchlistpages] = 1'
19:09 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable show update markers for some more of the larger wikis'
19:00 paravoid: powercycling all of sq51-sq62, hanged due to 209 days uptime
18:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36092 - Activation of flood flag on vec.wikipedia.org'
18:43 paravoid: powercycling sq59; inaccessible via either SSH or serial due to load
18:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36183 - Fix namespace alias on Hindi Wikipedia'
18:38 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36171 - Imports from Wikibooks'
18:34 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36344 - Remove file upload facility on Gujarati wikipedia'
18:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36344 - Remove file upload facility on Gujarati wikipedia'
18:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36344 - Remove file upload facility on Gujarati wikipedia'
18:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36386 - cswikiquote user group changes'
18:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36480 - Create namespace Comments: in Greek Wikinews'
17:44 RobH: db1029 ssd test items removed, can go back to normal service via asher
17:43 notpeter: returning mw58 to pool
17:34 RobH: shutting down db1029 for ssd card testing removal per rt 2766
17:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36320 - Set $wgShowUpdatedMarker back to true on ptwiki'
17:18 notpeter: removing mw58 from pool for more testin'
17:16 LeslieCarr: reloaded and purged varnish cache for mobile in eqiad
17:03 notpeter: mwm59 out of apache pool. using it for some testing
16:16 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36359 - Add namespace 102 to $wgContentNamespaces on ptwiki Bug 36360 - Add namespace 102 to $wgNamespacesToBeSearchedDefault on ptwiki'
16:08 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 36460 - Enable chunked uploads as opt-in user preference'
16:06 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 31406 - Set $wgUseMathJax = true on Wikimedia wikis'
15:12 notpeter: chris is taking down search1-12 to replace with new search nodes
15:05 mutante: powercycling srv266
13:49 mark: Built new wikimedia-base 1.00 package, stripped of most stuff now handled by Puppet, and inserted it into the lucid-wikimedia and precise-wikimedia APT repositories
10:33 mutante: starting container-auditor on ms-be3
08:42 logmsgbot: ariel synchronized php-1.20wmf2/LocalSettings.php 'job runners don't have /home mounted'
08:16 Nemo_bis: siebrand: job queue stuck, on en.wiki jumped from o to 37k in the last ~36h
04:52 jeremyb: fixed complaints of beta simplewiki appearing in #cvn-simplewikis on freenode on the labs side. details
04:00 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:47 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:38 Tim: fixed scap, was failing on the remote side due to mwversionsinuse exiting with status 1 due to /home/wikipedia/common not existing on apaches
02:21 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:21 Tim: aborted scap and re-ran with fanout=5 instead of 30, since nfs1 CPU was maxed out
02:14 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:04 logmsgbot: aaron synchronized multiversion/activeMWVersions 'deployed r115116'
02:00 logmsgbot: LocalisationUpdate failed (php-1.20wmf2) at Thu May 3 02:00:13 UTC 2012
02:00 logmsgbot: LocalisationUpdate failed (php-1.20wmf1) at Thu May 3 02:00:12 UTC 2012

May 2

23:56 logmsgbot: aaron synchronized multiversion/ 'deployed svn HEAD'
23:41 maplebed: started swift old-object-deleter on ms-be3
23:28 maplebed: update - roan takes the blame
23:28 logmsgbot: raindrift synchronized wmf-config/InitialiseSettings.php 'Aborting todays PageTriage deployment'
23:22 maplebed: swift is recovered; ~20 minutes of impaired service. cause unknown, but the swiftcleaner looks likely.
23:18 RoanKattouw_away: Scap tried to push two new source trees to php-1.20wmf1-* and php-1.20wmf2-* , causing full disks. Cleaning up now
23:13 LeslieCarr: restarting nagios bot
22:59 logmsgbot: raindrift synchronizing Wikimedia installation... :
22:49 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/ 'contact us change'
22:48 logmsgbot: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/ 'contact us change'
21:43 logmsgbot: asher synchronized wmf-config/db.php 's2: pulling db30, raising weights on new hosts'
21:02 ^demon: finished database maintenance on db9.reviewdb
20:24 hashar: hashar: updated TestSwarm to distribute tests to Firefox 12 users.
20:12 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Re-pushing for srv219 and srv220
20:07 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
20:04 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved special, wikimedia, wikiquote, wikiversity, and wiktionary wikis to 1.20wmf2
19:59 logmsgbot: asher synchronized wmf-config/db.php 'adding dbs 52,53,57 to s2 at lower weights'
19:55 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
19:47 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: metawiki to 1.20wmf2
19:40 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: commonswiki to 1.20wmf2
19:36 preilly: fix for PHP Warning: in_array() expects parameter 2 to be array, string given in /usr/local/apache/common-local/php-1.20wmf1/extensions/MobileFrontend/skins/SkinMobile.php on line 156
19:36 logmsgbot: preilly synchronized php-1.20wmf2/extensions/MobileFrontend/skins/SkinMobile.php 'fix php notice for in_array'
19:35 logmsgbot: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/skins/SkinMobile.php 'fix php notice for in_array'
19:34 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved all wikibooks to 1.20wmf2
19:25 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved enwikibooks to 1.20wmf2
19:21 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved sourceswiki to 1.20wmf2
19:20 logmsgbot: preilly synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
19:11 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved all wikisource sites to 1.20wmf2
19:03 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved all wikinews sites to 1.20wmf2
19:00 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'only remove images for DIGI'
18:51 logmsgbot: asher synchronized wmf-config/db.php 'added ES cluster23 to templateOverridesByCluster but not activating'
18:48 binasher: creating a blobs_cluster23 ES shard table for all active projects
18:31 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikisource to 1.20wmf2
18:25 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikinews to 1.20wmf2
18:24 RobH: updating dns
18:20 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikinews to 1.20wmf2
18:09 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'only remove images for DIGI'
17:57 logmsgbot: preilly synchronized php-1.20wmf2/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
17:56 logmsgbot: preilly synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
17:46 K4-713: updated production civicrm to r1726
17:36 logmsgbot: aaron synchronized php-1.20wmf2/includes/specials/SpecialContributions.php 'Deployed 799998c3a160ef6dd3b926b7d6fec223682b788c'
17:30 logmsgbot: preilly synchronized php-1.20wmf2/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
17:28 logmsgbot: preilly synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ 'zero weekly carrier test'
17:14 logmsgbot: catrope synchronized php-1.20wmf2/skins/vector/ 'Deploying 7260cc5fe4071e03241378ba1a48bc0b6f188948'
16:51 RoanKattouw: Changing docroot/bits/skins-1.19 and other 1.19 symlinks to point to the 1.20wmf1 tree instead. This is needed because we're still getting requests for magnify-clip.png at the 1.19 URL from cached HTML
16:16 notpeter: starting innobackupex from db1040 to db1022 for new s6 snapshot slave
15:31 notpeter: no nagios bot, kicking nagios on spence
15:04 RobH: shutting down mw64 for hw test per rt 1890
15:03 RobH: bellin crashed, unresponsive to ssh or serial console
14:43 mark: Built varnish for precise as 3.0.2-2wm5 and imported it into APT repository precise-wikimedia
11:52 mark: Started distribution upgrade of server stafford from Lucid to Precise
10:41 mutante: refreshLinks.php - started it once again in a screen on hume, just for s1. last cron failed with "mwscript command not found"?? well now it is there again and running
10:09 mark___: Started distribution upgrade of server sockpuppet from Lucid to Precise
09:20 mutante: upgrading bugzilla to 4.0.6
08:43 mutante: kaulen: installing various upgrades (apache,mysql,cron,php-wikidiff2,...)
08:40 logmsgbot: hashar synchronized php-1.20wmf2/includes/GitInfo.php 'Fix Special:Version for 1.20wmf2 (commit ae12df0 , bug 36361 )'
08:20 hashar: cherry-picked ae12df0 commit to 1.20wmf2 since there are mobilefrontend commits pending.
02:35 logmsgbot: LocalisationUpdate completed (1.20wmf2) at Wed May 2 02:35:51 UTC 2012
02:32 K4-713: updated production civicrm to r1723
02:13 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Wed May 2 02:13:30 UTC 2012
01:01 notpeter: starting innobackupex from db57 to db53 for new s2 slave for the one zillionth time

May 1

22:28 logmsgbot_: asher synchronized wmf-config/db.php 'returning db45'
22:23 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db45, last coredb on prior fb mysql build'
22:17 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Enable doublepage on test2wiki'
22:11 binasher: upgraded percona-toolkit on coredbs to 2.1.1 - now with the potential to run online schema changes on tables without single column unique keys!!
21:39 binasher: created an ops db on all core mysql shards
21:00 notpeter: reinstalling db53. this time with correct raid!
20:40 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Fixing mailto links on mobilefrontend feedback form to properly populate subject lines'
19:32 LeslieCarr: reverting vrrp mastership of row a to cr2-eqiad
19:29 LeslieCarr: switching vrrp mastership of row a to cr1-eqiad
18:32 logmsgbot_: awjrichards synchronized wmf-config/InitialiseSettings.php 'Make testwiki use mobile domain for URLs'
18:28 LeslieCarr: making routing change, higher risk
17:51 Ryan_Lane: make that virt0
17:51 Ryan_Lane: switching the session cache back to filesystem on virt1, since it isn't working properly with memcache
17:29 maplebed: kicking nagios to check a change to fix the mobile LVS alert
17:25 logmsgbot_: nikerabbit synchronized php-1.20wmf2/extensions/TranslationNotifications/ 'Deploying TranslationNotifications code'
17:08 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki to 1.20wmf2
16:27 notpeter: starting innobackupex from db1034 to db53 for new s2 slave
16:27 notpeter: starting innobackupex from db57 to db52 for new s2 slave
16:03 notpeter: rebuilding db52 and db53 as s2 slaves
15:47 logmsgbot_: asher synchronized wmf-config/db.php 's1: raising db59,60 weights, pulling db52/53 for reuse'
09:23 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'hewiki account creation high throttle limits'
04:04 Tim: on all apaches, running "chmod -R a+rX /usr/local/apache/common-local/" to clean up after killed rsyncs which left files unreadable
02:23 logmsgbot_: LocalisationUpdate completed (1.20wmf2) at Tue May 1 02:23:29 UTC 2012
02:21 logmsgbot_: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/specials/SpecialMobileFeedback.php
02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Tue May 1 02:14:06 UTC 2012
02:06 logmsgbot_: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/specials/SpecialMobileOptions.php
01:51 Ryan_Lane: bringing up all labs instances with a 60 second lag
01:40 Ryan_Lane: rebooting virt0
01:35 Ryan_Lane: rebooting virt3
01:33 logmsgbot_: preilly synchronized php-1.20wmf1/extensions/MobileFrontend/HtmlFormatter.php
01:26 Ryan_Lane: rebooting virt5
01:18 Ryan_Lane: rebooting virt4
01:03 Ryan_Lane: rebooting virt2
00:51 LeslieCarr: restarted swift-container-auditor on ms-be5
00:38 logmsgbot_: tstarling synchronizing Wikimedia installation... :
00:26 Tim: removed large syslogs from mw60 and ran sync-common
00:18 Tim: on mw60 there was an actual directory at /usr/local/apache/common/php where a symlink should have been. fixed

April 30

23:58 logmsgbot_: aaron synchronized php
23:44 RoanKattouw: Started Apache back up on mw60
23:39 RoanKattouw: Running scap-1 on the Apaches with dsh
23:38 RoanKattouw: Moved /home/catrope/php-1.19 to /home/wikipedia/lazy-backups/php-1.19
23:38 Reedy: mediawiki.org to 1.20wmf2
23:37 logmsgbot_: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mw.org to 1.20wmf2
23:35 RoanKattouw: Strike that, instead moving /home/w/common/php-1.19 to /home/catrope/php-1.19
23:34 RoanKattouw: Removing /home/w/common/php-1.19 , NFS might freak out a bit
23:31 RoanKattouw: Removed php-1.19 from mw60 , synced it, and restarted Apache
23:28 RoanKattouw: Synced docroot and purged varnish for static-1.20wmf2, bits seems to be working for 1.20wmf2 now
23:27 RoanKattouw: mw60 has full disk, stopping Apache for now
22:50 Ryan_Lane: rebooting virt5
22:42 Ryan_Lane: rebooting virt3
22:35 Ryan_Lane: rebooting virt4
22:28 Ryan_Lane: rebooting virt1
22:23 Ryan_Lane: bringing down all instances (yay gluster)
21:12 pgehres: re-enabled Jenkins jobs on Aluminium after db1008 reboot
21:11 pgehres: CiviCRM back to normal after db1008 reboot
21:07 Jeff_Green: db1008 gets kernel update and reboot
21:00 pgehres: put CiviCRM on Aluminium in maintenance mode for db1008 reboot
20:59 logmsgbot_: reedy synchronized php-1.20wmf2/resources/startup.js 'touch'
20:57 pgehres: disabled all Jenkins jobs on Aluminium in prep for db1008 reboot
20:50 Jeff_Green: db1025 and storage3 get new kernels and reboot
20:28 notpeter: restarting, once again, innobackupex from db1034 to db57 for new s2 slave after fenari crash killed my screen
20:24 Reedy: Running ddsh -F30 -cM -g mediawiki-installation -o -oSetupTimeout=10 '/usr/bin/scap-1' in the hope it syncs all the files that would be nice to be on the app servers
20:18 logmsgbot_: reedy synchronized php-1.20wmf2/cache/ 'Synching whole cache directory'
19:59 notpeter: restarting nagios to get rid of some old checks
19:57 Jeff_Green: payments cluster gets kernel updates and reboots
19:55 logmsgbot_: reedy synchronizing Wikimedia installation... : Rebuiild l10n for 1.20wmf2
19:49 logmsgbot_: reedy synchronized wmf-config/ExtensionMessages-1.20wmf2.php 'Syncing file'
19:49 logmsgbot_: reedy synchronized php-1.20wmf2/LocalSettings.php 'Pushing LocalSettings.php'
19:48 paravoid: upgraded & rebooted ssl3001, ssl3002, ssl3003
19:45 logmsgbot_: reedy synchronizing Wikimedia installation... : Pushing out new symlinks etc, moving test2wiki to 1.20wmf2
19:30 logmsgbot_: reedy synchronized php-1.20wmf2 'Syncing 1.20wmf2 live hack revisions'
19:28 logmsgbot_: reedy synchronized php-1.20wmf2 'Syncing 1.20wmf1 live hack revisions'
19:26 logmsgbot_: reedy synchronized php-1.20wmf2 'Syncing 1.20wmf2 for deployment'
19:18 Reedy: Syncing php-1.20wmf2 files from NFS to apaches. Likely to upset NFS (or the uplink for the switch nfs is on) for a little while...
19:14 paravoid: rebooting ssl1004
19:06 paravoid: rebooting ssl1003
19:00 paravoid: rebooting ssl1002
18:59 notpeter: starting innobackupex from db1034 to db57 for new s2 slave
18:50 paravoid: rebooting ssl1001
18:42 Jeff_Green: grosley gets new kernel + reboot
18:35 Jeff_Green: aluminium gets kernel update, yayyyyyyy!
18:34 paravoid: pooled back ssl1; depooling ssl3 and rebooting
18:29 binasher: rebooting mw45 for kernel upgrade
18:27 Jeff_Green: power cycling aluminium which faceplanted
18:22 binasher: rebooting mw45
18:21 notpeter: rebuilding db57 again, this time with more correct raid level!
18:19 logmsgbot_: asher synchronized wmf-config/db.php 'adding db59,60 to s1 with low weights'
18:16 paravoid: depooled & rebooting ssl1
18:09 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Sanity run after script changes.
18:00 logmsgbot_: aaron synchronized multiversion
17:58 logmsgbot_: reedy synchronized php-1.20wmf1/includes/MagicWord.php 'https://gerrit.wikimedia.org/r/6135'
17:44 logmsgbot_: aaron synchronized wikiversions.cdb
17:43 AaronSchulz: updating multiversion code
08:34 mutante: reinstalling srv266
08:08 mutante: upgraded mw1,mw2,mw35
07:59 mutante: reinstalling srv206
07:50 mutante: upgrading mw36
07:37 apergos: powercycling srv266, had this message on mgmt console: Severity: Non Recoverable, SEL:CPU Machine Chk: Processor sensor, transition to non-recoverable was asserted
07:22 mutante: installing upgrades on srv212
07:19 apergos: reinstalled srv284, seems to be up now
07:17 mutante: powercycled mw8
02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Mon Apr 30 02:13:59 UTC 2012

April 29

20:13 apergos: srv206 won't run puppet, see syslog, clearing out the yaml file didn't help, since it's not urgent I'm leaving it for tomorrow
19:51 Ryan_Lane: depooling ssl3004
19:51 Ryan_Lane: removed the ipv6 addresses from maerlant and added them to ssl3001, then restarted nginx
19:50 Ryan_Lane: repooling ssl3001
19:46 apergos: powercycled mw60, same reason as the rest
19:12 apergos: power cycled mw48 and mw52 (hung just like the others)
18:05 apergos: sll3002 and 3003 were rebooted and are the entire ssl esams pool right now
18:02 apergos: ok the ssl300x situation: ssl3001 is now disabled in the pybal conf file on fenari; it is picking up the ipv6and4labs tmplate and I don't know if that's right, anyways nginx doesn't want to bind to one of those addresses. ssl3004 isn't reachable or pingable even via mgmt but at leasy lvs sees it's gone
16:34 apergos: powercycling the ssl300x.esams hosts. 212 days of uptime... (and 3001 had gone out to lunch)
12:34 mutante: and finally mw1, so just leaving mw1102 and mw60 for having other issues for a while (->Nagios)
12:22 mutante: check_all_memcached recovered, but still same treatment for mw10 and 11 (8 and 15h ago)
12:15 mutante: powercycling mw32,mw33,mw44,mw46 one by one, they were all frozen and went down between like 17 and 24 hours ago approx.
12:07 mutante: powercycling mw30
02:56 paravoid: rebooting ssl2 (has 214 days uptime)
02:47 paravoid: powercycled ssl3
02:13 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sun Apr 29 02:13:58 UTC 2012

April 28

22:53 Reedy: Job queue logs on gdash seem to have stopped on the 26th...
22:29 logmsgbot_: reedy synchronized php-1.20wmf1/includes/EditPage.php 'https://gerrit.wikimedia.org/r/6088'
21:52 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php
21:51 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php
21:12 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php
21:10 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php
21:09 logmsgbot_: reedy synchronized common/php-1.20wmf1/extensions/cldr/LanguageNames.body.php 'more debugging'
20:51 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/cldr/LanguageNames.body.php 'Add debugging'
20:49 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'Add debuglog group for language code not being a string'
19:04 logmsgbot_: reedy synchronized php-1.20wmf1/includes/ExternalEdit.php 'https://gerrit.wikimedia.org/r/6077'
19:03 logmsgbot_: reedy synchronized php-1.20wmf1/includes/api/ApiParse.php 'https://gerrit.wikimedia.org/r/6076'
02:24 Ryan_Lane: rebooting all mediawiki boxes that have uptimes affected by the bug are being rebooted at 8 minute intervals
02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sat Apr 28 02:14:14 UTC 2012
01:33 paravoid: powecycled mw29
01:21 paravoid: powercycled mw38
00:17 notpeter: db12 is sooooo sloooooow, starting innobackupex from db1017 to db60 for new s1 slave

April 27

22:15 paravoid: upgraded ssl4 to nginx 0.7.65-5wmf1 and added it back to the pool
21:45 paravoid: rebooting ssl4 after upgrading (incl. a kernel update)
20:00 notpeter: starting innobackupex from db1040 to db1022 for new eqiad s6 snapshot slave, again
19:59 notpeter: starting innobackupex from db12 to db60 for new s1 slave, again
19:58 notpeter: starting innobackupex from db1017 to db59 for new s1 slave, again
19:49 paravoid: de-pooling ssl4
19:30 mutante: test - added new gerrit interwiki prefix for SAL/wikitech - gerrit:6002
19:14 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Fix rights for afttest and afttest-hide groups'
18:25 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'Cleanup enotif related settings'
18:24 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Set wgEnotifWatchlist to true for all wikis. Leaving wgShowUpdatedMarker set to false for all the big wikis'
16:50 logmsgbot_: reedy synchronized wmf-config/CommonSettings.php 'Simplify enotif code'
16:45 notpeter: starting innobackupex from db1040 to db1022 for new eqiad s6 snapshot slave
16:45 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'wgEnotifWatchlist defaulting to true. Big wikis explicitly set to false'
12:25 mutante: fixing integration.mw testswarm and applying fixed erb template by hashar
04:35 Tim: added an account for myself on observium
04:22 logmsgbot_: tstarling synchronized wmf-config/mc.php 'increased wgMemCachedTimeout from 500ms to 3000ms for bug 35900'
02:13 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Fri Apr 27 02:13:51 UTC 2012
00:12 Ryan_Lane: upgrading gluster on all instances
00:09 Ryan_Lane: upgrading gluster on labstore1-4

April 26

23:46 logmsgbot_: asher synchronized wmf-config/db.php 'raising db58 weight'
23:09 Reedy: Recreated resources directory symlinks in bits docroot
21:21 LeslieCarr: started deletion script on ms-be4
19:20 notpeter: restarting puppet on db59
19:18 Ryan_Lane: made LiquidThreads disabled by default on labsconsole, now users must add the special string to a page to enable it there.
19:18 Ryan_Lane: enabled NewUserMessage on labsconsole
19:06 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Add group permissions settings for AFTv5'
18:33 logmsgbot_: catrope synchronizing Wikimedia installation... : Deploy AFTv5 updates
17:17 LeslieCarr: reloaded varnish on mobile caches
14:19 notpeter: cleaned log space on search1017 and search1018 and started lucene
14:04 notpeter: stopping lucene on search1017 and 1018 to take that out of the equation
13:57 mutante: installing some (security) upgrades on fenari (apt,cron,samba,...)
13:54 notpeter: restartin lucene on search1017 and search1018
13:27 logmsgbot_: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Narayamon tewiki bug 33480'
13:23 logmsgbot_: nikerabbit synchronized php-1.20wmf1/extensions/Narayam/ 'Updating Narayam'
13:03 notpeter: (re)starting innobackupex from db1017 to db59 for new s1 slave
12:56 mark: Created precise-wikimedia APT distribution
08:27 mark: Power cycled mw40
06:57 binasher: restart pybal on amlvs1 with bgp disabled
06:57 binasher: restarted pybal on amlvs2 with bgp enabled
06:47 binasher: restarting pybal on amslvs2
06:26 binasher: shifting all traffic out of esams
02:14 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Thu Apr 26 02:14:03 UTC 2012
01:42 Ryan_Lane: starting mysql on db46
01:40 Tim: on professor: restarted udpprofile collector
01:37 Ryan_Lane: powercycling db46
01:33 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db46, host down'
00:44 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php

April 25

22:14 LeslieCarr: restarted swift-container-auditor on ms-be3
21:55 RobH: pushing dns update for scs-c1-eqiad and ps1-c#-eqiad
21:22 LeslieCarr: reloading varnish on mobile caches cp1041 cp1042 cp1043 cp1044
21:21 LeslieCarr: clearing mobile varnish cache
19:38 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'Attempted fatal fix'
19:33 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/Math/ 'Deploying 4c9e7dbe761c798ce15d7e2acef829a1582c058b'
19:14 notpeter: starting innobackupex from db12 to db59 for new s1 slave, per mr. feldman's directions
18:56 notpeter: starting innobackupex from db1017 to db60 for new s1 slave
18:49 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/FeaturedFeeds/SpecialFeedItem.php 'Deployed 4fb14a7b2ca9be715b820a9847d999f21c7d2cfc'
18:36 logmsgbot_: aaron synchronized php-1.20wmf1/img_auth.php 'Deployed f7e49bd71bd8356751242c5ce1cbae076a27cf7a'
18:10 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moving all remaining wikis to php-1.20wmf1
17:07 LeslieCarr: reloaded mobile varnish configs
17:06 LeslieCarr: purging mobile cache
16:40 LeslieCarr: starting delete script on ms-be3
16:14 RobH: done moving mgmt connections and serial connections in s8-eqiad for now
16:05 RobH: reshuffling cables in eqiad for serial and mgmt connections in a8, this may affect all eqiad mgmt and serial connections for the next 5 minutes
15:29 hashar: hashar: gallium: MySQL had issues most probably because of the mysql configuration snippets. https://gerrit.wikimedia.org/r/5796 might solve that.
14:03 mutante: gallium - don't start puppet unless the erb template fix for mysql has been merged
13:52 mutante: gallium stopped puppet, moved log_slow_queries config, re-setting up mysql again
13:41 mutante: gallium/testswarm - back up after mysql upgrade and issue starting the service
13:36 mutante: gallium - dpkg-reconfigure mysql-server-5.1, mysql does not start right
13:27 mutante: running apt-get upgrade on gallium
12:29 mark: Sending US, Brazil, Indian traffic to upload.eqiad
11:39 mutante: running authdns-update to add analytics100x and labsdb100x mgmt names
05:35 paravoid: powercycled lvs6, was dead and not responding to serial
03:43 logmsgbot_: asher synchronized wmf-config/db.php 'adding db58 to s7 as a new slave with a low weight'
03:24 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db58'
03:23 logmsgbot_: asher synchronized wmf-config/db.php 'adding db58 to s7 as a new slave with a low weight'
02:28 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Wed Apr 25 02:28:47 UTC 2012
02:14 logmsgbot_: LocalisationUpdate completed (1.19) at Wed Apr 25 02:14:46 UTC 2012
00:02 binasher: profiling collector was pegged at 100% cpu and graphs were turned to swiss cheese due to a bad stats call in 1.20, now fixed

April 24

23:59 binasher: powering off db16
23:55 binasher: streaming hot backup of db1041 to db58 (building a new s7 slave)
23:48 logmsgbot_: aaron synchronized php-1.19/includes/Setup.php 'Hacked out session request stats.'
23:46 logmsgbot_: aaron synchronized php-1.20wmf1/includes/Setup.php 'Deployed 42fcd43299246ecd1b265fcfcdd01a60319cf378'
23:19 AaronSchulz: Running 'mwscriptwikiset maintenance/populateRevisionSha1.php all.dblist' on hume
22:43 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Enabled file change journal on wikis using the new backend config.'
22:20 AaronSchulz: Tables added
22:18 binasher: rebooting db16 with updated kernel. it's probably still hopeless (dimm errors)
22:18 AaronSchulz: Creating the filejournal table on all wikis
21:59 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Switched commonswiki to the new backend config format.'
21:48 logmsgbot_: asher synchronized wmf-config/db.php 'pulling db16, memory errors'
20:13 apergos: re-enabled replication via cron on ms7, it should catch up within an hour or so
20:10 binasher: reimaged db58 with fixed raid setup, imaging db59
19:51 notpeter: starting innobackupex from db1034 to db57 for new s2 slave
19:50 Ryan_Lane: repooling ssl3001
19:28 Ryan_Lane: depooling ssl3001
18:18 LeslieCarr: deploying to frontend
17:48 notpeter: deploying new squid conf to cp1001 frontend. is just a udp2log port change.
17:19 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Using newer backend for shared repos for testwiki, test2wiki, and mediawikiwiki.'
16:55 logmsgbot_: nikerabbit synchronized wmf-config/CommonSettings.php 'Translate extension configuration changes'
11:54 apergos: after much cursing and kicking zfs, a manual snapshot replication is running in screen as root on ms7 to ms8, expect it to take at least a day
11:44 mark: Sending all non-european upload traffic back to pmtpa to prepare for eqiad varnish storage rework
08:56 mutante: updated blog theme per guillaume (April commits)
08:05 apergos: temporarily disabled automatic zfs replication from ms7 -> ms8, cleared out space on ms8, catching up by hand
04:00 Ryan_Lane: powercycling ssl1
02:47 logmsgbot_: aaron synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'Fixed notice spam.'
02:45 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/MobileFrontend/MobileFrontend.body.php 'Fixed notice spam.'
02:37 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Restructed filerepo a config a bit; nothing changed yet.'
02:28 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Tue Apr 24 02:28:47 UTC 2012
02:15 logmsgbot_: LocalisationUpdate completed (1.19) at Tue Apr 24 02:15:00 UTC 2012
00:15 logmsgbot_: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/stylesheets/common.css '0be2dc1288361c51f91533f1f77e78d9279b86e0'
00:13 logmsgbot_: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/common.css 'r115019'

April 23

23:35 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Bumpging MobileFrontend resource version'
23:07 logmsgbot_: awjrichards synchronizing Wikimedia installation... :
23:02 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Add code for new URL scheme based on version_compare() logic'
22:51 logmsgbot_: awjrichards synchronizing Wikimedia installation... : MobileFrontend updates per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#23_April.2C_2012
22:33 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'bumping mobile frontend resource version'
21:49 logmsgbot_: catrope synchronized php-1.20wmf1/resources/mediawiki/mediawiki.js 'Deploy 6e55a770b26b17b8fc9b5b4fe943dcc2867df4f3'
21:27 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'Deploy 93d470b'
20:41 mutante: neon - upgraded libssl, started icinga after adding monitor group
20:32 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo/FileRepo.php 'Disabled write checks in the cleanDir() function.'
20:31 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo/FileRepo.php 'Disabled write checks in the quickImport/quickPurge functions.'
19:43 logmsgbot_: catrope synchronized php-1.20wmf1/includes/specials/SpecialListgrouprights.php 'Deploy 047543b6805a268c8d689a7a1ce12ec545ef79a9'
18:43 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'touch'
18:43 logmsgbot_: reedy synchronized flaggedrevs.dblist 'Seems I never added ukwiki to the dblist... Oh well'
18:32 logmsgbot_: aaron synchronized wikiversions.dat
18:32 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Moved enwiki to 1.20wmf1
18:28 logmsgbot_: aaron synchronized php-1.20wmf1/includes/specials/SpecialContributions.php 'Deployed 72969cf8c9a403430c8c93fc20ab3118328c4d9c'
17:06 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Made mediawikiwiki use the newer backend config.'
14:33 notpeter: stopping puppet on cp1041 as well
14:17 notpeter: temp stopping puppet on cp1042-1044
13:09 mutante: powercycling frozen mw25, looks like mw21 above but no console output to paste here
13:07 mutante: fix puppet run on spence by removing searchidx1 resources from db9 (was in weird state being in site but also decommissioned)
11:23 mutante: mw21 powercycling mw21 - it died with this http://etherpad.wikimedia.org/mw21
10:55 mutante: force-reload ircecho on manganese to make gerrit-wm rejoin #mediawiki
10:48 hashar: banned CIA bots from #mediawiki IRC channel. It started spamming us with notifications from KDE and mandriva projects. See http://permalink.gmane.org/gmane.science.linguistics.wikipedia.technical/60905
10:30 mutante: searchidx1 was in site.pp and decom.pp at the same time. breaks puppet runs on spence. cannot override local resource. removing from site
10:27 mutante: killed a couple morebots processes on wikitech and it came back by itself :p

April 21

02:29 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sat Apr 21 02:29:40 UTC 2012
02:15 logmsgbot_: LocalisationUpdate completed (1.19) at Sat Apr 21 02:15:20 UTC 2012

April 20

22:03 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Switched test2wiki to use the new LocalRepo config style.'
22:01 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Switched testwiki to use the new LocalRepo config style.'
21:52 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'Added NFS backends for local/shared repos; they are not used yet.'
21:12 LeslieCarr: starting swift delete script on ms-be2
20:02 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo/file/LocalFile.php 'deployed c77fbd394cda701758ad4523113f567bff7ede66'
19:45 apergos: powercycled mw4, it was unresponsive to pings and via mgmt
18:48 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/ArticleFeedbackv5 'Apply https://gerrit.wikimedia.org/r/5449'
18:48 logmsgbot_: catrope synchronized php-1.19/extensions/ArticleFeedbackv5 'Apply https://gerrit.wikimedia.org/r/5449'
18:07 notpeter: restarting nginx on ssl1002 and ssl1004 as they are not back up
18:01 logmsgbot_: awjrichards synchronizing Wikimedia installation... :
17:31 logmsgbot_: catrope synchronized wmf-config/CommonSettings.php 'Remoev wgArticleFeedbackv5OversightEmails override that was messing things up'
17:15 notpeter: stopping puppet on locke and emery. just to be safe...
17:11 RoanKattouw: Fixed ownership of /h/w/common/php-1.20wmf1/cache/l10n , should be owned by l10nupdate but was owned by reedy
17:01 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 36124 - Deploy ProofreadPage extension on test2'
17:00 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Giving test2wiki moar namespaces'
16:11 mutante: add missing memcached servicegroup to nagios, restarted
15:10 mutante: apache error log on stafford has ruby exceptions re: phusion_passenger
15:01 mark: Converted OSPF directly connected redistributed routes from type 2 to type 1
14:51 mutante: starting swift-container-auditor on ms-be1
14:30 mark: Disabled down-pref of Tampa AS2828 routes
13:14 logmsgbot_: demon synchronized php-1.20wmf1/maintenance/backupTextPass.inc 'Pushing out Idb58ce27 for Ariel/Chris for dumps'
13:10 mark: Sending India upload traffic to upload-lb.eqiad
12:40 mark: Disabled iptables firewalls on internal prod swift cluster servers as it's dropping packets
12:22 mutante: restarted pdns on ns2
11:19 mark: Sending US upload traffic to eqiad as well
10:27 mark: Sending Brazil upload traffic to eqiad
08:39 hashar: Gave up running l10nupdate script it has some file permissions issues. Opened bug 36119 and bug 36120
08:36 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Fri Apr 20 08:36:53 UTC 2012
08:27 logmsgbot_: LocalisationUpdate completed (1.19) at Fri Apr 20 08:27:36 UTC 2012
08:13 hashar: rerunning l10nupdate for bug 34938
08:02 hashar: running l10nupdate for bug 34938
06:27 pgehres: re-eanabled PayPal on donatewiki and wmfwiki and resumed queue consumer on Aluminium
05:32 LeslieCarr: flushing mobile varnish cache
04:56 pgehres: disabled paypal on donatewiki and disabled queue consumer for duration of PayPal outage
02:33 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Fri Apr 20 02:33:02 UTC 2012
02:23 logmsgbot_: LocalisationUpdate completed (1.19) at Fri Apr 20 02:23:57 UTC 2012
01:47 logmsgbot_: awjrichards synchronizing Wikimedia installation... : r114983 on wikis still running 1.19

April 19

23:33 binasher: powercycled es1004
21:08 Jeff_Green: changed nagios contactgroup fundraising from tfinc/awrichards --> jgreen
21:03 RoanKattouw: Scap is broken in some weird way, it just stops running after the scap1-skins step. Doesn't run scap-1 (which does the actual sync), doesn't log "sync done", doesn't update graphite
21:01 logmsgbot_: catrope synchronizing Wikimedia installation... : Running scap again, AFTv5 is acting up
19:34 logmsgbot_: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
19:29 RoanKattouw: Running scap to deploy AFTv5 updates, and running AFTv5 schema changes on enwiki at the same time
18:50 logmsgbot_: catrope synchronized wmf-config/InitialiseSettings.php 'Set wmgArticleFeedbackv5OversightEmails for enwiki'
18:25 notpeter: nothing obvious in logs on db1005, starting mysql
18:15 notpeter: rebooting db1005. it's dead, jim.
17:52 RoanKattouw: Running schema changes for AFTv5 on testwiki
17:51 Jeff_Green: discovered nfs1 had ~1K redundant iptables rules, removed extras and reloaded
17:42 Jeff_Green: discovered sanger had ~7K redundant iptables rules, removed extras and reloaded
13:56 mutante: adding refreshLinks cron jobs to hume per RT-2355 (via puppet). if there should be any performance issues, schedule can be changed like <cluster>@<hour> in mediawiki.pp (and/or remove mediawiki::refreshlinks from hume and clear out the jobs of user mwdeploy)
08:35 mutante: emery - "udp2log_age" says some squid logfiles have not been written to in 6 hours, but from the filenames looks like this isnt a reason to worry, right
07:49 mutante: stat1 - this also needs udp2log stuff fixed. currently Could not find class misc::udp2log::udp-filter
07:47 mutante: gilman - what's up with it? closes SSH, does not like mgmt pass, was running jenkins but broken
07:43 mutante: owa[1-3] They dont have real puppet freshness issues, it's rather firewalling and the snmp traps
02:30 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Thu Apr 19 02:30:33 UTC 2012
02:21 logmsgbot_: LocalisationUpdate completed (1.19) at Thu Apr 19 02:21:31 UTC 2012

April 18

22:55 LeslieCarr: updating exim4.conf on mchenry to not allow old ranges
21:03 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files:
20:47 logmsgbot_: catrope synchronized php-1.20wmf1/resources/startup.js 'touch'
20:46 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/SyntaxHighlight_GeSHi/ 'Deploying GeSHi fix https://gerrit.wikimedia.org/r/#change,4949'
20:04 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: specieswiki and foundationwiki to 1.20wmf1
19:56 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/LiquidThreads/classes/Hooks.php 'Avoid fatals on invalid title in API'
19:51 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All *wiki wikis to 1.20wmf1
19:25 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikiquote and wikiversity projects to 1.20wmf1
19:22 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikibooks to 1.20wmf1
19:18 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikinewses to 1.20wmf1
19:07 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wikisources to 1.20wmf1
19:07 logmsgbot_: catrope synchronized wmf-config/mc.php 'Swap out 10.0.2.251 (down) with 10.0.11.24 (spare). This is the last spare, there are now NO SPARES LEFT in mc.php'
19:00 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: All wiktionaries to 1.20wmf1
18:57 logmsgbot_: aaron synchronized php-1.20wmf1/extensions/LiquidThreads/classes/Dispatch.php 'Added type hint for better fatals'
18:44 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikiversity to 1.20wmf1
18:43 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikiquote to 1.20wmf1
18:41 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikibooks to 1.20wmf1
18:40 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikinews to 1.20wmf1
18:39 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwiktionary to 1.20wmf1
18:32 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: enwikisource to 1.20wmf1
17:20 logmsgbot_: catrope synchronized docroot/bits/ 'Remove static-1.00 again'
16:57 logmsgbot_: catrope synchronized docroot/bits 'Add docroot/bits/static-1.00 for testing'
16:41 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Set wmfUseRevSha1Columns to true for enwiki'
13:30 mutante: applied a patch to etherpad that allows admins to delete pads
12:53 mutante: restarting/fixing etherpad issue
11:08 mark: Sending European bits traffic back to esams
02:30 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Wed Apr 18 02:30:50 UTC 2012
02:21 logmsgbot_: LocalisationUpdate completed (1.19) at Wed Apr 18 02:21:49 UTC 2012
02:13 logmsgbot_: catrope synchronized php-1.20wmf1/README 'Dummy sync to capture which hosts time out on sync-file'
00:52 K4-713: updated production civi to r1631
00:41 Ryan_Lane: adding interface for per-project sudo on OpenStackManager

April 17

23:36 K4-713: updated production civi to r1628
23:12 logmsgbot_: catrope synchronized wmf-config/InitialiseSettings.php 'Fixes for cswiktionary changes per Danny B'
22:49 RoanKattouw: That was bug 34885 of course
22:43 logmsgbot_: catrope synchronized php-1.19/extensions/WikiEditor/ 'Deploy fix for bug 348885'
22:38 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/WikiEditor/ 'Deploy fix for bug 348885'
22:05 K4-713: updated prod civi to r1625
21:51 logmsgbot_: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero needed for carrier testing'
21:42 logmsgbot_: aaron synchronized wmf-config/CommonSettings.php 'use $wmgUseMathJax'
21:41 logmsgbot_: aaron synchronized wmf-config/InitialiseSettings.php 'use $wmgUseMathJax'
21:38 K4-713: queue consumer re-enabled
21:35 K4-713: updated prod civi to r1623
21:32 logmsgbot_: aaron synchronized wmf-config/InitialiseSettings.php
21:29 logmsgbot_: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/templates/ApplicationTemplate.php 'ec7c5cc'
21:28 logmsgbot_: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r114947'
21:24 logmsgbot_: aaron synchronized wmf-config/InitialiseSettings.php 'Enabled $wgUseMathJax on mediawikiwiki'
20:33 logmsgbot_: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.flagging.php
20:26 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/VisualEditor/ 'Deploy VisualEditor beta warning'
19:52 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Bump mobile resource version'
19:52 logmsgbot_: awjrichards synchronized php-1.20wmf1/extensions/MobileFrontend/
19:51 logmsgbot_: awjrichards synchronized php/extensions/MobileFrontend/
19:50 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'bumping mobile frontend resource version'
19:01 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php
18:55 logmsgbot_: reedy synchronized php-1.19/includes/api/ApiQueryBlocks.php 'r114941'
18:53 logmsgbot_: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version'
18:47 binasher: returning sq68
18:36 binasher: pulling sq68 from pybal for a bit
18:29 RoanKattouw: Did a graceful restart of all job runners using dsh about 15 mins ago
18:29 RoanKattouw: Restarted morebots
07:44 apergos: morebots test
07:44 apergos: restarted varnish service manually a bit a go on sq67 and sq70, the cron job didn't seem to have gone off. restarted morebots too while I was at it
03:37 Jeff_Green: dist-upgrade arsenic
03:29 LeslieCarr: restarting varnish on arsenic again
03:12 maplebed: started a script to delete old objects on ms-be1 for swift truncated object cleaning
02:53 Jeff_Green: dist-upgrade on strontium
02:43 LeslieCarr: restarted varnish on arsenic
02:26 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Tue Apr 17 02:26:40 UTC 2012
02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Tue Apr 17 02:17:24 UTC 2012
01:44 LeslieCarr: restarting varnish on niobium
00:52 LeslieCarr: reloading amslvs4
00:27 logmsgbot_: aaron synchronized php-1.20wmf1/includes/filerepo 'deployed 552ff0f482f3e65e9795fe304dd810e9ae1b03fb'

April 16

23:31 logmsgbot_: catrope synchronizing Wikimedia installation... : Now with a touch of the specific WikiEditor.i18n.php file
23:11 logmsgbot_: catrope synchronizing Wikimedia installation... : Hopefully fixing the WikiEditor messages this time, now with MessagesEn.php touch
23:07 logmsgbot_: catrope synchronizing Wikimedia installation... : Hopefully fixing the WikiEditor messages this time
22:58 logmsgbot_: awjrichards synchronizing Wikimedia installation... : Updating MobileFrontend to r114934
22:49 logmsgbot_: catrope synchronizing Wikimedia installation... : Need to run scap for this WikiEditor change, contains i18n changes
22:39 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/WikiEditor/ 'Deploy WikiEditor revert'
20:53 logmsgbot_: catrope synchronized php-1.20wmf1/extensions/WikiEditor/ 'Actually deploy the recent WikiEditor fixes'
18:58 logmsgbot_: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: Commons Wiki to 1.20wmf1
18:47 logmsgbot_: reedy synchronized php-1.20wmf1/resources/mediawiki/mediawiki.js
18:46 logmsgbot_: reedy synchronized php-1.20wmf1/extensions/WikiEditor
18:37 mutante: manually added iptables nat rules on nfs2
18:13 notpeter: upgrade of udp2log on nfs1/2 complete. should be operating normally now.
17:41 mutante: LDAP on nfs2 warnings - opendj was _just_ started there when puppet was fixed with an unrelated issue
17:38 mutante: restarting opendj on nfs2 because it refused connections
17:08 logmsgbot_: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ 'zero and mobile changes'
16:07 notpeter: upgrading and restarting udp2log on nfs1/2
15:04 mutante: puppet fresh on nfs[12] after removing nonexistent misc::mediawiki-logger class
14:46 mark: Shutdown db24 for memory testing by Chris
13:27 mark: Sending European bits traffic back to pmtpa
12:24 mark: Sending European bits traffic back to esams
12:06 mark: Testing sess_leak_fix2 patch with a snapshot varnish build on cp3001
11:56 Reedy: Ran ddsh -cM -g mediawiki-installation -o -oSetupTimeout=30 -- "cd /usr/local/apache/common && sudo -u mwdeploy ln -s php php-1.18" to create symlink for php-1.18
11:51 Reedy: Killing php-1.18 again
11:48 mutante: sq34 - System halted! Error: Internal Storage Slot, powered down, -> RT
11:45 logmsgbot_: reedy synchronized php-1.18/ 'Symlink php-1.18 back to php (our current main running version) as lots of requests on bits are for 1.18 resources'
11:44 mutante: sq34 was broken and died when connecting to mgmt, powercycling
11:37 mutante: nfs1 - Could not find class misc::mediawiki-logger for nfs1
10:57 Krinkle: bits.wikimedia.org back up, mark fixed it.
10:33 Krinkle: bits.wikimedia.org serving Error 503 Service Unavailable on all load.php requests for mediawiki.org and nl.wikipedia.org, maybe more
09:45 logmsgbot_: reedy synchronized wmf-config/InitialiseSettings.php 'Set wgEnableJavaScriptTest to true for test2wiki'
02:26 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Mon Apr 16 02:26:58 UTC 2012
02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Mon Apr 16 02:17:57 UTC 2012

April 15

17:35 logmsgbot_: reedy synchronized php-1.20wmf1/includes/api '/me whistles'
17:20 logmsgbot_: reedy synchronized php-1.20wmf1/includes/api
02:25 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sun Apr 15 02:25:58 UTC 2012
02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Sun Apr 15 02:17:19 UTC 2012

April 14

18:14 mark: Shifting european bits traffic back from esams to pmtpa, session leak is still there
17:08 mark: Shifting european bits traffic back from pmtpa to esams
15:31 mark: Reverted varnish to 3.0.2-2wm4 on cp3001; the race condition patch did not fix the problem
14:56 mark: Sending European bits traffic to pmtpa for testing
13:52 mark: Backported varnish bug #897 patch to varnish 3.0.2, testing a snapshot build on cp3001
11:37 mark: Raised session_max to 300000 (runtime) on cp3001/cp3002
05:58 K4-713: re-enabled the queue consumer on aluminium
02:26 logmsgbot_: LocalisationUpdate completed (1.20wmf1) at Sat Apr 14 02:26:55 UTC 2012
02:17 logmsgbot_: LocalisationUpdate completed (1.19) at Sat Apr 14 02:17:34 UTC 2012
02:16 K4-713: updated prod civi to r1616
01:36 K4-713: turned off queue consumption on prod civicrm
01:36 K4-713: updated production civicrm to r1614

April 13

20:53 mark: Rebooting cp3002
20:37 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114889'
17:54 Jeff_Green: created new repo operations/debs/wikimedia-search-qa to stay within package naming conventions
17:31 notpeter: upgrading udplog on locke to 1.8-2 and restarting, etc
17:27 Jeff_Green: created new operations/debs/search-qa repo for packaging search qa scripts
17:17 notpeter: restarting udp2log on emery
12:53 notpeter: restopping puppet on locke/emery
12:09 mark: Deploying varnish 3.0.2-2wm4 and enabling persistent storage on all even numbered eqiad upload varnish hosts
11:46 mark: Imported varnish 3.0.2-2wm4 into the Wikimedia APT repository
02:48 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:39 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Fri Apr 13 02:39:01 UTC 2012
02:20 logmsgbot: LocalisationUpdate completed (1.19) at Fri Apr 13 02:20:35 UTC 2012
01:40 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/templates/ApplicationTemplate.php 'fix robots file'
01:18 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/ 'zero and mobile changes'
01:06 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'fix html formatter'
00:56 logmsgbot: tstarling synchronizing Wikimedia installation... :
00:08 Ryan_Lane: rebooting ssl1004

April 12

23:39 logmsgbot: tstarling synchronizing Wikimedia installation... :
23:08 logmsgbot: preilly synchronizing Wikimedia installation... : zero rated mobile access changes and mobile frontend updates
21:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34923 - namespace required for PORTAL'
19:46 notpeter: stopping puppet on locke and emery
18:41 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying ArticleFeedbackv5 updates
18:22 Reedy: Ran namespaceDupes against bewiki
18:17 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34024 - Install ArticleFeedback on es.wikinews'
18:15 logmsgbot: reedy synchronized php-1.19/resources/startup.js 'touch'
18:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34024 - Install ArticleFeedback on es.wikinews'
18:11 Reedy: Created AFT tables on eswikinews
17:54 RoanKattouw: Running schema updates for ArticleFeedbackv5 on enwiki
17:46 RoanKattouw: Deploying ArticleFeedbackv5 updates to testwiki and rebuilding localization cache
16:54 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Allow bnwiki crats to grant/remove import'
16:49 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35258 - Allow bureaucrats to remove sysop rights on fr.wikipedia'
16:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix imports for wm2012'
16:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35917 - allow transwiki imports on wikimania2012'
16:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35666 - Renaming Namespace Wikisource:Author in gu.wikisource'
16:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35694 - Add enotif on page changes in watchlist (guwiki and source)'
16:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35818 - Change of Armenian Wikipedia namespace'
16:20 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35905 - Change namespaces configuration - pl.wikipedia'
16:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35261 - Add block permissions in rollback on Lusophone Wikipedia'
16:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35823 - Wikijunior and cookbook namespaces for the Vietnamese Wikibooks'
16:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35659 - Set logo for sl.wikiversity'
16:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35853 - Set a non-empty default value for wmgArticleFeedbackBlacklistCategories on WMF wikis'
15:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35878 - Enable e-mail notifications for watchlist (EnotifWatchlist) on tawiki'
15:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35852 - Add a category to $wgArticleFeedbackBlacklistCategories for Portuguese Wikipedia to remove AFT from disambiguation pages'
15:10 mutante: gallium - after files have been deleted/moved, puppet back to normal operation (and new clone directory in Apache)
13:23 mutante: killed puppets on gallium
12:33 mark: repooled ssl1002
12:27 mutante: powercycling frozen ssl1002
12:22 mark: Manually depooled down ssl1002 in pybal
02:24 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Thu Apr 12 02:24:29 UTC 2012
02:15 logmsgbot: LocalisationUpdate completed (1.19) at Thu Apr 12 02:15:54 UTC 2012

April 11

22:37 maplebed: deployed more log filters to emery: gerrit/r4758
21:35 LeslieCarr: restarted nrpe on db10
21:33 LeslieCarr: db1004 puppet is fubar
21:33 LeslieCarr: restarted puppet on db30
21:33 LeslieCarr: restarted puppet on mw1110
19:41 notpeter: reimaging bellin and blondel
19:28 logmsgbot: reedy synchronized live-1.5/ 'fix resources symlinks'
19:23 logmsgbot: reedy synchronized live-1.5/ 'fix resources symlinks'
16:54 notpeter: enabling notifications for eqiad lucene vips
16:31 mark: Sending Canadian upload traffic to the eqiad varnish upload cluster
15:59 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 4 to eqiad. for realz this time!'
15:45 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 1 and prefix pool to eqiad. for realz this time!'
15:31 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 2 to eqiad. for realz this time!'
15:15 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool 3 to eqiad. for realz this time!'
14:40 notpeter: restarting indexer on searchidx2
13:48 logmsgbot: reedy synchronized php-1.20wmf1/extensions/AbuseFilter/special/SpecialAbuseLog.php
13:35 mutante: applied patch-RT-2804.diff to bugzilla per RT:2804 re: XMLRPC content-type verification
12:07 mutante: moved another list: museum-l -> glam (http://lists.wikimedia.org/pipermail/glam/2012-April/000000.html)
11:58 mark: Setup cp1036 with the persistent storage backend
02:26 logmsgbot: LocalisationUpdate completed (1.20wmf1) at Wed Apr 11 02:26:28 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Wed Apr 11 02:17:55 UTC 2012
00:11 LeslieCarr: nagios down

April 10

23:50 RoanKattouw: Removed srv187-189 from /etc/dsh/group/job-runners , their jobrunner class has been commented out in puppet since October
23:31 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'bug 35869 - Add strategywiki as an import source on testwiki'
22:53 RoanKattouw: Trying a graceful restart of the job runner on mw1 by sending SIGHUP to the jobs-loop.sh process
22:53 logmsgbot: catrope synchronized php-1.19/extensions/WikimediaMaintenance/jobs-loop.sh 'r114834'
22:24 logmsgbot: reedy synchronized php-1.20wmf1/extensions/CentralAuth/ 'g4102'
22:23 logmsgbot: reedy synchronized php-1.20wmf1/extensions/AntiSpoof/ 'g4103'
21:20 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling mobile URL template for mediawiki.org (using "mediawikiwiki" this time)'
21:18 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling mobile URL template for mediawiki.org'
21:08 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mediawikiwiki to 1.20wmf1
21:04 logmsgbot: reedy synchronized php-1.20wmf1/extensions/MobileFrontend/javascripts 'minified JS'
20:55 logmsgbot: reedy synchronized docroot/ 'Fix symlinks'
20:45 logmsgbot: reedy synchronized docroot/
20:35 logmsgbot: reedy synchronized docroot/
20:31 logmsgbot: reedy synchronized live-1.5/
20:24 logmsgbot: reedy synchronized php-1.20wmf1/ 'Resyncing for apaches with no space'
20:23 logmsgbot: reedy synchronized live-1.5 'Fix symlinks'
20:18 Reedy: Deleting php-1.18 from all apaches due to lack of space
20:14 logmsgbot: reedy synchronized php-1.20wmf1/extensions/PrefSwitch/ 'PrefSwitch is needed by SimpleSurvey'
19:35 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilding localisation cache for test2/1.20wmf1
19:24 logmsgbot: reedy synchronized wmf-config/ExtensionMessages-1.20wmf1.php 'Sync ExtensionMessages'
19:23 logmsgbot: reedy synchronized php-1.20wmf1/extensions/ 'Would you like some extensions to go with that, sir?'
19:21 LeslieCarr: restarting gmond on db1004 after removing it's 5gig log
19:07 logmsgbot: reedy synchronized php-1.20wmf1/LocalSettings.php 'Push LocalSettings out'
19:05 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: test2wiki to 1.20wmf1
19:00 logmsgbot: reedy synchronized php-1.20wmf1/ 'Pushing files for 1.20wmf1'
18:03 logmsgbot: aaron synchronized wmf-config/swift.php 'Catch e bogus empty file names from listings'
14:17 robh: search in eqiad is being reinstalled, no need to be alarmed (thats a pun!)
14:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wgLanguageConverterCacheType for git deployment later'
11:50 mutante: pxe boot / reinstall cp1029 - cp1036
11:24 mark: Imported varnish 3.0.2-2wm3 into the Wikimedia APT repository
09:30 apergos: restarted slaving on es1003, it will be a bit before it catches up. patience, young nagios
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Tue Apr 10 02:16:58 UTC 2012
01:33 Tim: on sodium: enabling mod_auth on lists.wikimedia.org by running puppet

April 9

23:14 mutante: migrated foundation-l to wikimedia-l (users/passwords/archive urls/settings stay, old mail address & siteinfo redirect)
22:32 logmsgbot: asher synchronized wmf-config/db.php 'returning db12 as enwiki recentchange/watchlist db'
21:39 LeslieCarr: restarted mysql on es1004 and cleared out its disk space
17:49 LeslieCarr: moving es monitoring to nrpe and variables, may cause false pages if i did it wrong :)
17:36 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Bug 35426 - WebFonts on mr.wikisource.org'
14:54 RobH: i killed eqiad search nodes, woooo
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Mon Apr 9 02:17:22 UTC 2012

April 8

08:45 Nemo_bis: Servers have been very slow, almost unresponsive, and network had a drop of ~0.3 Gb/s, at ~8.35-40.
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Sun Apr 8 02:16:58 UTC 2012

April 7

17:55 logmsgbot: reedy synchronized wmf-config/codereview.php 'Remove deferred paths'
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Sat Apr 7 02:16:54 UTC 2012

April 6

22:23 LeslieCarr: deploying new squid config to all squids
22:14 LeslieCarr: added neon into tiertwo of squid allowed hosts
22:13 LeslieCarr: deploying new squid config to amssq35
21:55 LeslieCarr: restarted puppet on spence
21:35 LeslieCarr: moved jenkins_1.458_all.deb to /srv/wikimedia/incoming/ on brewster
21:32 LeslieCarr: restarted squid on brewster
18:27 Ryan_Lane: updating OpenStackManager to r114758 on virt0
17:33 mark: Sending Japanese upload traffic to varnish in eqiad
17:15 mark: Power cycled down host lvs5
16:43 mutante: changed master and started slave on es1004
15:55 mutante: used gerrit create-project to create operations/debs/wikistats.git
14:13 mutante: manganese (gerrit) now sends SSL CA certificate on https, (curl -vvv says verify ok), should resolve RT:2777 and BZ:35709
11:51 mutante: es1004 - rsync was finished, deleted all binlogs from old host, mysqld_safe& , but did not "change master.." and "start slave" (see mail)
11:39 notpeter: restarting lsearchd on search3... again...
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Fri Apr 6 02:17:37 UTC 2012
01:21 Ryan_Lane: updating OpenStackManager to r114757 on virt0
00:18 Ryan_Lane: updating OpenStackManager to r114754 on virt0

April 5

23:49 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Change guwikisource logo to point to the unscaled file instead'
21:46 notpeter: halting db15 for it to await decom
21:39 binasher: started enwiki.revision sha1 migration on db12
21:32 notpeter: restarting lsearchd on search18
21:22 logmsgbot: asher synchronized wmf-config/db.php 'pulling db12, moving enwiki watchlist,recentchange,etc to db53'
21:19 logmsgbot: asher synchronized wmf-config/db.php 'returning db53'
21:17 logmsgbot: py synchronized wmf-config/lucene.php 'pushing all search traffic back to pmtpa'
18:34 Ryan_Lane: updating OpenStackManager to r114746 on virt0
18:19 Ryan_Lane: updating OpenStackManager to r114744 on virt0
16:49 RobH: brewster puppet running again, cisco installs wont work again until i finish puppetizing the files later today
15:41 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search pool4 to eqiad. this is the smaller wikis shard'
15:40 notpeter: pointing search pool4 to eqiad (this is the "smaller languages" shard)
15:14 Rob_H: puppet daemon being halted on brewster, i need to make local test changes to dhcp
14:52 logmsgbot: py synchronized wmf-config/lucene.php 'pushing search prefix pool live in eqiad'
14:51 notpeter: pushing search prefix pool live in eqiad
14:51 mutante: gallium - disabled incompatible GitTool plugin on jenkins and restarted it
14:34 mutante: importing jenkins_1.458_all.deb to wikipedia apt repo and upgrading it on gallium
14:08 apergos: started rsync in screen session as root on es1003 copying snapshot from es1001 to /a/
14:04 andrewbogott: created labs account for cneubauer
14:02 logmsgbot: py synchronized wmf-config/lucene.php 'pointing enwiki search and enwiki.prefix at eqiad'
14:00 notpeter: pointing enwiki and enwiki.prefix at eqiad search cluster
13:48 mutante: gallium - upgraded all pear packages
13:45 mutante: gallium - upgraded phpunit and php_codesniffer via pear (have been installed via pear before, distro outdated)
13:43 mutante: gallium - upgrading pear
13:33 mutante: installing package upgrades on gallium. apache,apt,postgres,php5-*,ruby,...various libs
13:24 logmsgbot: py synchronized wmf-config/lucene.php 'pointing de, fr, ja, es, ru, nl, pl, pt, zh, and sv search at eqiad'
13:21 notpeter: pointing de, fr, ja, es, ru, nl, pl, pt, zh, and sv search at eqiad
12:27 notpeter: search1 and search4 seem to be dead. restarting lsearchd
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Thu Apr 5 02:16:52 UTC 2012
00:33 Ryan_Lane: updating OpenStackManager to r114730 on virt0
00:24 Ryan_Lane: updating OpenStackManager to r114729 on virt0
00:19 Ryan_Lane: updating OpenStackManager to r114728 on virt0
00:12 Ryan_Lane: updating OpenStackManager to r114726 on virt0
00:00 Ryan_Lane: updating OpenStackManager to r114724 on virt0

April 4

22:16 maplebed: deployed (3rd time's the charm!) udp-filter changes to emery for diederik
22:14 logmsgbot: py synchronized wmf-config/lucene.php 'pointing all search back to pmtpa'
22:13 notpeter: flipping all search back to pmtpa (until tomorrow...)
22:00 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback 'r114717'
21:24 cmjohnson1: replacing power cable to psu1 (bottom) es1
21:22 cmjohnson1: replacing power cable to psu1 (top) es1
21:14 logmsgbot: py synchronized wmf-config/lucene.php 'pointing de, fr, and ja search at lvs pool in eqiad for live testing'
21:12 notpeter: moving de, fr, and ja search to eqiad
21:04 cmjohnson1: replacing power cable on labstore2 array psu2 (right side)
21:00 cmjohnson1: replacing power cable on labstore1 array psu1 (left side)
20:57 cmjohnson1: removing power from bottom power supply labstore 2
20:54 cmjohnson1: removing power from top power supply on labstore2
19:44 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Enable wmgArticleFeedbackv5AbuseFiltering on enwiki'
19:40 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Disable wmgArticleFeedbackv5AbuseFiltering on enwiki'
19:14 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114716'
19:12 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Enable wmgArticleFeedbackv5AbuseFiltering on enwiki'
19:04 RobH: dns update for zhen mgmt
18:54 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying AFTv5 update
18:52 logmsgbot: py synchronized wmf-config/lucene.php 'pointing ru, nl, pl, pt, zh, and sv search at lvs pool in eqiad for live testing'
18:51 notpeter: moving ru, nl, pl, pt, zh, and sv search to eqiad
18:27 mutante: nuked /a contents on es1004, started rsync from es1001
18:16 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Add code for wmgArticleFeedbackv5AbuseFiltering'
18:16 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Add wmgArticleFeedbackv5AbuseFiltering, enabled on testwiki only'
17:55 RoanKattouw: Running AFTv5 schema changes on enwiki
17:47 RobH: i didnt crash the site, weeee
17:46 RobH: gracefully restarting apaches
17:46 RobH: pushing out redirects change to apaches for wikipedia.org/com.il redirect to he.wikipedia.org
17:41 binasher: started enwiki.revision sha1 migration on db53
17:38 logmsgbot: asher synchronized wmf-config/db.php 'returning db52, pulling db53'
17:32 RobH: update done, all nameservers still online
17:31 RobH: dns update for wikipedia.org/com.il being resolved
17:08 RoanKattouw: Applying AFTv5 schema change on testwik
15:30 logmsgbot: py synchronized wmf-config/lucene.php 'pointing eswiki search at lvs pool in eqiad for live testing'
15:28 notpeter: pointing eswiki search at eqiad
12:51 mutante: db1007 - add mysql startup via 'update-rc.d mysql defaults'
12:42 apergos: started mysqld on db1007 via /etc/init.d/mysql (this doesn't seem to point to a special fb build, and can't seem to find one on this host, what's up with that?)
12:31 apergos: rebooted bd1007, it was dead in the water (also no helpful messages on console, bah)
11:16 mutante: enabled Renameuser extension on wikitech, renamed tchay per RT request, disabled extension again (it was installed but disabled)
02:19 logmsgbot: LocalisationUpdate completed (1.19) at Wed Apr 4 02:19:03 UTC 2012
01:50 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/FileOp.php 'deployed r114697'
01:39 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'

April 3

23:17 LeslieCarr: updating bgp policies on cr1.sdtpa
22:44 LeslieCarr: reinstalling neon
22:04 maplebed: rolled back changes to emery in udp-filter due to the new binary crashing.
21:50 maplebed: ran /etc/init.d/udp2log reload on emery to enact the puppetted changes
21:41 maplebed: deploying new udp-filter and teahouse filters to emery for diederik
20:13 notpeter: restarting lsearchd on search7. was taosted
18:37 logmsgbot: root synchronized wmf-config/mc.php
18:37 RobH: syncing new mc.php, forgot to check for all three of the servers i took down, opps.
18:28 RobH: shutting down mw28, mw49, & mw58 for rack relocation due to power overload in d2-pmtpa, relocation to d1-sdtpa per rt 2692
17:59 K4-713: Synchronized payments cluster to r114642
17:52 logmsgbot: reedy synchronized php-1.19/extensions/MobileFrontend/
17:38 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix commas in guwikisource namespaces'
17:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix commas in guwikisource namespaces'
16:38 RobH: bringing down srv237 for phase balancing
16:37 RobH: srv230 back in rotation
16:26 RobH: shutting down srv230 for power phase move per rt 2759
16:10 RobH: updating brewster to use new dhcp files for cisco, no more local hackin.
15:53 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35482 - Add Patroller & Autopatroller groups on ml.wikisource'
15:51 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35482 - Add Patroller & Autopatroller groups on ml.wikisource'
15:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35545 - Grant the abusefilter-log-detail right to patrollers on Commons'
15:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35545 - Grant the abusefilter-log-detail right to patrollers on Commons'
15:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35624 - Subject namespace for the Vietnamese Wikibooks'
15:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35603 - Enable Transwiki import on KN:WP'
15:16 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35581 - Closure of nz.wikimedia.org'
15:15 logmsgbot: reedy synchronized closed.dblist 'Bug 35581 - Closure of nz.wikimedia.org'
13:35 Tim: manually reloaded rsyslogd on all apaches
06:16 Tim: deploying limited/split apache syslog (https://gerrit.wikimedia.org/r/#change,4149)
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Tue Apr 3 02:16:32 UTC 2012
00:37 logmsgbot: aaron synchronized php-1.19/includes/Block.php 'deployed r114672'

April 2

23:54 Tim: restarting all apaches with apache-restart-all-hard
23:51 logmsgbot: tstarling synchronized php-1.19/extensions/ConfirmEdit/FancyCaptcha.class.php
23:37 logmsgbot: tstarling synchronizing Wikimedia installation... :
23:36 maplebed: cleared the varnish cache for preilly
23:34 Tim: on all apaches: running logrotate -f and deleting the resulting backup syslog files, to free up disk space
23:32 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r114673'
23:21 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version number'
23:05 logmsgbot: awjrichards synchronizing Wikimedia installation... : Deploying MobileFrontend changes at r114671 per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#2_April.2C_2012
21:43 maplebed: reverted changes to emery's logging due to a broken package in the deploy.
21:30 LeslieCarr: turned down ms7's secondary ethernet port to prevent the flapping (stupid sun boxes)
19:51 maplebed: deploying new udp-filter to emery rt-2501 gerrit/r4120
19:51 notpeter: running authdns-update on dobson
18:30 RobH: brewster puppet daemon stopped, doing local hacks
18:17 RobH: removed old bin files on db1004 and prolly borked it by removing the wrong files
17:54 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php '35436 - Enable Narayam at Hindi Wikipedia'
17:47 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add disable images option for default on zero domain'
17:45 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Bug 35328 - Enable WebFonts for fr.wikisource.org'
17:40 logmsgbot: nikerabbit synchronized php-1.19/languages/Names.php 'I18ndeploy r114656'
17:15 preilly: carrier testing push for DIGI
17:15 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
16:46 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Mon Apr 2 02:16:47 UTC 2012

April 1

02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Apr 1 02:17:22 UTC 2012

March 31

10:22 mutante: srv222,225 were also upgraded but stopping there for now in favor of reinstalls
09:58 mutante: nuked /usr/shared/doc on a couple srv's, hey at least 700MB or something, and yes we really should reinstall with a decent partitioning scheme as M ark said
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 31 02:18:10 UTC 2012

March 30

19:37 hashar: configured jenkins on gallium to use smtp.pmtpa.wmnet as outgoing SMTP server
19:28 RobH: puppet daemon restarted on brewster
18:13 RobH: killing puppet daemon on brewster, i need to hack at local configuration for cisco server stuff
12:56 mutante: db1047 - added system startup for /etc/init.d/mysql
12:47 mutante: powercycling db1047
12:28 mutante: deleted old kernel sources on upgraded srvs for that little extra space during peaks, suggesting to nuke /usr/share/doc if there should be more disk space warnings
10:41 mutante: same for srv223
09:18 mutante: srv224,srv219,srv220, upgrade apache, dist-upgrading w/ kernel, disabling ureadahead, rebooting one by one
08:06 mutante: storage3 - gmond unable to find the metric information for any mysql_* .."module has not been loaded", starting mysql, running puppet ...
07:57 mutante: powercycling storage3
07:03 Tim: running bug 35578 cleanup script in screen on fenari
06:41 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialBlock.php
06:40 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialBlock.php
06:39 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialBlock.php
06:15 Tim: killed vi on fenari owned by awjrichards, locking CommonSettings.php for two days
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 30 02:17:56 UTC 2012
01:13 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove more crap'
01:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove some dupe code'
01:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove wmgUsabilityPrefSwitch'
00:59 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove wmgUsabilityPrefSwitch'
00:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove unused wmgUseUsabilityInitiativeAlpha'

March 29

23:49 logmsgbot: aaron synchronized php-1.19/includes/revisiondelete/RevisionDeleteUser.php 'deployed r114619'
21:20 LeslieCarr: rebooting db47
20:28 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Swap wgUseCommaCount to wgArticleCountMethod'
20:07 notpeter: restarting lsearchd on search2 to del the logfile to end all logfiles
20:05 RoanKattouw: Stopping and starting Gerrit on manganese to apply Chad's change of the -1 text in the DB
20:02 notpeter: restarting lsearchd on search7 to del the logfile to end all logfiles
18:11 logmsgbot: catrope synchronized php-1.19/extensions/ClickTracking/ClickTracking.hooks.php
17:59 RobH: search1021 coming back up, done with tests
17:53 RobH: search1021 coming down for ssd fit test
17:07 notpeter: disabling notifications for search lvs nagios checks for 24 hours to test fix
15:42 notpeter: finished clearning up all pmtpa search hosts. hey look! they all have lots of space now!
15:15 notpeter: restarting lsearchd on search3
15:02 RobH: brewster puppet re-enabled
15:02 RobH: virt1001 pxe boots via dhcp and fails tftp download, i have to hold off on further troubleshooting until i have a network admin
14:47 RobH: did virt1001 wrong, reupdating dns
14:39 RobH: all nameservers still online after udpate
14:37 RobH: updating dns for virt1001 testing
14:29 RobH: stopping puppet runs on brewster so my hacking at the dhcpd.conf file won't get overwritten until I have it working right
14:01 Jeff_Green: restarted varnish on on cp3002 because it was thrashing futiley
13:45 notpeter: rebooting (mostly) down cp3001
13:17 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add participation namespace to metawiki per request'
13:11 notpeter: trimming logs and such on search1-20
09:59 mutante: srv221, disabling ureadahead, installing package upgrades and new kernel, rebooting
09:40 mutante: kill and start lsearchd on search7
09:36 mutante: restarted defunct lsearchd on search6
09:10 mutante: gallium - added demon,hashar,reedy to group jenkins as it's a problem using puppet when users and groups already exist
06:25 mutante: powercycling sq40
06:21 mutante: installed more package upgrades on sodium
05:58 mutante: installed security upgrades on brewster, cadmium, capella (apache,mysql,ruby,apt..)
05:49 mutante: db42 - mysql did not autostart after boot, added using update-rc.d
05:42 mutante: db42 - reboot worked despite the grub warning about unreliable blocklists
05:37 mutante: rebooting db42 to finish upgrades
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 29 02:17:53 UTC 2012

March 28

23:27 Tim: running apt-get upgrade on mw22,mw66,srv193,srv250,srv253,srv236
23:25 Tim: cleaned up stuck apt-get process on srv236
23:22 Tim: cleaned up stuck apt-get processes on mw22,mw66,srv193,srv250,srv253
21:44 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping mobile frontend resrouce version'
21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/application.min.js 'r114576'
21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/banner.min.js 'r114576'
21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114576'
21:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r114576'
21:42 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/application.js 'r114576'
21:42 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/javascripts/banner.js 'r114576'
20:43 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 28 20:43:20 UTC 2012
20:29 notpeter: restarted search1020. nothing conspicuous in logs
19:56 RoanKattouw: Running a patched version of l10nupdate that rebuilds the localization cache
18:49 logmsgbot: catrope synchronizing Wikimedia installation... : Bugfixes for ArticleFeedbackv5, ArticleFeedback and ClickTracking
16:47 cmjohnson1: msw1-d1-pmtpa replacement complete
16:34 cmjohnson1: replacing msw-d1-pmtpa per rt2639
15:36 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bump account creation limit per request on -tech'
15:34 Reedy: srv221 is full
15:34 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bump account creation limit per request on -tech'
14:39 RobH: restarted morebots in screen on wikitech, no longer as catrope, as roan has root on that box
14:36 RobH: got virt1001 to pxe, but dhcp doesnt know how to handle, need subnet details.
14:34 notpeter: lucene hosed on search9 and search15. restarting, then will look after cause
13:14 Jeff_Green: restarting puppet/puppetmaster on stafford to experiment with report settings
02:10 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 28 02:10:34 UTC 2012

March 27

23:12 logmsgbot: tstarling synchronized php-1.19/cache/trusted-xff.cdb
20:19 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33789 - Enable botadmin usergroup on ml.wikipedia'
19:20 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix lezwiki namespace'
19:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove ruwiki arbcom talk from namespaceprotection'
19:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35138 - Create Gujarati Wikisource'
18:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35138 - Create Gujarati Wikisource'
18:22 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35138 - Create Gujarati Wikisource'
18:16 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34351 - Create Wikisource in Belarusian'
18:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34351 - Create Wikisource in Belarusian'
18:10 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34351 - Create Wikisource in Belarusian'
18:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35290 - Create Slovenian Wikiversity'
18:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35290 - Create Slovenian Wikiversity'
17:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Config for lezwiki'
16:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Config for lezwiki'
16:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Config for lezwiki'
16:48 logmsgbot: reedy ran sync-common-all
16:32 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'prep work for new wikis'
16:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34527 - Create a Arbcom namespace on Russian Wikipedia'
16:06 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34527 - Create a Arbcom namespace on Russian Wikipedia'
15:47 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35161 - Incubator configuration updates'
15:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 32825 - Favicon for siwiki'
14:40 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35516 - Add Skin: namespace to MW.org'
08:15 apergos: test you silly morebot

07:59:56 hashar: archived old server admin logs since the old page was too long for my connection to download :-/
06:59:02 apergos: !log powercycled emery, it was unresponsive via the mgmt console and not pingable
02:17:52 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 27 02:17:52 UTC 2012
00:56:51 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114507'
00:55:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/LeaveFeedbackTemplate.php 'r114508'
00:42:50 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bmping resource version for MobileFrontend'
00:41:58 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/beta_common.css 'r114509'
00:37:30 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping MobileFrontend resource version #'
00:36:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/MobileFrontendTemplate.php 'r114507'
00:36:09 logmsgbot: awjrichards[00:36:36] synchronized php/extensions/MobileFrontend/templates/LeaveFeedbackTemplate.php 'r114508'
00:35:50 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/beta_common.css 'r114508'
00:08:55 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114506'

March 26

23:18:17 logmsgbot: awjrichards synchronizing Wikimedia installation... : Syncing MobileFrontend to r114504 changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#26_March.2C_2012
22:44:53 RobH: !log also rolling firmware to ps1-d[1|2|3]-pmtpa
22:28:10 RobH: !log pushing firmware updates to servertechs in sequence: ps1-[a2|a3|a4|a5|b2|b3|b4|b5|c1|c2|c3|d1|d2|d3]-sdtpa, disregard any errors from rebooting alerts
19:55:09 notpeter: !log stopping puppet on search6 and search15 for 24 hours to test new log rotation script
19:19:35 RobH: !log cp1019 memory replaced per rt 2651
19:07:14 apergos: rebooting ms1001 (new kernel)
17:53:34 RobH: cp1019 coming down for memory replacement per rt 2651
17:51:39 RobH: fluorine disk upgrade done, os install pending, details on rt 2350
17:43:48 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'i18ndeploy r114492'
17:36:51 RobH: fluorine coming down for new disks
17:14 notpeter: backingup plwiki.nspart1 index on search7, deleting working copy, and restarting lsearchd. (note: this will probably cause some downtime on some languages while the proc restarts...)
15:18 RobH: db59 has errors, but as it was a fusion io testbed server, it is more than likely tweaked for such, it is not in any rotation
14:54 RobH: db59 shutting down for io card removal per rt 2589
13:37 mutante: while on it, installing a whole bunch of package updates on db42
13:25 mutante: db42 was out of disk , caused by ~5G citations.csv in /tmp, gzipped the file
09:59 mutante: ..and on ms-be-3. running puppet on db59
09:43 mutante: another corrupted .yaml file on ssl2
09:33 mutante: brewster - delete puppet lock file, restart lighttpd, puppet ...
09:05 mutante: brewster was out of disk - deleted lighttpd access.log.1, gzipped access.log
08:24 mutante: on several mw* boxes puppet did not run because .yaml files on the puppetmaster became corrupted. need to delete the $hostname files in /var/lib/puppet/yaml/node on stafford and re-run. puppet bug similar to http://projects.puppetlabs.com/issues/7836
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 26 02:18:03 UTC 2012

March 25

22:26 RobH: row b servertech firmware in eqiad all updated, should clear alarms as they come back online
22:18 RobH: firmware updates on servertechs in row b eqiad, disregard alarms
20:14 RobH: to fellow ops, you can disregard those observium errors, as I caused them
20:13 RobH: firmware updated on all power strips in row a eqiad.
16:22 RobH: ps1-a1-sdtpa firmware update complete
16:15 RobH: updating firmware on ps1-a1-sdtpa
16:14 RobH: ps1-b1-sdtpa firmware updated successfully
16:14 RobH: ps1-a1-eqiad firmware updated successfully
16:09 RobH: updating firmware on ps1-s1-eqiad and ps1-b1-sdtpa
16:07 RobH: updated firmware successfully on ps1-a8-eqiad, if it has observium alarms now then there are bigger issues.
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 25 02:17:21 UTC 2012
00:59 LeslieCarr: admin down asw-a-eqiad xe-1/1/2 and cr2-eqiad xe-5/0/0 due to framing errors causing packet loss and lacp sporadic timeouts. source of the issue

March 24

19:46 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'Following a performance regression reported on wikitech-l, added merciless profiling to ExtMobileFrontend::DOMParse()'
19:46 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'Following a performance regression reported on wikitech-l, added merciless profiling to ExtMobileFrontend::DOMParse()'
17:35 mark: Migration from br1-knams to cr2-knams completed.
17:09 mark: Migrated second knams-esams dark fiber link from br1-knams to cr2-knams
16:36 mark: Corrected MTU setting on cr2-knams's AMS-IX interface
16:20 Reedy: Some european users reporting oruting issues
16:01 mark: Cleared OSPF session between csw1-esams and csw2-esams which magically made some internal routes reappear
15:40 mark: Brought up AMS-IX ipv4 BGP sessions
15:30 mark: Brought up AMS-IX ipv6 BGP sessions
15:25 mark: Moved AMS-IX connection to cr2-knams:xe-1/1/0
15:22 mark: Shutdown all AMS-IX BGP sessions
15:06 mark: Disabled BFD on OSPF3 between cr2-knams and csw1-esams
14:49 mark: Moved AS6908 and AS1257 PIs to cr2-knams
14:18 mark: Brought up AS13030 and AS1299 BGP sessions on cr2-knams
13:57 mark: Shutdown AS1299 BGP session on br1-knams
13:14 mark: Established full iBGP mesh with added router cr2-knams. cr2-knams now has full Internet connectivity.
12:48 mark: Moved fiber from br1-knams:e1/2 to cr2-knams:xe-0/0/0
12:44 mark: Disabled br1-knams:e1/2 (DF leg 1 to esams)
12:43 mark: Rack mounted and powered up cr2-knams
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 24 02:17:02 UTC 2012

March 23

23:49 logmsgbot: reedy synchronized php-1.19/extensions/MoodBar/ 'r114466'
23:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34005 - Change uploader flag configuration on Russian Wikipedia'
23:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34005 - Change uploader flag configuration on Russian Wikipedia'
23:09 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wmgAutopromoteOnce'
23:08 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wgAutopromoteOnce'
23:07 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wgAutopromoteOnce'
23:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wmgAutopromoteOnce empty arrays'
22:24 RobH: scs-a1-eqiad back online
21:58 RobH: scs-a8-eqiad coming down for re-grounding
19:51 RobH: all power strips in eqiad are now properly grounded
18:12 maplebed: removed ms1 and most of ms2 from the production swift rings. no effect expected.
18:04 logmsgbot: asher synchronized wmf-config/db.php 'returning db32, pulling db52 for migration'
16:44 RobH: cp1019 in middle of firmware update, please dont touch
16:44 RobH: cp1017 memory error seems ot have cleared post firmware update, will keep an eye on it for the rest of the day
16:09 RobH: raid rebuilding on magnesium, however swift stuff is kind of black box mystery right now to me, need Ben to review magnesium later for that
15:53 RobH: magnesium coming back online
15:44 RobH: shutting down magnesium for disk swap
15:37 RobH: firmware updating on cp1017, no one touch it please
15:30 RobH: db1020 can go back into whatever rotation Asher wants it in
15:29 RobH: db20 memory error on raid controller resolved with firmware updarte
06:39 logmsgbot: tstarling synchronized php-1.19/includes/filerepo/file/LocalFile.php 'r114442'
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 23 02:18:35 UTC 2012
01:55 mutante: deleting puppet report files older than 60hours on stafford to free disk space

March 22

23:30 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
23:18 RobH: db1020 firmware still updating, will check on it later tonight. offline until then
22:19 notpeter: all 3 dns servers are responding to digs after reload
22:10 notpeter: pushing a new zone file to add 2 more search-related vips for eqiad
20:52 notpeter: stopping puppet on brewster temporarily
20:25 notpeter: rebuilding search1015 and 1016 for disk shuffles
20:01 RobH: magnesium goign down and up again, troubleshooting the disks
19:47 apergos: rebooting ms1002, had stuck rsyncs, and kswapds at 100% cpu, weirdness like "ls /export/upload/wikipedia/am/0/00" hanging.
18:08 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
15:45 RobH: search 1015 and search1016 back up with added disks
15:08 RobH: shutting down search1015 & search1016 for hdd additions
14:45 RobH: db1020 still offline, requires firmware update on raid controller per rt 2621, will perform later today
14:33 logmsgbot: reedy synchronizing Wikimedia installation... :
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 22 02:17:47 UTC 2012
01:14 K4-713: Re-enabled the donations queue consumer in Jenkins
00:28 binasher: started enwiki.revision alter on db32
00:26 binasher: disabled lvm snapshots and puppet on db32 for revision sha1 alter
00:24 logmsgbot: asher synchronized wmf-config/db.php 'pullin db32 for revision alter'

March 21

22:27 ^demon|away: wmf-deployed extensions now r/o in SVN
21:52 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php
21:27 Ryan_Lane: bringing up all instances on virt3
21:08 cmjohnson1: swapped 2 DIMMS in virt3 (b2 and b5)
21:01 Ryan_Lane: shutting down virt3 to replace dimms
20:47 ^demon: /trunk/phase3 is now r/o in SVN
20:21 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable prefswitch'
20:10 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Set $wgArticleFeedbackv5OversightEmails on enwiki'
18:59 maplebed: rebooted ms-be3 after it crashed.
18:51 binasher: brought db24 back up after hang, and reslaving, but leaving out of db.php. just replicating until a replacement s2 snapshot host is built
18:51 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying ArticleFeedbackv5 update
18:46 logmsgbot: asher synchronized wmf-config/db.php 'returning db36'
18:35 logmsgbot: asher synchronized wmf-config/db.php 'pulling db24, failing hw'
18:03 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php
18:01 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Temporarily disable ShortUrl on testwiki because we think it might conflict with ArticleFeedbackv5'
17:59 K4-713: updated and synchronized payments cluster to r114382
17:47 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
12:25 notpeter: disabling notifications for search-pool1
08:58 mutante: rebooting ms-be4
08:37 mutante: stopped/started lsearchd on search9
08:05 mutante: ms-be4 down but cant powercycle it yet..Unable to establish LAN session / ipmitool /ipmi_mgmt
07:58 mutante: restarted lsearchd on search3 and 9
05:23 logmsgbot: tstarling synchronized php-1.19/includes/parser/CoreParserFunctions.php
05:23 logmsgbot: tstarling synchronized php-1.19/includes/parser/Parser.php
05:23 logmsgbot: tstarling synchronized php-1.19/includes/parser/StripState.php
05:22 logmsgbot: tstarling synchronized php-1.19/tests/parser/parserTests.txt
03:51 mutante: added "lez" to langlist and running authdns-update, for lez.wikipedia per RT-2665
03:29 mutante: magnesium - shutting down, has existing RT-2669 to replace disk
03:18 mutante: magnesium - "..drive on port B of the Srial ATA controller is operating outsde of normal specifications.. Strike F1 key to continue"..
03:16 mutante: powercycling magnesium - down and just "init: tty4 main" on mgmt, frozen
03:10 mutante: running puppet on aluminium
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 21 02:18:10 UTC 2012
01:06 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114342'
00:25 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'fix beta logo'
00:03 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'fix beta logo'

March 20

23:19 Ryan_Lane: fixing the zero redirect
22:46 logmsgbot: reedy synchronized wikipedia.dblist 'test'
22:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/api/ApiQueryExtracts.php 'r114319'
22:09 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumping resrouce version # for MobileFrontend'
21:54 logmsgbot: awjrichards synchronizing Wikimedia installation... : Pushing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#20_March.2C_2012
21:46 binasher: stopped eqiad bits servers from udplogging to emery, packet loss is back to zero
20:59 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page only for mswiki'
20:17 binasher: killed enwiki.revision sha1 migrator (upgrade-1.19wmf1-2.php). after db36 completes, will run the rest by hand
19:52 Ryan_Lane: pushing change for zero.wikipedia.org to redirect to the english message
19:41 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page'
19:16 cmjohnson1: pulling disk 5 on virt1 for reseating
18:34 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header of landing page'
18:02 pgehres: flipped Template:CC-status on wmfwiki since credit cards are still disabled on payments.wikimedia.org
17:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35193 - Enable sub page feature in Telugu Wikisource'
17:49 notpeter: restarting lsearchd on search10
17:30 logmsgbot: aaron synchronized php-1.19/includes/filerepo/file/LocalFile.php 'deployed r114285'
17:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Revert that then'
17:01 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Test something for sewikimedia'
16:42 logmsgbot: reedy synchronized wmf-config/abusefilter.php 'Bug 35355 - Reset of permissions for Hindi Wikipedia'
16:40 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove hiwiki botadmin from whGRoupsRemoveFromSelf'
15:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35296 - Namespace names changing on Komi Wikipedia'
15:50 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35296 - Namespace names changing on Komi Wikipedia'
15:49 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35296 - Namespace names changing on Komi Wikipedia'
15:30 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35161 - Incubator configuration updates'
15:25 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 31209 - Enable the WikiLove extension for incubator'
14:57 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove more group dupes'
14:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35355 - Reset of permissions for Hindi Wikipedia (hiwiki)'
14:14 logmsgbot: reedy synchronizing Wikimedia installation... : sscapping for r114268
14:08 logmsgbot: reedy synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'r114268'
09:12 mutante: new URL pointing to Wikipedia Education Program - http://education.wikimedia.org
08:59 mutante: several srv's said they were unable to contact NTP server
08:57 mutante: apache-graceful-all to deploy changed redirects.conf
08:53 logmsgbot: tfinc synchronized wmf-deployment/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'Fixes file pages showing data charge warnings'
07:42 mutante: running authdns-update after adding education.wm for redirect RT:2634
06:21 logmsgbot: tstarling synchronized php-1.19/includes/User.php
05:35 logmsgbot: asher synchronized wmf-config/db.php 'pulling db36 durring db migration'
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 20 02:17:55 UTC 2012
00:25 logmsgbot: awjrichards synchronizing Wikimedia installation... : Reverting MobileFrontend to r113973
00:15 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFrontend.body.php 'r114221'
00:07 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Enabling zero rated mobile access everywhere'
00:05 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Bumpging version number for MobileFrontend resources'

March 19

23:54 logmsgbot: awjrichards synchronizing Wikimedia installation... : Redoing accidentally aborted scap, Pushing changes to MobileFrontend per https://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#19_March.2C_2012
23:51 logmsgbot: awjrichards synchronizing Wikimedia installation... : Pushing changes to MobileFrontend per https://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments#19_March.2C_2012
23:35 AaronSchulz: fixed a few files, on commons and other wikis, with empty oi_archive_name values even though the file was on NFS
23:20 Ryan_Lane: restarting all nginx servers
23:20 Ryan_Lane: added a new proxy to the ssl configuration to temporarily proxy access to wikimania videos being transcoded
21:38 binasher: creating "ops" db and related grants on prod db clusters 2-7 to prep rollout of ishmael / pt-digest beyond s1
21:17 binasher: started enwiki.revision sha1 alter on production side
20:57 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Removing debugging code from MobileFormatter'
20:54 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Addin more debugging code to MobileFormatter'
20:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Addin more debugging code to MobileFormatter'
20:36 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Addin more debugging code to MobileFormatter'
20:31 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/MobileFormatter.php 'Adding debugging code to MobileFormatter'
20:07 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/jquery.articleFeedback/jquery.articleFeedback.js 'r114176'
19:41 Ryan_Lane: bringing virt3 instances back up
19:33 binasher: deploying new frontend squid conf to add support for mf_useformat cookie [rt 2645]
19:18 K4-713: CiviCRM 4.1.1 update script finished executing on prod.
19:12 Ryan_Lane: shutting down virt3 for memory reseating
19:09 K4-713: Started the CiviCRM 4.1.1 update script on prod.
19:08 mark: Rebuilding RAID arrays on brewster
18:58 K4-713: Put production civicrm / drupal instance in offline mode for upgrade
18:54 K4-713: Disabled all production CiviCRM Jenkins jobs, for CiviCRM upgrade.
18:54 cmjohnson1: brewster HDD replacement complete
18:42 mark: Shutting down brewster for HDD replacement
18:26 Jeff_Green: killed kill-slow-queries on db1008 for the duration of the civicrm upgrade
18:19 logmsgbot: nikerabbit synchronized php-1.19/includes/Linker.php 'i18ndeploy r114160'
18:19 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/resources/ext.webfonts.fontlist.js 'i18ndeploy r114160'
18:14 mark: Running smartctl -t long /dev/sdb on brewster
12:58 logmsgbot: hashar synchronized php-1.19/includes/SiteStats.php 'Reenable SiteStatsInit::articles() for bug 35169. SiteStatsInit::doAllAndCommit() still disabled since it breaks the site'
10:28 logmsgbot: tstarling synchronized wmf-config/PoolCounterSettings.php 'increased max queue from 50 to 100 on reports that the limit was reached on the enwiki main page in normal operation'
09:11 mutante: nomcom and langcom wikis look kind of broken , redirecting to pages on incubator with "Error: This page is unprefixed! "
08:49 mutante: making (almost) all private wikis https-only per RT-2565, vi remnant.conf,sync,graceful...
07:30 mutante: running sync-apache after making a change to remnant.conf to make grants.wm https-only
05:09 Ryan_Lane: bringing up most instances on virt3, doing so by project priority
04:42 Ryan_Lane: bringing up all instances on virt4, waiting 30 seconds between instances
04:25 Ryan_Lane: bringing up all instances on virt2, waiting 30 seconds between instances
04:09 Ryan_Lane: bringing up all instances on virt1, waiting 30 seconds between instances
04:00 Ryan_Lane: attempting to bring some instances up
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 19 02:17:17 UTC 2012
01:15 mutante: killed, updated, restarted wikibugs bot per request in RT:2656, should have fixed bugzilla:18831

March 18

23:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35308 - Install mw:Extension:DynamicPageList (Wikimedia) on Portuguese Wikipedia (ptwiki)'
19:20 Ryan_Lane: stopping all labs instances, manually recovering gluster volume
15:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35295 - Missing a in abusefilter-hide-log permission for oversighters'
10:49 Ryan_Lane: rebooting virt4 thanks to defunct libvirt process
03:43 Ryan_Lane: bringing all labs instances up
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 18 02:18:51 UTC 2012
01:09 Ryan_Lane: rebooting all of the virt hosts, gluster is having major issues
00:43 Ryan_Lane: rebooting virt2
00:40 Ryan_Lane: restarting glusterfs on virt2
00:11 Ryan_Lane: rebooting virt3 libirt is non-responsive
00:00 Ryan_Lane: bringing up instances that were downed on virt3

March 17

23:50 Ryan_Lane: virt3 crashed, powercycling it
23:34 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove old comments'
23:24 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Remove old comments'
23:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Simplify config somewhat'
23:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Simplify config somewhat'
23:02 logmsgbot: catrope synchronizing Wikimedia installation... : Have to scap for that AFTv5 change to propagate i18n change
22:52 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ 'r114087'
21:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35289 - Add wikisource logo to mobile wikisource gateway'
02:21 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 17 02:21:03 UTC 2012
01:23 AaronSchulz: FindFilesMissingDBRows.php done, list under aaron/output/missingFileDBRows
00:11 AaronSchulz: Running FindFilesMissingDBRows.php on all wikis

March 16

21:21 binasher: running enwiki.revision sha1 schema migrations on eqiad side
20:12 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild moodbar messages
20:03 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Re-enable moodbar on enwiki'
19:53 logmsgbot: reedy synchronized php-1.19/extensions/MoodBar/ 'r114030'
19:15 Reedy: Ran namespaceDupes on stewardwiki
17:11 RobH: hdd in search1017/1018 replaced per rt 2583
16:54 RobH: search1017 and search1018 coming down for hdd swap
16:53 RobH: cp1017 back in service pool
16:43 RobH: cp1019 back in full service
16:22 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/specials/SpecialCentralAuth.php 'r114021'
16:22 RobH: cp1017 memory error, coming down for troubleshooting.
16:18 RobH: cp1019 memory error cleared after reseating, notes on rt 2651
16:09 mark: Migrated all varnish3 packages to newer varnish packages from git
16:08 RobH: cp1019 coming down for memory error troubleshooting
15:58 RobH: cp1040 repaired per rt 2611
15:48 RobH: cp1040 down for memory replacement
15:09 logmsgbot: reedy synchronized stylize.php 'Test for hume'
15:04 logmsgbot: root synchronized ufg.sql 'test sync to see if hume is fixed'
14:55 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMaintenance/cleanupBug31576.php
14:04 apergos: restarted swift-container-auditor on ms-be3, it had died for some reason
08:07 mutante: i reverted that (star cert for wikitech), no worries i "shred"ded the files
07:51 mutante: replaced self-signed cert on wikitech with the star cert
04:19 mutante: on stafford, deleting spence's puppet report files to free some disk space (they are like the largest report files of all)
03:09 mutante: stafford - - /var/lib/puppet/reports is getting quite large (18G), and we got the first disk space warning, do we want to keep those?
02:45 mutante: killing nrpe on several hosts where it was running as the wrong user again (somehow through the use of dsh)
02:21 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 16 02:21:35 UTC 2012
01:12 mutante: stopping nagios-wm temp. while changing nrpe config (will watch it manually until it's back)
00:54 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add NS to stewardwiki at request of Philippe'
00:51 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Add NS to stewardwiki at request of Philippe'

March 15

23:17 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'r113974'
23:12 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/DisableTemplate.php 'r113973, fixes bug 35249'
23:10 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/modules/ext.articleFeedbackv5/ext.articleFeedbackv5.js 'r113972'
22:59 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
22:59 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Bump AFTv4 event logging percentage from 25% to 100%'
22:57 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/jquery.articleFeedback/jquery.articleFeedback.js
22:48 mutante: purging Lucene monitoring on indexer from db9, remove duplicate service definitions manually anyways (still tons left), run purge script, reload Nagios..
22:24 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
22:23 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Raise AFTv4 event logging percentage from 5% to 25%'
22:21 mutante: getting rid of Swift HTTP checks on non production machines manually (come on spence _purge_ ;P)
22:07 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
22:04 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Raise AFTv4 event logging percentage from 1% to 5%'
21:44 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
21:28 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'r113961'
21:25 pgehres: K4-713 synchronized payments cluster to r113956
21:25 pgehres: disabled credit cards on donate.wikimedia.org
21:21 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'fix fatal'
21:20 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Bump AFTv4 event logging percentage from 0.27% to 1%'
21:19 Ryan_Lane: rebalancing instances gluster volume
21:18 RoanKattouw: That was r113959
21:18 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/ext.articleFeedback/ext.articleFeedback.js
21:11 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedback/modules/ext.articleFeedback/ext.articleFeedback.js 'r113958'
21:09 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ 'r113957'
20:46 mark: bits.pmtpa cluster back online
20:44 RobH: dns update for silver and zhen servers
20:37 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMaintenance/cleanupBug31576.php
19:54 RobH: sq67-sq70 have been reinstalled, but not signed in puppet, not sure if they are ready for that or if there are other items mark needs to change first
19:11 RobH: working on sq67-sq70 reinstalls, disregard alerts
19:00 RobH: db1022 resetup and redeployed per rt 2537 and assigned back to asher
18:51 logmsgbot: reedy synchronizing Wikimedia installation... : Running scap to deal with message changes earlier
18:19 RobH: db1022 coming down for reinstall and resetup of raid per rt 2537
17:55 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/ 'r113940'
17:54 logmsgbot: reedy synchronized php-1.19/extensions/CheckUser/ 'r113940'
17:53 logmsgbot: reedy synchronized php-1.19/extensions/wikihiero/modules/ext.wikihiero.css 'r113940'
17:52 logmsgbot: reedy synchronized php-1.19/extensions/NewUserMessage/NewUserMessage.class.php 'r113940'
17:41 logmsgbot: reedy synchronized php-1.19/includes/RecentChange.php 'r113938'
17:38 logmsgbot: reedy synchronized php-1.19/resources/mediawiki/mediawiki.util.js 'r113936'
17:37 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialUndelete.php 'r113936'
17:32 logmsgbot: reedy synchronized php-1.19/languages/messages/ 'r113935'
17:31 logmsgbot: reedy synchronized php-1.19/resources/ 'r113935'
17:31 logmsgbot: reedy synchronized php-1.19/includes/ 'r113935'
17:16 logmsgbot: reedy synchronized php-1.19/includes/SkinTemplate.php 'r113932'
16:13 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMaintenance/cleanupBug31576.php 'r113929'
15:15 mark: Created git repo operations/debs/varnish in gerrit
14:06 apergos: disabled moodbar temporarily on en wikii, see bug 35245
14:02 logmsgbot: ariel synchronized wmf-config/InitialiseSettings.php 'emergency disable of feedback dashboard (right config var this time?)'
13:51 logmsgbot: ariel synchronized wmf-config/InitialiseSettings.php 'emergency disable of feedback dashboard'
13:11 apergos: on screen as root on dataset1001, copying to gluster volume; if this causes problems feel free to shoot it. ( cp -a 20120211 /mnt/glusterpublicdata/public/enwiki/ )
09:08 mutante: ran puppet on mw1020
08:12 mutante: installing apache,apt,cron,mysql-client upgrades on spence
07:51 mutante: messed with /var/lib/dpkg/status on hume to fix broken packages/remove "marked for purging" on libmysql-php5 without removing a ton of other packages, rather hackish but seems fine anyways, like not broken anymore on simulated dist-upgrade etc
07:01 mutante: uprading apache and apt on hume
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 15 02:17:35 UTC 2012
01:26 Ryan_Lane: labsconsole was missing libapache2-mod-php5. puppet must have tried to upgrade a package unsuccessfully
01:22 mutante: planet back up (installed libapache2-mod-php5 which installed apache2-mpm-prefork and removed apache2-mpm-worker)
01:19 mutante: planet down - apache on singer, syntax error in site config "Invalid command 'php_admin_flag'"
01:03 mutante: fixing nrpe "unable to read output" raid check on srv197,207,243,,244,253.. (nrpe running as wrong user)

March 14

23:16 maplebed: installed the swiftcleaner to run daily from iron. see root's crontab for more info.
20:41 binasher: disabled log_queries_not_using_indexes on all core dbs
20:33 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header for image support debugging'
20:30 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header for image support debugging'
19:29 maplebed: rebooting ms-be1 to enable hyperthreading (and make it the same as all the other ms-be hosts)
19:06 preilly: pushing x-images header for vary support
19:06 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing header for image support debugging'
19:05 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/MobileFrontend.body.php 'zero needs to add x-images to vary header'
18:58 maplebed: ms-be5 is back in rotatino
18:31 preilly: push zero change for carrier testing
18:31 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
16:19 RobH: updating dns for new domain wikimediacommons.pt (nameservers not yet pointed at us)
16:04 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'add vcs for extdist updates'
13:03 RobH: cp1029-cp1035 all installed and ready for varnish deployment, puppet has been run
08:24 mutante: running "apt-get -f install" on snapshot3 to fix dpkg, which installed mysql-client- and client-core-5.1
08:02 mutante: stop/start memcached on srv254,srv255,srv257
07:51 mutante: restarting mecached on marmontel
07:51 mutante: fixing owa[1-3] Swift HTTP commands manually
03:44 mutante: ekrem - user agent "AppleDictionaryService" requests cause temp. WAP outage ..it seems
03:38 mutante: free some disk space on spence - deleted user.log.1 on spence, compressing messages.1, apt-get clean,...
02:52 RobH: cp1032-cp1035 reinstall issue wiped mbr causing issues, will reinstall in my AM
02:49 RobH: revoked, cp1032 is some reason in grub error, and its too late at night for me to work on it, will troubleshoot tomorrow
02:48 RobH: realized i forgot to log hours ago that cp1029-cp1036 are installed with puppet run, ready for varnish deployment tomorrow
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 14 02:17:13 UTC 2012

March 13

23:51 mutante: upgrading bugzilla to 4.0.5
23:42 logmsgbot: reedy synchronized php-1.19/resources/jquery/jquery.textSelection.js 'r113786'
23:14 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
22:47 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113779'
22:44 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/beta_common.css 'r113774'
22:44 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/stylesheets/common.css 'r113774'
22:43 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113771'
22:42 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/api/ApiQueryExcerpts.php 'r113774'
22:27 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Removing moile URL template for tewtwiki'
21:44 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
21:44 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
21:31 logmsgbot: asher synchronized wmf-config/db.php 'replacing db18 with new s7 slave db56'
21:19 binasher: started slaving db56 from db37
20:30 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
19:27 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero needed for carrier testing'
19:17 RobH: iron updated to use ipmi_mgmt script
19:08 preilly: pushing changes for zero to mswiki
19:08 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
19:08 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
19:05 binasher: streaming hotbackup of db1041 to db56 (new s7 slave replacing db18)
18:10 maplebed: failover successful, restarted pybal on lvs4, failback successful.
18:09 binasher: power cycling db1020, which also froze this morning
18:08 maplebed: stopping pybal on lvs4 - should fail over to lvs3
17:47 maplebed: pybal restarted on lvs3
17:47 binasher: power cycling db1040, crashed again
17:30 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Bug 35183 - p include extensions/Renameuser/Renameuser.php instead of extensions/Renameuser/SpecialRenameuser.php'
17:12 mark: Sending all normally-pmtpa upload traffic to upload-lb.eqiad
17:05 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
16:59 preilly: add disable images support to mswiki under zero domain
16:59 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'add disable images option for mswiki on zero domain'
16:58 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add disable images option for mswiki on zero domain'
16:46 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add ZeroRatedMobileAccess extension to mswiki remove from mywiki'
16:44 mark: Sending traffic from Japan, India, Mexico to upload-lb.eqiad
16:37 LeslieCarr: reinstalling neon
16:23 apergos: stole some free space from the phys volume on ms1002 to give us more time for the rsync to keep going til after the move to swift etc
15:28 mark: Sending traffic from the USA to upload-lb.eqiad
15:27 mark: Rebooting lvs1005 with upgraded kernel/packages
15:12 LeslieCarr: manually deleted cp1025 info from nagios config file - nagios restored for now
14:51 mark: Sending traffic from Canada to upload-lb.eqiad
14:32 mark: Sending traffic from Brazil to upload-lb.eqiad
13:58 mark: Sending traffic from Argentina to upload-lb.eqiad
12:58 mark: Seeding the eqiad upload caches from live upload requests
11:59 mark: Setup squid logging to oxygen, with oxygen relaying to multicast 233.58.59.1
11:02 mark: Rebooting lvs1002 with kernel updates
10:17 mark: Rebooting manutius with newer 2.6.36 kernel to attempt avoiding i/o kernel bug with torrus
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 13 02:18:03 UTC 2012

March 12

22:55 K4-713: synchronized payments cluster to r113679, and tweaked the anti-fraud rules
21:51 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r113671'
21:51 logmsgbot: catrope synchronized php-1.19/extensions/ArticleFeedbackv5/ArticleFeedbackv5.hooks.php 'r113671'
21:44 Reedy: Running foreachwiki extensions/WikimediaMaintenance/cleanupBug31576.php in screen as me on hume
21:39 RobH: search1014 repaired per rt 2483
20:26 RobH: cp1040 coming down for hardware stuffs
18:19 Nikerabbit: Assuming scap has finished
17:48 logmsgbot: nikerabbit synchronizing Wikimedia installation... : Deploying updated Translate
17:46 notpeter: restarting indexer on searchidx2
17:24 logmsgbot: nikerabbit synchronized php-1.19/includes/Title.php 'r113635'
17:22 logmsgbot: nikerabbit synchronized php-1.19/languages/ 'r113635'
17:14 logmsgbot: nikerabbit synchronized php-1.19/extensions/Narayam/ 'Updating Narayam'
17:13 mark: PXE booting cp1025-cp1028
17:11 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'Updating WebFonts'
15:16 mark: Rebooted manutius, stuck in a similar state as streber always did
06:10 mutante: turning off debug mode in nagios-nrpe, again had to kill it , restart fails
05:53 mutante: dunno, copper was stuck (no mgmt output after reboot) but powercycling it and back
05:43 mutante: rebooting copper to make sure grub update didnt break it and asked for restart anyways
05:37 mutante: copper - installing (security) updates (apt,grub,openssl,ruby,libc6..)
04:19 mutante: wanted to restart nagios-nrpe-server on spence with debug=1 to investigate permission issue. arr! "Address already in use" "cant write to pidfile", killed the one started on Feb18, and reordered allowed_hosts, spence talks to itself again now :p
03:40 mutante: same (and nscd) on fenari
03:35 mutante: upgrading libc6 and related packages on spence
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 12 02:17:28 UTC 2012

March 11

08:14 apergos: restarted lighttp on dataset2
07:49 apergos: removed current htcp log file, restarted purger, it seems to be logging normallynow
07:35 apergos: current ls shows 17416851456 2012-03-11 07:34 HTCPpurger.log while current du -sh shows 175M for /var/log. Sparse file that gets rotated badly? lots of leading nulls (many gb worth), why?
07:33 apergos: on ms1004 the HTCPpurger.log file after rotation was 17 gb, filling the disk. Removed it.
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 11 02:17:35 UTC 2012

March 10

22:09 Reedy: Make that wikimania2012, not wikimediawiki
22:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable anon page creation for wikimediawiki'
19:28 binasher: set sync_binlog = 1 on all current masters and eqiad dbs
19:22 binasher: reslaved db1033
07:03 mutante: ran puppet on db1022, another one that works fine manually but somehow did not by itself
05:11 mutante: doing more (cp*, db*, msbe-* ,mw*) by hand / for loop
05:01 mutante: starting nagios-nrpe-server on all via dsh (fail to restart on config change issue)
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 10 02:16:57 UTC 2012
01:07 maplebed: started swiftcleaner on owa1 looking for (and purging) bad objects
01:06 maplebed: rebalanced the swift rings to finish decreasing traffic sent to ms1 and ms2
00:18 Ryan_Lane: powercycling ssl1003
00:18 Ryan_Lane: powercycling ssl1001

March 9

20:34 notpeter: stopping search indexer on searchidx2 for fresh rsync to searchidx1001
19:58 preilly: pushed change to remove description from landing page
19:57 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
18:59 Ryan_Lane: sending test.m.wikipedia.org to the same place as test.wikipedia.org via squid
18:58 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Fixing wgMobileUrlTemplate settings for domains that do not have .m. domains configured'
18:48 logmsgbot: reedy synchronized php-1.19/extensions/WikiLove/modules/ext.wikiLove/ext.wikiLove.css 'r113497'
18:40 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Changing the way in which wgMobileUrlTemplate is configurable by InitialiseSettings.php'
18:39 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling wgMobileUrlTemplate for testwiki - hopefully for real this time'
18:34 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Making wgMobileUrlTemplate configurable by InitialiseSettings.php'
18:34 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Disabling wgMobileUrlTemplate for testwiki'
17:40 logmsgbot: awjrichards synchronized php/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113489'
17:32 maplebed: set swift storage device weight on ms2 to 0 and pushed out rings
15:52 apergos: cleared up a little bit of space on root partition of snapshot2, but that's about it. I hope we never have 3 versions of mw in test at the same time, the tmp caches will kill us
15:52 mark: Turned off vcc_err_unref on all varnish servers, so varnish doesn't complain when ACLs/probes/backends are unused
15:44 Jeff_Green: hume apt upgrades, puppetd --test, switch to mysql 5.1.53-fb3753-wm1
06:38 Ryan_Lane: reloading autofs on all labs instances
06:13 Tim: running svn cleanup on extdist trunk
04:18 Tim: switched php and wmf-deployment symlinks over to php-1.19 instead of php-1.18
04:18 Tim: restarted morebots
00:57 pp-pdf2: updated pyfribidi to 0.11.0 fixing https://github.com/pediapress/pyfribidi/issues/2
00:57 pp-pdf3: updated pyfribidi to 0.11.0 fixing https://github.com/pediapress/pyfribidi/issues/2
00:57 pp-pdf1: updated pyfribidi to 0.11.0 fixing https://github.com/pediapress/pyfribidi/issues/2
00:38 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add ZeroRatedMobileAccess extension to mywiki'
00:32 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/beta_opensearch.js 'fixes to code push'
00:32 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/beta_opensearch.min.js 'fixes to code push'
00:27 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/templates/ApplicationTemplate.php 'fixes to code push'
00:27 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/opensearch.js 'fixes to code push'
00:27 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/javascripts/opensearch.min.js 'fixes to code push'
00:01 RobH: oxygen install done, booting successfully after multiple tests, now running puppet for initial config
00:01 K4-713: updated the paypal IPN listener on aluminium to r1450

March 8

23:57 logmsgbot: awjrichards synchronized php-1.19/extensions/MobileFrontend/templates/ApplicationTemplate.php 'r113428'
23:56 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'per-wiki memory limit configuration, with extra memory for zh* for converter tables'
23:55 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'per-wiki memory limit configuration, with extra memory for zh* for converter tables'
23:42 mutante: rebooting ms-be5
23:37 logmsgbot: awjrichards synchronizing Wikimedia installation... : Updating MobileFrontend per https://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments
23:24 binasher: streaming hotbacking of db1017 to db1033 - no snapshots of enwiki in eqiad til db1033 is back
23:19 Tim: started changing the php symlink to 1.19 instead of 1.18, but then changed my mind and changed it back.
23:16 logmsgbot: tstarling synchronizing Wikimedia installation... :
23:07 logmsgbot: tstarling synchronized php-1.19/extensions/ExtensionDistributor/svn-invoker.conf
23:01 logmsgbot: asher synchronized wmf-config/db.php 'returning db24 to service'
22:58 maplebed: powercycled ms-be3 - it crashed 2.5 hours ag.
22:52 logmsgbot: asher synchronized wmf-config/db.php 'pulling db18'
22:40 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend 'deployed r113413, r113414'
22:39 LeslieCarr: poked hole to allow labs machines to reach gluster machines in tampa
22:13 logmsgbot: catrope synchronized php-1.19/includes/MagicWord.php 'r113411'
22:13 logmsgbot: catrope synchronized php-1.19/includes/Cdb.php 'r113411'
22:13 logmsgbot: catrope synchronized php-1.19/includes/WebRequest.php 'r113411'
22:11 RobH: udpating dns for oxygen
22:03 RobH: oxygen coming down for reinstall
20:42 cmjohnson1: power to msw-c1-sdtpa restore
20:40 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
20:40 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.php 'changes for zero'
20:39 cmjohnson1: removing and relocating power to msw-c1-sdtpa
19:38 logmsgbot: catrope synchronizing Wikimedia installation... : ArticleFeedbackv5 updates
19:34 RoanKattouw: Running scap for ArticleFeedbackv5 updates
19:30 RoanKattouw: Running AFTv5 schema changes on enwiki
19:29 logmsgbot: catrope synchronized wmf-config/CommonSettings.php '$wgArticleFeedbackv5OversightEmails'
19:29 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php '$wgArticleFeedbackv5OversightEmails'
19:26 RoanKattouw: Applying AFTv5 schema changes to en_labswikimedia
19:09 preilly: push zero rated changes
19:09 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.i18n.php 'changes for zero'
19:09 logmsgbot: preilly synchronized php-1.19/extensions/ZeroRatedMobileAccess/ZeroRatedMobileAccess.body.php 'changes for zero'
19:04 RoanKattouw: Clearing message blobs
18:53 RoanKattouw: Running rebuildLocalisationCache.php
18:49 binasher: power cycling cp1044
18:46 binasher: purging entire mobile varnish cache - the main mobile template included robots no-follow
18:43 preilly: needed to fix a google issue with robots
18:43 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/ApplicationTemplate.php 'remove ROBOTS metatag'
18:40 logmsgbot: preilly synchronized php-1.19/extensions/MobileFrontend/ApplicationTemplate.php 'remove ROBOTS metatag'
18:40 binasher: deploying new squid frontend.conf to fix epic fail - all googlebot traffic was being redirected to mobile. now just if it's mobilegooglebot.
18:29 RoanKattouw: Applying AFTv5 schema changes on testwiki
18:27 RoanKattouw: Pushing new AFTv5 code to testwiki, do not sync to the live site just yet
17:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'ptwikipedia to ptwiki'
17:14 cmjohnson1: shutting down db18 for memory testing
16:57 RobH: search1014 still down per rt2483
16:47 maplebed: took ms-be5 out of rotation in the swift cluster - it's crashed 3 times now.
16:36 logmsgbot: reedy synchronized php-1.19/extensions/ExtensionDistributor/ExtensionDistributor_body.php 'r113368'
16:31 logmsgbot: reedy synchronized php-1.19/extensions/ExtensionDistributor/ExtensionDistributor_body.php 'Revert live hack because it works, will come in properly'
16:30 logmsgbot: reedy synchronized php-1.19/extensions/ExtensionDistributor/ExtensionDistributor_body.php 'Test for bug 27246'
16:16 RobH: search1008 repaired
15:52 RobH: mw1103 finally repaired and ready for os and such
14:48 pp-pdf1: installed python faulthandler 2.1
14:47 pp-pdf3: installed python faulthandler 2.1
14:47 pp-pdf2: installed python faulthandler 2.1
14:24 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 35012 - Namespace aliases for wikipedia and wikipedia-talk namespaces on Sanskrit wiki'
09:17 mutante: running puppet on mw1010 - finished quickly without problems - uh, wonder why Nagios reported puppet freshness then
08:22 mutante: cp1019 - Hitting F1 to continue reboot ( "Alert! System fatal error during previous boot")
08:21 mutante: cp1019 went down, then rebooted by itself (i think) after showing "idrac-8W82BP1 Severity: Non Recoverable, SEL:CPU Machine Chk: Processor sensor, transition to non-recoverable was asserted"
07:54 mutante: cadmium fixed by adding groups::wikidev
07:41 mutante: puppet on cadmium broken due to dependency Group[500] for User[catrope]
07:20 mutante: ms1004 ran out of disk - caused by 17G HTCPurger.log.1, trying to gzip it now
06:52 logmsgbot: tstarling synchronized multiversion/MWMultiVersion.php
06:51 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files:
03:04 Guest32353: powercycled ms-be5; it has been unresponsive for 2 hours.
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 8 02:18:02 UTC 2012
01:32 AaronSchulz: fixBug34995.php done
01:26 AaronSchulz: running fixBug34995 on all wikis
00:17 Ryan_Lane: adding zero cnames
00:16 Ryan_Lane: installing newer wikimedia-task-dns-auth on all dns servers
00:15 Ryan_Lane: added wikimedia-task-dns-auth_0.18 to the repo, to add support for zero

March 7

23:05 logmsgbot: aaron synchronized php-1.19/includes/filerepo/file/LocalFile.php 'deployed r113319'
22:39 maplebed: set swift weight for ms1 to 0 initiating the process to move data off the host in preparation for decomissioning it.
21:17 Jeff_Green: running apt upgrades and puppetd --test on srv194, srv197, srv203, srv212, srv213, srv230, srv244, srv245, srv252, srv282 and manually restarting nrpe because they're reporting funky in nagios
20:20 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
20:17 Jeff_Green: yet another redirects.conf change, per RT#2498 redirect wikimedia.com-->wikimedia.org
20:05 binasher: reverted no-pagecache rsync on search nodes - without corresponding index warmup in lsearchd, it just pushes back the pain a bit and does more harm than good
20:04 binasher: deployed support for zero.wikipedia.org and carrier tagging to mobile varnish servers
19:38 logmsgbot: catrope synchronized php-1.19/includes/resourceloader/ 'r113278'
19:27 Jeff_Green: manual apt-upgrade, puppetd --refresh, and repeat on srv265 because it was running on outdated apache config
18:44 RobH: correction sq39
18:36 RobH: pulled sq39 from text pybal config, pulled sq46 from upload pybal config
18:36 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
18:36 logmsgbot: catrope synchronized php-1.19/extensions/CustomUserSignup/modules/AccountCreationUserBucket.js 'touch'
18:12 RobH: shutting down sq38 and sq46 per rt 2581 for testing
16:02 cmjohnson1: replacing hdd for disk 10 on db22
16:00 cmjohnson1: pulling disk 10 from db22
13:28 mark: Removed torrus from streber
13:00 pp-pdf2: updated mwlib to 0.13.6
13:00 pp-pdf3: updated mwlib to 0.13.6
13:00 pp-pdf1: updated mwlib to 0.13.6
11:29 logmsgbot: hashar synchronizing Wikimedia installation... : trigger a rebuild of l10n cache
04:53 mutante: added ms-be5 drives to swift cluster
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Wed Mar 7 02:18:01 UTC 2012
02:11 logmsgbot: catrope synchronized php-1.19/includes/api/ApiBase.php 'r113212'
01:58 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/FileBackend.php 'bumped max file size to 4GiB'
00:27 maplebed: put ms-be4 into rotation as a new production swift backend storage node
00:21 maplebed: put ms-be3 into rotation as a new production swift backend storage node
00:05 maplebed: put ms-be2 into rotation as a new production swift backend storage node

March 6

23:54 logmsgbot: catrope synchronized php-1.19/extensions/CustomUserSignup/ 'Belated sync of r113056'
23:52 binasher: deploying new frontend squid config to include googlebot in mobile redirects
23:36 logmsgbot: reedy synchronized php-1.19/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.class.php 'r113200 reverting r113198'
23:25 Tim: patched 5xx-filter.c live on locke and reloaded udp2log to stop the segfaults
23:20 logmsgbot: reedy synchronized php-1.19/extensions/SyntaxHighlight_GeSHi/SyntaxHighlight_GeSHi.class.php 'r113198'
21:46 logmsgbot: catrope synchronized php-1.19/extensions/CentralAuth/specials/SpecialCentralAuth.php 'r113183'
21:41 notpeter: restarting puppet on brewster
21:03 Jeff_Green: pushing another change to redirects.conf and doing a graceful apache restart
20:32 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild message cache stuffs for r113129
20:31 Jeff_Green: disabled Global Connect nagios test (check_gcsip) on payments cluster because GC is down and nagios is spammy
20:25 notpeter: reimaging search1001-1020 with new partman recipe :/
20:22 notpeter: temp stopping puppet on brewster
20:21 logmsgbot: reedy synchronized php-1.19/resources/mediawiki.action/mediawiki.action.edit.js 'r113175'
20:20 logmsgbot: reedy synchronized php-1.19/maintenance/populateRevisionSha1.php 'r113175'
20:19 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialContributions.php 'r113175'
20:18 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialUserlogin.php 'r113176'
20:00 pp-pdf1: installed log-wikimedia-operations (which can be used for automated logging to #wikimedia-operations)
19:53 Ryan_Lane: restarting labs mysql to allow for more connections
19:26 Ryan_Lane: installing nova-api on virt0
19:09 Ryan_Lane: upping FLAGS.sql_max_pool_size for nova-api
18:47 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
18:46 Ryan_Lane: rebooting all instances
18:34 Ryan_Lane: restarting nova-network on virt2
18:19 Ryan_Lane: rebooting virt1
18:15 Ryan_Lane: rebooting virt2
18:11 Ryan_Lane: rebooting virt3
18:07 Ryan_Lane: rebooting virt4
17:57 Ryan_Lane: taking the opportunity to apply security updates to virt0-4
16:25 logmsgbot: catrope synchronized docroot/foundation/FrameResize.html 'Put Jobvite frame resize file in foundationwiki docroot per Erik'
11:40 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: switching sr* to 1.19
11:15 logmsgbot: hashar synchronized php-1.19/languages/messages/MessagesSa.php 'r1113039 for bug 34938 : title is sometime empty on Sanskrit wikis'
11:13 logmsgbot: tstarling synchronized php-1.19/includes/OutputPage.php 'r113128'
10:41 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: switching zh* from 1.18 to 1.19
08:36 mutante: on hooper: puppet broken due to dependency Package[libapache2-mod-php5] for Service[apache2]
03:33 mutante: rebooting bast1001 for kernel upgrade
03:32 mutante: upgrading apache2 packages, base-files, kernel, several libs on bast1001
03:27 mutante: installing a couple upgrades on fenari (apache2-utils, update-manager-core, cvs, ruby, libxml*, libopenssl-ruby*...)
02:37 logmsgbot: LocalisationUpdate completed (1.18) at Tue Mar 6 02:37:06 UTC 2012
02:36 logmsgbot: tstarling synchronizing Wikimedia installation... : updating to r113119
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Tue Mar 6 02:18:13 UTC 2012
01:27 Jeff_Green: manually updated packages and restarted apache on srv198, srv229, srv262, srv268, mw40 because their apache redirect configs failed to update after sync-apache and restart
01:07 Jeff_Green: another adjustment to redirects.conf and apache-graceful-all for RT#2488

March 5

22:24 Jeff_Green: modified redirects.conf per RT #2488
21:21 Reedy: Ran foreachwiki cleanupUploadStash.php
20:36 maplebed: enabled swift for 100% of thumbnails in production
18:18 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'i18ndeploy r113058'
18:11 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'WebFonts: bugwiki bug 34550; sawikisource bug 34159; amwiktionary amwikiquote bug 34700'
18:01 mark: Raised MTU between cr1-sdtpa - (csw1-sdtpa) - cr2-pmtpa to 9192
17:35 Jeff_Green: removed 3GB db30:/tmp/gmond.log and force-restarted gmond b/c the init script failed to restart it
17:16 Jeff_Green: adjusted LVS partitions on hume, moved /usr/local/apache to a new 5GB mount
15:18 mark: Fixed DNS resolving on the core routers by allowing DNS replies in the loopback filter
14:44 logmsgbot: reedy synchronized php-1.19/includes/Title.php 'r113036'
14:43 logmsgbot: reedy synchronized php-1.19/includes/AjaxResponse.php 'r113036'
14:35 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/ 'r113035'
14:34 logmsgbot: reedy synchronized php-1.19/extensions/CategoryTree/ 'r113035'
13:50 mark: Set increased OSPF/OSPFv3 metric 30 on both directions of the link cr1-eqiad:xe-5/2/1 <--> cr1-sdtpa:xe-0/0/1, to combat higher than normal jitter and packet loss on the link
12:53 mark: Upgraded observium to latest version
09:41 mutante: restarting memcached on marmontel
09:40 mutante: restarting squid backend on knsq25
06:52 Ryan_Lane: all of the instances are accessing the file descriptors of files inside of the _base directory, and fuse has an issue with this. gluster can't recreate the base directory because of the processes holding open the old one.
06:50 Ryan_Lane: I've corrupted the _base directory on the instance's glusterfs share. I'm recovering the files from file descriptors using lsof. Not totally sure how I'm going to get the _base directory back, yet.
02:33 logmsgbot: LocalisationUpdate completed (1.18) at Mon Mar 5 02:33:04 UTC 2012
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Mon Mar 5 02:16:39 UTC 2012

March 4

21:48 logmsgbot: reedy synchronized wmf-config/ 'Bug 32726 - Set =true for Commons'
21:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix .'
21:41 logmsgbot: reedy synchronized wmf-config/ 'Bug 32726 - Set =true for Commons'
21:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34897 - Enable Special:Import on Catalan wikisource'
20:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34567 - New logo for Arabic Wiktionary'
20:31 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34715 - Please modify the import sources for the Spanish Wikiversity'
20:29 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34694 - Install the Quiz extension on de.wikibooks'
20:25 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wmgMoodBarCutoffTime'
20:25 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Create wmgMoodBarCutoffTime'
20:14 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Variablise moodbarconfig infoUrl'
20:12 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Variablise moodbarconfig infoUrl'
20:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34618 - Install MoodBar on fr.wikisource'
20:01 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34766 - Logo of Sanskrit Wikisource'
19:53 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34867 - Switch Sango wiktionary logo'
19:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34931 - Add namespaces aliases on as.wikipedia.org'
19:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34690 - Changing the name in the title bar to Assamese'
02:35 logmsgbot: LocalisationUpdate completed (1.18) at Sun Mar 4 02:35:16 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Sun Mar 4 02:17:34 UTC 2012

March 3

18:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34728 - Categories added to user pages by Babel in pt.wiktionary'
13:04 logmsgbot: aaron synchronized php-1.19/includes/Revision.php 'deployed r112949'
02:35 logmsgbot: LocalisationUpdate completed (1.18) at Sat Mar 3 02:35:08 UTC 2012
02:18 logmsgbot: LocalisationUpdate completed (1.19) at Sat Mar 3 02:18:04 UTC 2012

March 2

21:41 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'disabled logging hack'
20:47 Jeff_Green: added redirect/301 from http://static.wikimedia.org --> http://dumps.wikimedia.org now that archival static html dumps are located there
19:53 mark: Decommissioned csw5-pmtpa from AS14907 service. rest in pieces ;)
19:10 mark: Did a hot cut to remove csw5-pmtpa out of the path of cr1-sdtpa -> csw1-sdtpa -> csw5-pmtpa -> cr2-pmtpa
17:46 cmjohnson1: powering down msw1-pmtpa for relcocation to d1-pmtpa
17:40 cmjohnson1: disconnecting management fiber from msw1-pmtpa
16:59 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/CentralAuth.php 'r112904'
16:55 RobH: ms-be4 boot order fixed, fixing ms-be5 & ms-be2
16:49 RobH: fixed boot order on ms-be3, fixing ms-be4
16:33 RobH: poking at bios on ms-be3
16:05 RobH: wikitech outage resolved
15:20 RobH: shutdown frdev offsite vm per email to engineering last week
15:18 RobH: backing up wikitech in hopes of upgrading some of its software
08:36 apergos: on ms1004, low on space, HTCPpurger.log.1 had about 16 gb of nulls before any real content, I tailed off the real stuff and tossed the original. The current log file has the same problem, why?
02:34 logmsgbot: LocalisationUpdate completed (1.18) at Fri Mar 2 02:34:34 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.19) at Fri Mar 2 02:17:51 UTC 2012
01:36 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/lockmanager/LockManager.php 'deployed r112867'
00:41 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree 'deployed r112862'

March 1

23:33 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'log agent'
23:29 logmsgbot: reedy synchronizing Wikimedia installation... : Push message updates from r112848
23:22 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'logging fix'
23:22 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php
23:20 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php
23:17 logmsgbot: reedy synchronized php-1.19/includes/filerepo/backend/FSFileBackend.php 'r112850'
23:16 logmsgbot: reedy synchronized php-1.19/includes/Article.php 'r112850'
23:11 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php
23:06 logmsgbot: reedy synchronized php-1.19/extensions/MoodBar/ApiFeedbackDashboardResponse.php 'r112848'
23:05 logmsgbot: reedy synchronized php-1.19/extensions/Collection/Collection.body.php 'r112848'
22:12 logmsgbot: aaron synchronized php-1.19/includes/specials/SpecialContributions.php 'deployed r112844'
22:06 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend 'deployed r112841'
21:04 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'enabled FileBackend debug log'
19:57 cmjohnson1: replaced disk 3 labstore1 chassis
19:54 cmjohnson1: removing disk 3 from labstore1 chassis
19:47 Ryan_Lane: restarted memcached on virt0
19:15 logmsgbot: reedy synchronized php-1.19/cache/interwiki.cdb 'Updating interwiki cache'
17:39 Jeff_Green: Removed >5GB /tmp/gmond.log on db25, db32, db33, db37
17:36 logmsgbot: hashar synchronized php-1.19/includes/EditPage.php 'r112819 - Bug 34849 diff during editing an old version compares to the old version instead of the current one'
17:36 Jeff_Green: Removed >5GB /tmp/gmond.log on db13
17:35 Jeff_Green: Removed >5GB /tmp/gmond.log on db11
17:25 Jeff_Green: Removed 5.3GB /tmp/gmond.log on db1018
17:24 Jeff_Green: Removed 5.3GB /tmp/gmond.log on db1017
17:13 Jeff_Green: Removed 4.8GB /tmp/gmond.log on db1008. Tried to resist urge to make snarky comment about ganglia but failed.
14:54 RobH: strontium server rebooting to set HT to enabled
14:26 mark: Moving bits traffic back from pmtpa to eqiad
14:24 mark: Cleared dnsmasq cache on virt2
14:16 mark: csw5-pmtpa: Mar 1 14:01:42:A:Power Supply 2 , 2nd from left, bad
14:14 mark: mr1-pmtpa rebooted/lost power for some reason
14:07 mark: pmtpa/sdtpa management network went down
13:54 mark: Pooled new eqiad bits servers strontium and palladium
12:45 logmsgbot: hashar synchronized php-1.19/includes/specials/SpecialWatchlist.php 'r111882 for Bug 34835 - watchlist shows times in UTC'
10:53 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: reverting sr* wikis back to 1.18 per Siebrand's recommendation due to bug 34832
06:26 logmsgbot: tstarling synchronized php-1.19/extensions/SpamBlacklist/SpamBlacklist.php 'r112781'
05:46 maplebed: started swift deletion run on owa1, 2, and 3.
02:33 logmsgbot: LocalisationUpdate completed (1.18) at Thu Mar 1 02:33:53 UTC 2012
02:16 logmsgbot: LocalisationUpdate completed (1.19) at Thu Mar 1 02:16:52 UTC 2012
02:15 Ryan_Lane: vlan tagged virt5's eth0 and eth1 ports on csw1-sdtpa
02:12 logmsgbot: aaron synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'debug logging'
02:02 logmsgbot: reedy synchronized php-1.19/resources/mediawiki.action/mediawiki.action.history.diff.css 'r112750'
01:59 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: all zh wikis back to 1.18
01:50 logmsgbot: aaron synchronized php-1.19/extensions/WikiLove 'deployed r112758'
01:37 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Last 265 wikipedias over to 1.19wmf1
01:28 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: s7 to 1.19wmf1
01:23 logmsgbot: reedy synchronized php-1.19/extensions/CategoryTree/CategoryTreeFunctions.php 'r112754'
01:09 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: s2 to 1.19wmf1
00:58 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Meanwhile, on wikipedia.... Hello ruwiki!
00:48 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: jawiki to 1.19wmf1
00:31 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: frwiki to 1.19wmf1
00:21 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: dewiki to 1.19wmf1
00:05 logmsgbot: tstarling synchronized php-1.19/extensions/Collection/Collection.body.php 'r112745'

February 29

23:42 logmsgbot: reedy synchronized php-1.19/extensions/ArticleFeedbackv5/api/ApiArticleFeedbackv5Utils.php 'r112743'
23:38 logmsgbot: catrope synchronized php-1.19/extensions/LiquidThreads/lqt.css 'r112742'
23:35 maplebed: trying a run of swiftcleaner against the commons a2 shard on swift.
23:34 logmsgbot: reedy synchronized php-1.19/extensions/Collection/Collection.body.php 'r112741'
23:28 logmsgbot: reedy synchronized php-1.19/extensions/ArticleFeedbackv5/api/ApiViewRatingsArticleFeedbackv5.php 'r112737'
23:08 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki funtime
22:30 Ryan_Lane: restarting gerrit on manganese to enable replication
22:18 Ryan_Lane: stopped ircecho on formey and started it on manganese
22:18 Ryan_Lane: installed python-paramiko on manganese. needs to be puppetized
22:12 Ryan_Lane: reversing gerrit replication from formey -> manganese to manganese -> formey
22:12 Ryan_Lane: gerrit moved to manganese.
22:09 Ryan_Lane: replacing ssh_host_key on manganese for gerrit with the same one on formey
22:09 Ryan_Lane: stopping gerrit on manganese
22:02 Ryan_Lane: stopped gerrit service on formey, moving to manganese
21:07 logmsgbot: reedy synchronized php-1.19/extensions/OggHandler/ 'r112725'
21:00 logmsgbot: hashar synchronized php-1.19/extensions/ApiSandbox/ext.apiSandbox.js '(bug 34790) Pressing "Make Request" should not make two requests to api.php'
20:58 Ryan_Lane: restarting nova-compute on virt4
19:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Turn wmgReduceStartupExpiry on for wikipedia projects, off for nl/pl wiki ahead of tonights deploy'
19:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Turn wmgReduceStartupExpiry off by default. Needs to go to wikipedia only later on'
19:23 logmsgbot: aaron synchronized wmf-config/PrivateSettings.php 'updating swift auth'
19:08 RobH: dns update for virt5 mgmt
17:52 logmsgbot: reedy synchronized php-1.19/includes/api/ApiQueryLogEvents.php 'r112701'
17:50 logmsgbot: aaron synchronized wmf-config/InitialiseSettings.php 'Give editors patrolmarks right on plwikisource & plwiktionary'
17:42 logmsgbot: reedy synchronized php-1.19/includes/api/ApiQueryLogEvents.php 'Push back to 1.19wmf1 head'
17:38 logmsgbot: aaron synchronized wmf-config/InitialiseSettings.php 'Give editors patrolmarks right on plwiki'
17:36 logmsgbot: aaron synchronized wmf-config/InitialiseSettings.php 'Give editors patrolmarks right on plwiki'
17:33 logmsgbot: aaron synchronized wmf-config/InitialiseSettings.php 'A few tab w/s tweaks'
17:08 logmsgbot: reedy synchronized php-1.19/includes/api/ApiQueryLogEvents.php 'Test reverting r112532, merge of r112374'
15:14 Reedy: Running a long slow sql query against db1020 in screen on fenari to pull globalusage titles with spaces in them
14:36 logmsgbot: reedy synchronized php-1.18/extensions/GlobalUsage/GlobalUsage_body.php 'r112689'
14:26 logmsgbot: reedy synchronized php-1.19/extensions/GlobalUsage/GlobalUsage_body.php 'r112688'
13:55 mark: Reinstalled strontium and palladium with hw raid1 and fully automatic lvm based partman recipe
12:51 schmir: upgraded mwlib to 0.13.5 on pdf cluster
11:35 logmsgbot: hashar synchronized wmf-config/codereview.php 'CodeReview: autodefers /trunk/extensions/ParserFun[/$] so ParserFunctions is not deferred'
11:32 logmsgbot: hashar synchronized wmf-config/codereview.php 'CodeReview: autodefers /trunk/extensions/ParserFun'
10:21 logmsgbot: hashar synchronized php-1.19/extensions/ApiSandbox 'ApiSandBox: r112114: show request time'
03:25 maplebed: took swift out of rotation - thumbnails now served by ms5
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Wed Feb 29 02:34:48 UTC 2012
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Wed Feb 29 02:18:05 UTC 2012
02:02 binasher: manually set large rmem_max and rmem_default on locke and restarted udp2log to stem packet loss, opened an rt ticket to fix the (lost) fix
00:55 Ryan_Lane: restarting gerrit again
00:23 Ryan_Lane: restarting gerrit on formey
00:20 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'mysql parser cache'
00:20 Tim: reimported schema files on db40 and re-enabled mysql parser cache

February 28

23:32 logmsgbot: midom synchronized wmf-config/db.php 'putting db34 back ha ha thanks for reminding'
23:31 logmsgbot: asher synchronized wmf-config/db.php 'pulling db24'
22:48 Ryan_Lane: rebuilding manganese to act as new gerrit server
22:41 logmsgbot: aaron synchronized php-1.19/extensions/FlaggedRevs/backend/FlaggedRevs.hooks.php 'rc_patrolled bug fix'
22:34 logmsgbot: aaron synchronized php-1.19/extensions/FlaggedRevs/business/RevisionReviewForm.php 'debug logging'
22:33 Tim: on db40: deleting mysqld data dir and recreating from schema files in /a/dump
21:57 mark: Setup servers strontium and palladium as additional (internal) bits servers in eqiad. awaiting connection of eth1-3 before deployment
20:50 logmsgbot: reedy synchronized php-1.19/extensions/Collection/Collection.body.php 'r112634'
20:49 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/specials/SpecialCentralAuth.php 'r112634'
20:48 logmsgbot: reedy synchronized php-1.19/resources/mediawiki/mediawiki.util.js 'r112632'
20:43 binasher: streaming a hotbackup of db1038 to db1004
20:40 binasher: streaming hot backup of db1006 to db1040
19:10 logmsgbot: reedy synchronizing Wikimedia installation... : Updating message stuffs
18:45 RoanKattouw: Installing Apache on cadmium
17:33 logmsgbot: aaron synchronized php-1.19/extensions/PagedTiffHandler/PagedTiffHandler_body.php
17:28 logmsgbot: aaron synchronized php-1.19/extensions/PagedTiffHandler/PagedTiffHandler_body.php
17:20 logmsgbot: aaron synchronized php-1.19/extensions/PagedTiffHandler/PagedTiffHandler_body.php 'deployed r112614'
08:49 logmsgbot: nikerabbit synchronized php-1.19/extensions/Narayam/Narayam.php 'Narayam gu mapping out of beta'
08:48 logmsgbot: nikerabbit synchronized php-1.18/extensions/Narayam/Narayam.php 'Narayam gu mapping out of beta'
04:48 Tim: on manutius: rebuilt torrus config DB, moved old one out to /var/lib/torrus/db.broken. Restarted.
04:30 Tim: torrus down, reporting "PANIC: fatal region error detected; run recovery" about its DB files, will stop apache to investigate
03:54 logmsgbot: tstarling synchronized deleted.dblist 'removed chwikimedia'
03:54 logmsgbot: tstarling synchronized all.dblist 'removed chwikimedia'
03:07 logmsgbot: tstarling synchronized php-1.19/includes/specials/SpecialMovepage.php 'r112572'
02:59 logmsgbot: catrope synchronized php-1.19/resources 'r112570'
02:59 logmsgbot: catrope synchronized php-1.19/extensions/CheckUser 'r112570'
02:58 logmsgbot: catrope synchronized php-1.19/includes/logging 'r112570'
02:44 logmsgbot: aaron synchronized php-1.19/includes/Block.php 'deployed r112564'
02:43 logmsgbot: aaron synchronized php-1.19/includes/resourceloader/ResourceLoaderContext.php 'deployed r112564'
02:43 logmsgbot: aaron synchronized php-1.19/resources/mediawiki/mediawiki.js 'deployed r112564'
02:33 logmsgbot: LocalisationUpdate completed (1.19) at Tue Feb 28 02:33:30 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Tue Feb 28 02:17:09 UTC 2012
02:13 binasher: moved db1006 to new s6 master
01:55 Tim: reduced disk space usage on srv191 by running logrotate manually
00:43 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: plwiki to 1.19
00:35 logmsgbot: aaron synchronized php-1.19/includes/StubObject.php 'trigger errors for debugging bad callbacks'
00:28 Tim: removed 2GB of syslogs on srv192
00:25 logmsgbot: tstarling synchronized php-1.18/includes/resourceloader/ResourceLoaderFileModule.php
00:21 logmsgbot: tstarling synchronized php-1.19/includes/api/ApiQueryLogEvents.php 'revert live hack'
00:20 logmsgbot: tstarling synchronized php-1.19/includes/api/ApiQueryRecentChanges.php

February 27

23:46 logmsgbot: catrope synchronized php-1.19/includes/api/ApiQueryLogEvents.php 'Attempted bugfix'
23:17 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: nlwiki to 1.19
22:54 logmsgbot: reedy synchronized php-1.19/includes/MessageBlobStore.php 'r112536'
22:45 RobH: dns update for manganese server
22:33 logmsgbot: reedy synchronized php-1.19/includes/ 'r112532'
22:31 binasher: new s4 master pos - MASTER_LOG_FILE='db31-bin.000253', MASTER_LOG_POS=457980068
22:30 logmsgbot: asher synchronized wmf-config/db.php 'done s4 switch'
22:29 binasher: switching s4 master to db31
22:29 logmsgbot: asher synchronized wmf-config/db.php 'switching s4 master to db31, setting read-only'
22:26 logmsgbot: reedy synchronized php-1.19/extensions/UploadWizard/UploadWizard.i18n.php 'r112531'
22:16 RobH: cadmium locked up, rebooting
22:14 binasher: running 1.19 schema migration script to get former s5, s6, s1 masters (db45, db47, db36)
22:09 binasher: new s1 (enwiki) master pos - MASTER_LOG_FILE='db38-bin.000129', MASTER_LOG_POS=255719721
21:56 logmsgbot: asher synchronized wmf-config/db.php 'done s1 switch'
21:55 logmsgbot: asher synchronized wmf-config/db.php 'swapping s1 enwiki master to db38, setting read-only'
21:53 binasher: preparing to swap enwiki master, it will be read only for a couple minutes
21:52 logmsgbot: catrope synchronized php-1.19/extensions/ProofreadPage/proofread.js 'r112522'
21:49 logmsgbot: catrope synchronized php-1.19/extensions/ProofreadPage/proofread.js 'r112522'
21:15 logmsgbot: nikerabbit synchronized php-1.18/resources/startup.js 'touch'
21:02 binasher: db1006 (s6-secondary) is still slaving from db47 - it's very behind post hw failure. need to manually swap to db43 once caught up
21:02 binasher: new s6 master pos - MASTER_LOG_FILE='db43-bin.000027', MASTER_LOG_POS=577074024
21:01 logmsgbot: asher synchronized wmf-config/db.php 'done s6 master swap'
20:59 logmsgbot: asher synchronized wmf-config/db.php 'swapping s6 master to db43, setting read-only'
20:59 binasher: preparing to switch s6 master
20:42 binasher: powercycled db1006 after finding nothing on the serial console. booted without issue, then started mysql.
20:41 logmsgbot: hashar synchronized php-1.19/extensions/CodeReview/backend/DiffHighlighter.php 'Special:CodeReview merge r112513 fix bug 27375'
20:31 binasher: new s5 master pos - MASTER_LOG_FILE='db35-bin.000011', MASTER_LOG_POS=374074061
20:27 logmsgbot: asher synchronized wmf-config/db.php 'done s5 master swap'
20:25 logmsgbot: asher synchronized wmf-config/db.php 'swapping s5 master to db35, setting read-only'
20:22 logmsgbot: hashar synchronized php-1.19/extensions/CodeReview/backend/DiffHighlighter.php 'Special:CodeReview fix HTML entities showing up in diff output r112459 r112464'
20:18 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Put oldwikisource/sourceswiki on 1.19wmf1
19:42 RobH: adjusted threshholds for ps1-b4-sdtpa.mgmt.pmtpa.wmnet again, bottom sensor set to high
18:32 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'I18ndeploy config changes: bug 33423 bug 34591'
18:26 logmsgbot: nikerabbit synchronized php-1.19/extensions/WebFonts/ 'i18ndeploy r112498'
18:26 logmsgbot: nikerabbit synchronized php-1.19/extensions/Narayam/ 'i18ndeploy r112498'
18:25 logmsgbot: nikerabbit synchronized php-1.19/languages/messages/ 'i18ndeploy r112498'
18:18 logmsgbot: nikerabbit synchronized php-1.18/extensions/WebFonts/ 'i18ndeploy r112497'
18:18 logmsgbot: nikerabbit synchronized php-1.18/extensions/Narayam/ 'i18ndeploy r112497'
18:17 logmsgbot: nikerabbit synchronized php-1.18/languages/messages/ 'i18ndeploy r112497'
18:01 logmsgbot: midom synchronized wmf-config/db.php
17:26 mark: Denying POST / requests on frontend squids
17:10 RobH: blog plugins updated, blog puppet config updated to support unzip package
16:55 RobH: blog updated to newest version
10:23 logmsgbot: hashar synchronized php-1.19/includes/Pager.php '(bug 34736) empty limit on special pages causes navigation issues'
03:02 Ryan_Lane: restarting varnish on arsenic
02:53 Ryan_Lane: moving bits traffic to pmtpa
02:44 Ryan_Lane: restarted varnish on niobium and arsenic
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Mon Feb 27 02:34:02 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Mon Feb 27 02:17:32 UTC 2012

February 26

02:34 logmsgbot: LocalisationUpdate completed (1.19) at Sun Feb 26 02:34:55 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Sun Feb 26 02:17:55 UTC 2012

February 25

02:33 logmsgbot: LocalisationUpdate completed (1.19) at Sat Feb 25 02:33:27 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Sat Feb 25 02:17:41 UTC 2012
01:53 RoanKattouw: Started transcode jobs on cadmium, 16 parallel jobs running in screen
01:45 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/FileBackend.php 'deployed r112382'
01:44 logmsgbot: aaron synchronized php-1.19/includes/StreamFile.php 'deployed r112382'
01:34 RoanKattouw: Installing screen on cadmium
01:26 RoanKattouw: Installing ffmpeg2theora on cadmium
01:25 RoanKattouw: Creating a 'wikimaniatranscode' user locally on cadmium because I don't really want to run ffmpeg as root
01:25 LeslieCarr: reloading cp1043 again
01:22 LeslieCarr: cp1043 is missing /var/lib/varnish/frontend
01:17 LeslieCarr: rebooted cp1043
01:09 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Remove outdated itwiki lockdown bypass code'
01:01 logmsgbot: aaron synchronized php-1.19/includes/StreamFile.php 'deployed r112379'
00:44 logmsgbot: aaron synchronized php-1.19/includes/filerepo/backend/FileBackend.php 'deployed r112377'
00:27 notpeter: starting indexer on searchidx2
00:18 notpeter: stopping indexer on searchidx2 again :/

February 24

23:12 LeslieCarr: restarted apache on singer again
23:10 notpeter: restarting indexer and lucene on searchidx2
23:05 LeslieCarr: Server should be SSL-aware but has no certificate configured [Hint: SSLCertificateFile] ((null):0)
23:03 LeslieCarr: reloading apache2 on singer
23:03 LeslieCarr: pushing new apache conf file to singer for secure.wikimedia.org - may impact performance of secure site
22:42 logmsgbot: aaron synchronized php-1.19/includes/specials/SpecialContributions.php 'deployed r112366'
22:39 LeslieCarr: reloaded apache2 on stafford (puppetmaster)
22:05 logmsgbot: aaron synchronized php-1.19/extensions/FlaggedRevs/frontend/modules/ext.flaggedRevs.review.js 'deployed r112361'
18:24 logmsgbot: aaron synchronized multiversion/MWVersion.php
18:18 logmsgbot: aaron synchronized multiversion/MWMultiVersion.php 'deployed r112335'
18:04 logmsgbot: aaron synchronized multiversion 'deployed all changes through HEAD'
17:58 logmsgbot: catrope synchronized php-1.19/LocalSettings.php 'Guard against /home/wikipedia not existing'
17:51 RoanKattouw: And of course I can't commit this because the code in /h/w/common/multiversion hasn't been updated this calendar year and there are undeployed commits from January *grumble*
17:47 notpeter: stopping indexing on searchidx2 to rsync over a clean copy of index to searchidx1001
17:45 logmsgbot: catrope synchronized multiversion/MWVersion.php 'Add file_exists check for /home before trying to access /home'
13:40 mark: Fixed directory permissions of /srv/swift-storage/{sda4,sdb4} on ms-be1
13:35 mark: Copied swift ring builder files from ms-fe1 to all swift hosts
12:02 logmsgbot: tstarling synchronized php-1.19/includes/PathRouter.php 'r112316'
12:02 logmsgbot: tstarling synchronized php-1.19/includes/AutoLoader.php 'r112316'
11:51 mark: Rebalanced swift rings account, container, object after adding ms-be5
11:49 mark: Added all devices on ms-be5 into the swift rings, new zone 5, weight 100
11:27 mark: Manually preparing swift filesystems sda4 and sdb4 on ms-be5
11:00 mark: Reinstalling ms-be5 to correct partitioning of sda and sdb
10:15 mark: Doing first puppet run on ms-be5
08:48 mark: Restarted apache on stafford to fix puppetmaster
05:32 K4-713: synchronized payments cluster to r112287
05:18 logmsgbot: tstarling synchronized php-1.19/extensions/SecurePoll/includes/pages/Page.php 'r112301'
05:18 logmsgbot: tstarling synchronized php-1.19/extensions/LandingCheck/SpecialLandingCheck.php 'r112301'
05:17 logmsgbot: tstarling synchronized php-1.19/extensions/DonationInterface/globalcollect_gateway/globalcollect.adapter.php 'r112301'
04:32 logmsgbot: tstarling synchronized php-1.19/includes/api/ApiFeedContributions.php
04:00 Tim: cleaning up /tmp on all apaches
03:57 logmsgbot: tstarling synchronized php-1.19/extensions/skins/Donate/Donate.class.php
03:56 logmsgbot: tstarling synchronized php-1.19/extensions/skins/Tomas/Tomas.class.php
03:56 logmsgbot: tstarling synchronized php-1.19/extensions/skins/Schulenburg/Schulenburg.class.php
03:49 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move wikisource over to 1.19
03:39 logmsgbot: tstarling synchronized php-1.19/extensions/ContributionTracking/ContributionTracking_body.php 'r112294'
03:36 Tim: cleaned up /tmp on mw35
03:36 logmsgbot: reedy synchronized php-1.19/extensions/DoubleWiki/DoubleWiki_body.php 'r112292'
03:15 logmsgbot: reedy synchronized wmf-config/CommonSettings.php
03:13 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix kowiki namespace aliases'
03:09 logmsgbot: tstarling synchronized php-1.19/includes/db/DatabaseMysql.php 'debugging patch with trigger_error'
03:07 logmsgbot: tstarling synchronized php-1.19/includes/db/DatabaseMysql.php 'debugging patch for array parameter warning'
03:06 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move specials over to 1.19
02:58 Tim: re-enabling wmerrors
02:39 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move wikitionarys over to 1.19
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Fri Feb 24 02:34:38 UTC 2012
02:32 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move wikibooks over to 1.19
02:24 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Move wikiquotes over to 1.19
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Fri Feb 24 02:18:10 UTC 2012
02:13 logmsgbot: reedy synchronized php-1.19/includes/api/ApiParamInfo.php 'r112291'
02:11 RoanKattouw: Installing nfs-kernel-server on cadmium
02:04 maplebed: deployed updated rewrite.py to swift to pass through error codes and error messages it gets from the back end during 404 handling.
02:03 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Wikimedia wikis to 1.19
01:48 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: switching all wikinewses to 1.19
01:44 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: and hewikisource
01:44 RoanKattouw: Installing ffmpeg on cadmium
01:44 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: reverted wikisources back to 1.18 except frwikisource
01:17 logmsgbot: aaron synchronized php-1.19/extensions/FlaggedRevs 'deployed r112284'
01:03 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: switching wikisource to 1.19 except for wikis with FlaggedRevs enabled
00:47 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Put all wikiversity on 1.19wmf1
00:24 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: wikinews back to 1.18
00:23 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: wikinews back to 1.18
00:19 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'reduce startup module expiry time for all projects except wikipedia'
00:19 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'reduce startup module expiry time for all projects except wikipedia'
00:05 K4-713: synchronized payments cluster to r112275

February 23

23:54 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Put all wikinews' on 1.19wmf1
23:35 logmsgbot: reedy synchronizing Wikimedia installation... :
23:13 logmsgbot: catrope synchronized php-1.19/extensions/ClickTracking 'r112264'
23:13 logmsgbot: catrope synchronized php-1.19/extensions/WikiEditor 'r112264'
23:12 logmsgbot: aaron synchronized php-1.19/extensions/OggHandler 'deployed r112264'
23:12 logmsgbot: catrope synchronized php-1.18/extensions/ClickTracking 'r112264'
23:12 logmsgbot: catrope synchronized php-1.18/extensions/WikiEditor 'r112264'
22:35 logmsgbot: reedy synchronized php-1.19/resources/startup.js 'touch'
22:35 logmsgbot: reedy synchronized php-1.19/extensions/UploadWizard/ 'r112256'
22:14 logmsgbot: reedy synchronized php-1.19/includes/resourceloader/ResourceLoaderStartUpModule.php 'r112249'
22:13 logmsgbot: reedy synchronized php-1.19/resources/mediawiki/mediawiki.feedback.css 'r112249'
22:07 Ryan_Lane: enabling LiquidThreads on labsconsole
22:02 maplebed: fixing permissions in /var/spool on brewster
21:46 RobH: rebooting cadmium for pxe test, not reinstalling it.
21:22 mark: Moved /var/spool/squid to its own LV on brewster
21:02 logmsgbot: aaron synchronized php-1.18/StartProfiler.php 'require() wmf-config profiler'
21:01 logmsgbot: aaron synchronized wmf-config/StartProfiler.php 'added empty arrays to constructors as needed'
21:00 logmsgbot: aaron synchronized php-1.19/StartProfiler.php
20:55 logmsgbot: aaron synchronized php-1.19/StartProfiler.php
20:54 logmsgbot: aaron synchronized php-1.19/StartProfiler.php 'require() wmf-config profiler'
20:50 logmsgbot: aaron synchronized wmf-config/StartProfiler.php 'Sample thumb.php traffic properly. Removed 'bigpage' and 'incubatorslowness' hacks. Split thumbnail group by MW version and added some more code comments.'
20:42 logmsgbot: aaron synchronized wmf-config/StartProfiler.php 'updated profiling class code paths to post-1.17 locations'
20:34 logmsgbot: aaron synchronized wmf-config/StartProfiler.php 'Sample thumb.php traffic properly. Removed 'bigpage' and 'incubatorslowness' hacks. Split thumbnail group by MW version and added some more code comments.'
20:01 logmsgbot: reedy synchronized php-1.19/extensions/UploadWizard/includes/specials/SpecialUploadWizard.php 'r112235'
19:55 logmsgbot: aaron synchronized wmf-config/StartProfiler.php 'removed commented out code and w/s cleanups'
18:34 logmsgbot: aaron synchronized php-1.19/includes/parser/Parser.php 'disabled per-template profiling'
17:26 RobH: db15 shutting down for memtest
15:45 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34401 - Change sitename and a namespace for Inuktitut Wikipedia'
15:43 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34401 - Change sitename and a namespace for Inuktitut Wikipedia'
15:42 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34401 - Change sitename and a namespace for Inuktitut Wikipedia'
14:56 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34407 - New aliases for ko.wikipedia'
14:52 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34566 - Modify the URL to upload files in eswiki'
14:50 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bump AF lottery odds for ptwikipedia'
14:47 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34476 - Set $wgLogo value for fawikisource'
14:45 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34570 - Set wgBabelMainCategory to false for dewiki'
14:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34615 - Disable Moodbar on Tamil Wikipedia'
14:40 logmsgbot: reedy synchronized wmf-config/flaggedrevs.php 'Bug 33273 - Enable FlaggedRevs on Ukrainian Wikipedia'
11:26 logmsgbot: tstarling synchronizing Wikimedia installation... : scap speed test
09:36 logmsgbot: tstarling synchronized php-1.18/includes/LocalisationCache.php 'revert live hack'
02:37 logmsgbot: LocalisationUpdate completed (1.19) at Thu Feb 23 02:37:12 UTC 2012
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Thu Feb 23 02:18:24 UTC 2012
01:37 K4-713: Updated and synchronized fraud prevention settings on the payments cluster.
01:04 logmsgbot: reedy synchronized php-1.19/resources/ 'r112174'
00:34 LeslieCarr: reinstalling neon

February 22

23:54 logmsgbot: reedy synchronized php-1.19/extensions/UploadWizard/resources/mw.UploadWizard.js 'r112167'
23:42 Ryan_Lane: restarting opendj on virt0
23:37 Tim: fixed ownership on /mnt/upload6/wikimedia/rs
23:37 logmsgbot: reedy synchronized php-1.19/extensions/WikiEditor/ 'r112164'
23:35 logmsgbot: reedy synchronized php-1.19/extensions/Vector/Vector.php 'r112164'
23:35 logmsgbot: reedy synchronized php-1.19/extensions/UploadWizard/resources/mw.fileApi.js 'r112164'
23:34 logmsgbot: reedy synchronized php-1.19/maintenance/language/ 'r112162'
23:32 logmsgbot: reedy synchronized php-1.19/includes/SkinTemplate.php 'r112162'
22:21 logmsgbot: aaron synchronized wmf-config/swift.php 'avoid notices'
22:19 logmsgbot: aaron synchronized wmf-config/swift.php
22:06 logmsgbot: aaron rebuilt wikiversions.cdb and synchronized wikiversions files: commonswiki -> 1.19
21:52 RobH: dataset1001 eth1 connected
21:18 logmsgbot: catrope synchronized php-1.19/thumb.php 'Cleanup: rename fake repo and add comments'
21:08 K4-713: updated the payments cluster to r112145
20:55 logmsgbot: catrope synchronized php-1.19/thumb.php 'And let's try that again'
20:54 logmsgbot: reedy synchronized php-1.19/resources/startup.js 'touch'
20:49 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'UW config for test2wiki'
20:45 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable UW on test2wiki'
20:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Enable UW on test2wiki'
20:43 Reedy: Created uploadwizard campaign related tables on test2wiki
20:35 LeslieCarr: flushed iptables on stafford - all puppet runs shoudl now work
19:49 logmsgbot: catrope synchronized php-1.19/thumb.php 'pass in a Title object to UnregisteredLocalFile'
19:46 logmsgbot: catrope synchronized php-1.19/thumb.php 'more logging'
19:41 logmsgbot: catrope synchronized php-1.19/thumb.php 'more logging'
19:39 logmsgbot: catrope synchronized php-1.19/thumb.php 'Add logging for no path supplied error'
19:39 LeslieCarr: blocking all new puppet connections on all hosts except neon
19:35 logmsgbot: catrope synchronized php-1.19/thumb.php 'use temp path correctly'
19:31 logmsgbot: catrope synchronized php-1.19/thumb.php 'readd debugging for 404s'
19:30 logmsgbot: catrope synchronized php-1.19/thumb.php 'disable debugging'
19:27 logmsgbot: catrope synchronized php-1.19/thumb.php 'attempt at debugging'
19:24 LeslieCarr: removing old wap (mobile) site from ekrem as it hasn't been accessed in a day
19:18 logmsgbot: catrope synchronized php-1.19/thumb.php 'missing name key'
19:16 logmsgbot: catrope synchronized php-1.19/thumb.php 'missing global'
19:13 logmsgbot: aaron synchronized php-1.19/includes/logging/LogFormatter.php 'deployed r112136'
19:05 RoanKattouw_away: running sync-common on srv256 because Aaron gets key errors for that box
18:48 logmsgbot: aaron synchronizing Wikimedia installation... : deploying r112128
18:36 Ryan_Lane: shutting down labstore1
17:04 notpeter: increasing mem for java to 3300 on pmtpa search hosts
16:09 Jeff_Green: restarted lsearchd on search3 and search9, was running but nonresponsive
16:09 Jeff_Green: restarted lsearchd on search15, was not running
15:34 notpeter: extending database wikiadmin user grants to 10.64.0.0/255.255.252.0
10:09 logmsgbot: hashar synchronized php-1.19/extensions/CodeReview/backend/DiffHighlighter.php 'r112098 - (bug 34554) diff chunk fail to parse file add/rm'
09:45 Tim: on fenair: stopped apache again due to overload. Restarted it with reduced MaxClients
09:34 logmsgbot: tstarling synchronizing Wikimedia installation... :
09:23 Tim: on fenari: started apache
09:09 logmsgbot: tstarling synchronizing Wikimedia installation... :
09:05 logmsgbot: tstarling synchronizing Wikimedia installation... :
09:02 logmsgbot: tstarling synchronizing Wikimedia installation... :
08:07 logmsgbot: ariel rebuilt wikiversions.cdb and synchronized wikiversions files: fix dewikiversity typo in wikiversions file
06:49 logmsgbot: tstarling synchronized php-1.19/languages/messages/MessagesEn.php 'test change for manualRecache'
06:33 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'enabling manual recache'
06:26 Tim: installed new scap manually since puppet is broken
05:51 logmsgbot: tstarling synchronizing Wikimedia installation... :
05:18 logmsgbot: catrope synchronized php-1.19/thumb.php 'Experimental fix for UploadStash thumbs in 1.19'
05:14 Tim: started xinetd
04:34 logmsgbot: catrope synchronized php-1.19/thumb.php 'Experimental fix for 1.19 UploadWizard thumb issue'
04:17 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files: rolling back commons to 1.18
03:48 logmsgbot: catrope synchronized php-1.19/skins/common/shared.css 'r112081'
03:25 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files:
03:25 Tim: switching commons back to 1.19
03:18 logmsgbot: tstarling synchronized php-1.19/includes/LocalisationCache.php
03:18 logmsgbot: tstarling synchronized php-1.19/includes/LocalisationCache.php
03:10 Tim: on fenari: NFS overload, killed apache and xinetd
03:03 RoanKattouw: Manually started ircecho on fenari ; why doesn't this happen upon boot? Why didn't puppet start it?
02:57 RoanKattouw: Synced php-1.19/includes/MessageBlobStore.php to disable ::clear() ; where's the logging bot?
02:53 LeslieCarr: manually lowering nagios max checks to 300
02:48 Tim: rebooted fenari, nonresponsive
02:46 LeslieCarr: reset the drac console for spence
02:21 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:18 logmsgbot: tstarling synchronizing Wikimedia installation... :
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Wed Feb 22 02:18:08 UTC 2012
02:15 Tim: testing new scap script ~tstarling/bin/scap-new
01:57 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/specials/ 'r112075'
01:18 logmsgbot: tstarling rebuilt wikiversions.cdb and synchronized wikiversions files:
01:18 Tim: reverting to 1.18 on commons due to DB overload
01:09 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Commonswiki to 1.19wmf1
01:00 logmsgbot: reedy synchronized php-1.19/includes/ 'r112073'
00:59 logmsgbot: reedy synchronized php-1.19/languages/messages/MessagesEn.php 'r112073'
00:51 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'reducing cache expiry for unversioned resources on commons'
00:51 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'reducing cache expiry for unversioned resources on commons'
00:41 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 'reducing cache expiry for unversioned resources on commons'

February 21

22:44 logmsgbot: catrope synchronized php-1.19/includes/resourceloader/ 'r112055'
22:20 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Comment out hack that enabled $wgResourceLoaderExperimentalAsyncLoading for logged-in users'
21:36 Ryan_Lane: force-running puppet on every labs instance
17:35 logmsgbot: reedy synchronized php-1.18/extensions/FeaturedFeeds/FeaturedFeeds.body.php 'r112029'
17:16 notpeter: reimaging searchidx1001 :(
17:10 logmsgbot: reedy synchronized php-1.19/includes/ 'r112024'
17:08 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/CentralAuthHooks.php 'r112023'
17:08 logmsgbot: reedy synchronized php-1.19/extensions/FeaturedFeeds/ 'r112023'
16:50 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34560 - Moodbar on ta.wikipedia'
15:39 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/CentralAuth.php 'Enable AntiSpooof for CentralAuth on all 1.19 wikis again, doesn't break signup with a mass of fail'
15:37 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth/CentralAuth.php 'Enable AntiSpoof for CentralAuth on testwiki only'
04:42 Tim: on db40: setting innodb-use-purge-thread=4 to test multithreaded purge
04:12 notpeter: disabling search lvs1 check because it's going to false-positive in 4 hours...
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Tue Feb 21 02:34:36 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Tue Feb 21 02:17:36 UTC 2012

February 20

22:14 logmsgbot: hashar synchronized php-1.19/includes/logging/PatrolLog.php 'r111969 - bug 34495 patrol log credit the user patrolled, not the user patrolling'
21:10 rainman-sr: shut down lucene on search15, comes up with some strange errors "Connection refused to host: 10.0.3.15"
18:46 logmsgbot: aaron synchronized php-1.19/includes/RevisionList.php 'deployed r111952'
18:26 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Narayam on knwiki; bug 34516'
18:19 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Narayam on mrwiki, mrwikisource; bug 32669, bug 34454'
18:17 logmsgbot: nikerabbit synchronized php-1.19/extensions/Narayam/ 'i18ndeploy r111946'
18:16 logmsgbot: nikerabbit synchronized php-1.18/extensions/Narayam/ 'i18ndeploy r111945'
17:22 notpeter: stopping puppet on brewster
15:56 notpeter: initial test-spinup of searchidx1001 and search1001-1006 (en cluster)
14:56 logmsgbot: hashar synchronized php-1.19/skins/simple/main.css 'r111580 for Bug 34397: align footer so that it does not overlap with sidebar in Simple skin'
13:36 logmsgbot: hashar synchronized php-1.19/includes/UserMailer.php 'r111925 for bug 34421 duplicate Subject / wrong To: headers in mail'
03:06 notpeter: re-enabling notifications for search-pool1 and search-pool3, search-pool2 still flapping very badly
02:36 logmsgbot: LocalisationUpdate completed (1.19) at Mon Feb 20 02:36:36 UTC 2012
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Mon Feb 20 02:18:57 UTC 2012

February 19

13:47 notpeter: disabling notifications for search lvs... if anyone still has their phone on
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Sun Feb 19 02:34:11 UTC 2012
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Sun Feb 19 02:17:17 UTC 2012

February 18

15:18 logmsgbot: reedy synchronized php-1.19/extensions/CentralAuth
15:17 logmsgbot: reedy synchronized php-1.19/extensions/AntiSpoof/
14:31 logmsgbot: reedy synchronized php-1.19/includes/actions/HistoryAction.php 'r111828'
10:41 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
10:41 logmsgbot: catrope synchronized php-1.19/resources/mediawiki/mediawiki.user.js 'touch'
10:40 logmsgbot: catrope synchronized php-1.19/resources/mediawiki/mediawiki.js 'touch'
08:56 apergos: restarted all the searchpool1 lsearchds
02:35 logmsgbot: LocalisationUpdate completed (1.19) at Sat Feb 18 02:35:12 UTC 2012
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Sat Feb 18 02:18:33 UTC 2012
01:59 logmsgbot: aaron synchronized php-1.19/extensions/CentralAuth/CentralAuth.php 'disabled AntiSpoof hooks which broken account creation with DB errors'
01:00 logmsgbot: andrew synchronizing Wikimedia installation... :
00:58 Andrew: Running scap to ensure a consistent environment
00:51 logmsgbot: andrew synchronized php-1.19/resources/mediawiki/mediawiki.user.js 'Attempt to repush r111695'
00:44 logmsgbot: andrew synchronized php-1.19/resources/Resources.php 'Deploy r111809'
00:43 logmsgbot: aaron synchronized php-1.19/resources/mediawiki.special/mediawiki.special.preferences.js 'deployed r111808'
00:21 logmsgbot: andrew synchronized php-1.19/extensions/Vector/modules/ext.vector.collapsibleNav.js 'Deploy r111806'
00:16 logmsgbot: andrew synchronized php-1.19/extensions/Vector/modules/ext.vector.collapsibleNav.js 'Deploy r111804'

February 17

22:51 logmsgbot: andrew synchronized php-1.19/includes/actions/HistoryAction.php 'Deploy r111800'
22:22 logmsgbot: andrew synchronized php-1.19/includes/Linker.php 'deploy r111798'
21:30 notpeter: remounted nfs mounts on searchidx2. to protect our house of cards
21:03 notpeter: unmounting ms nfs mounts from searchidx2
20:44 notpeter: restarting puppet on brewster
20:33 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Bug 34235 - Enable WebFonts on am.wikipedia'
20:13 maplebed: sending thumbnail traffic back to swift; head bug is fixed.
20:10 RobH: reinstalling search1001 and searchidx1001
19:53 RobH: search1001 down for reinstall
19:37 RobH: ran dsh command to remove all the /tmp/mw-cache-1.17
19:27 LeslieCarr: manually cleaned up tmp on mw41
19:27 RobH: manually cleaned up tmp on mw21
19:01 LeslieCarr: changed sockpuppet's post-merge hook so that you need to have ssh keys forwarded (though you really would anyways due to brokenness)
18:51 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php
18:50 logmsgbot: reedy synchronized wmf-config/ 'Sync ExtensionMessages'
18:44 logmsgbot: reedy synchronized php-1.19/extensions/WikimediaMessages/
18:43 logmsgbot: reedy synchronized php-1.18/extensions/WikimediaMessages/
18:22 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php
18:19 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php 'force index'
18:06 maplebed: sending thumbnail traffic back to ms5, taking swift out of production
18:00 notpeter: temporarily stopping puppet on brewster. please let me know if you need to turn it back on
17:52 maplebed: changing squids to send 100% of thumbnail traffic to swift
16:58 maplebed: turned swift live for 50% of all thumbnail requests
15:02 logmsgbot: hashar synchronized php-1.19/LocalSettings.php 'make jobrunners for testwiki to use the apache CommonSettings file instead of the non existant /home/wikipedia one'
14:59 logmsgbot: reedy synchronized php-1.19/includes/specials/SpecialDeletedContributions.php 'r111752'
14:58 logmsgbot: hashar synchronized php-1.19/LocalSettings.php
14:40 logmsgbot: hashar synchronized php-1.19/LocalSettings.php 'debug statement with argv join'
14:37 logmsgbot: hashar synchronized php-1.19/LocalSettings.php 'syslog debug statement to investigate TESTWIKI issue'
13:40 ^demon: srv233: removed /tmp/mw-cache-1.17 to give it a little more space for now
09:58 mark: Shutdown ragweed for decommissioning
07:42 binasher: upgraded mysql on db40 to 5.1.53-facebook-r3753, enabled innodb_use_purge_thread
05:39 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php
05:38 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php
05:35 Tim: on db40: reduced to 10M, should be causing massive delays, but the site's not down and the purge rate is lower if anything. Going to disable the mysql parser cache entirely.
05:25 Tim: on db40: purge lag is still increasing at 108 per second, so reducing innodb_max_purge_lag to 50M
05:21 Tim: on db40: giving the innodb manual the benefit of the doubt and following its advice, setting innodb_max_purge_lag to 100M, which should give a delay of 4.5ms
05:13 Tim: killing purgeParserCache.php since it is probably doing more harm than good
02:43 maplebed: deployed updated thumb_handler.php to ms5 to include Content-Length in generated images
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Fri Feb 17 02:34:32 UTC 2012
02:29 Ryan_Lane: installed labstore1-4
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Fri Feb 17 02:17:35 UTC 2012
02:16 Tim: on db40: truncating pc008 - 15
02:01 binasher: db1035 is replicating again
01:12 Ryan_Lane: re-enabled the mobile plugin for the blogs, seems w3 total cache supports varying
01:03 Ryan_Lane: disabling mobile skin for the blogs - we need to fix varnish support first
00:38 Ryan_Lane: fixed singer by adding in ssl configuration to the planet configuration

February 16

23:53 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Tweak live hack so async loading is enabled for all logged-in users on all 1.19 wikis'
23:10 binasher: truncated 4 tables on db40
23:10 logmsgbot: catrope synchronized php-1.19/resources/startup.js 'touch'
23:09 logmsgbot: catrope synchronized php-1.19/resources/mediawiki/mediawiki.js 'r111699, r111700'
23:09 logmsgbot: catrope synchronized php-1.19/includes/resourceloader/ 'r111699'
23:04 binasher: db1035 is fubar after crashing during schema migrations, running a hotbackup from db1019
22:59 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Also give User:Cmcmahon experimental async loading on meta'
22:37 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Add live hack to enable $wgResourceLoaderExperimentalAsyncLoading on meta only for me (User:Catrope)'
22:35 binasher: db1035 died 2 days ago, attempting to power cycle
22:26 binasher: adding search15 to search-pool2 lvs vip
22:23 Ryan_Lane: restarting pdns ns2
22:17 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Enable wgResourceLoaderExperimentalAsyncLoading on test2wiki'
21:45 apergos: singer certificate issues, looks like
21:40 logmsgbot: reedy synchronized php-1.19/extensions/Vector/modules/ext.vector.collapsibleNav.js 'r111687'
21:18 apergos: the most recent apache update (thanks puppet) must have broke things on singer. the url.wm.o config wants /srv/org/wikimedia/url/ but I have no idea what that service ever did or what is supposed to be in there. will someone who knows this undocumented information please check it? thanks.
21:16 notpeter: stopping mysql and apache on searchidx2... not sure why they are there. also, going to clean up some packages... like the ubuntu version of mediawiki
20:29 logmsgbot: reedy synchronized php-1.19/includes/api/ApiQueryAllUsers.php 'r111675'
19:45 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php 'pushing comment changes :)'
19:40 LeslieCarr: running /etc/network/if-up.d/initcwnd on the apaches
19:36 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php
19:30 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php
19:29 AaronSchulz: doing some debugging for bug 34451
19:29 logmsgbot: aaron synchronized php-1.18/includes/api/ApiQueryAllUsers.php
19:26 logmsgbot: aaron synchronized php-1.19/includes/api/ApiQueryAllUsers.php
19:25 logmsgbot: aaron synchronized php-1.19/includes/api/ApiQueryAllUsers.php
19:04 apergos: restarted lighty on dataset2, silly thing
19:02 LeslieCarr: restarted pdns on ns0
18:37 LeslieCarr: reverting $lang.wap.wikipedia.org dns changes
18:33 LeslieCarr: updating $lang.wap.wikipedia.org dns to point to mobile-lb.eqiad.wikimedia.org
17:31 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: betawikiversity to 1.19wmf1
17:23 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: hewikisource to 1.19wmf1
17:21 logmsgbot: hashar rebuilt wikiversions.cdb and synchronized wikiversions files: moving eowiki to 1.19
16:58 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: frwikisource to 1.19wmf1
16:55 logmsgbot: reedy synchronized php-1.18/extensions/FeaturedFeeds/ 'r111650'
16:53 RobH: rebooting sq35 & sq38, serial console blank
16:30 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: enwikibooks and enwikiquote to 1.19wmf1
13:25 Reedy: Running cleanupUploadStash.php across all wikis
12:14 mark: Shutdown lily for decommissioning
11:47 mark: Moved udpmcast unicast-to-multicast HTCP relay from lily to hooft
09:49 Tim: on db40: truncated pc004, pc005, pc006, pc007
09:46 logmsgbot: hashar synchronized wmf-config/swift.php 'Add a wfDebugLog call for bug 34440: swift list_objects giving InvalidResponseException'
05:21 Tim: on hume: running mwscript purgeParserCache.php --wiki=enwiki --age=7776000
05:19 Tim: truncated pc000, pc001, pc002, pc003
05:11 Tim: on db40: truncating a few shards to free up space for the OS
04:29 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php
04:16 logmsgbot: tstarling synchronized php-1.18/StartProfiler.php
03:43 logmsgbot: tstarling synchronized wmf-config/db.php 'restored db12 in query groups now that the schema changes have finished'
02:46 logmsgbot: aaron synchronized php-1.19/extensions/Translate/TranslateHooks.php 'live hack to deal with 500s on log/RC views'
02:34 logmsgbot: LocalisationUpdate completed (1.19) at Thu Feb 16 02:34:30 UTC 2012
02:32 logmsgbot: asher synchronized wmf-config/StartProfiler.php 'setting 1.18 wiki profiling id to all'
02:25 logmsgbot: asher synchronized wmf-config/db.php 'moving watchlist etc from db12 to db53'
02:21 logmsgbot: tstarling synchronized php-1.18/StartProfiler.php
02:18 logmsgbot: tstarling synchronized php-1.18/StartProfiler.php
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Thu Feb 16 02:18:23 UTC 2012
02:18 logmsgbot: tstarling synchronized php-1.19/StartProfiler.php
02:15 logmsgbot: tstarling synchronized php-1.19/StartProfiler.php
02:14 logmsgbot: tstarling synchronized php-1.19/StartProfiler.php
02:11 logmsgbot: tstarling synchronized wmf-config/StartProfiler.php 'split out 1.19'
02:06 logmsgbot: reedy synchronized php-1.19/includes/filerepo/backend/FileBackend.php 'Add wfProfileOut( __METHOD__ )'
01:59 logmsgbot: aaron synchronized php-1.19/includes/parser/CoreParserFunctions.php 'fixed profiling calls'
01:56 binasher: 1.19 schema migraitons now running on enwiki slaves
01:35 Ryan_Lane: rebooting formey
01:33 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php
01:31 Ryan_Lane: distupgrading formey
01:29 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php 're-enabled WikimediaLicenseTexts'
01:28 logmsgbot: tstarling synchronized php-1.19/extensions/WikimediaMessages/WikimediaLicenseTexts.i18n.php
01:08 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Just disable wmgUseWikimediaLicenseTexts for testwiki'
01:08 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Scrap that'
01:04 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Disable WikimediaLicenseTexts for the time being'
00:55 Ryan_Lane: rebooting prototype.wikimedia.org
00:50 Ryan_Lane: dist-upgrading prototype.wikimedia.org
00:49 logmsgbot: aaron synchronized extract2.php 'fixed access to protected Page field'
00:47 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: meta to 1.19wmf1
00:44 maplebed: starting to delete broken thumbnails from swift and squid. job running in a screen session on ms-fe1
00:38 logmsgbot: aaron synchronized live-1.5/robots.php 'fixed access to protected Page field'
00:35 binasher: changing search15 to run regular search-pool2 indexes instead of highlights
00:35 logmsgbot: tstarling synchronized php-1.19/includes/HistoryBlob.php
00:29 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: strategywiki, usabilitywiki, simplewiki and simplewiktionary to 1.19wmf1
00:24 logmsgbot: aaron synchronized php-1.19/includes/Article.php
00:16 logmsgbot: tstarling synchronized php-1.19/includes/HistoryBlob.php 'temp fix for checksum bug'

February 15

23:57 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: mw.org to 1.19wmf1
23:53 logmsgbot: reedy synchronized php-1.19/includes/OutputPage.php 'r111599'
23:52 logmsgbot: reedy synchronized php-1.19/resources/mediawiki/mediawiki.js 'r111599'
23:47 pgehres: re-enabled fundraising queue consumption after exorcism and rain dance
23:46 binasher: running a slow staggered restart of lsearchd
23:06 binasher: updated /etc/lsearch.conf:Rsync.path to "/usr/local/bin/rsync-no-pagecache" on all search nodes
23:06 binasher: installed pagecache-management on all search nodes
23:01 pgehres: disabling fundraising queue consumption on Aluminium due to jenkins build failure
20:56 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilt message lists and ExtensionMessages
20:50 logmsgbot: reedy synchronizing Wikimedia installation... : Pushing r111581 and making sure everything on the cluster is upto date
19:57 RoanKattouw: I meant l10n_cache* of course
19:54 RoanKattouw: Deleting /tmp/mw-cache-*/l10nupdate-* on the image scalers
19:49 logmsgbot: catrope synchronized php-1.18/includes/Cdb_PHP.php 'Live hack for logging filenames in CDB errors'
19:44 RoanKattouw: Syncing l10nupdate files for 1.19
19:27 RoanKattouw: Manually running the LU script for 1.19
19:25 RoanKattouw: Fixed perms for /h/w/c/php-1.19/cache/l10n on fenari
19:21 logmsgbot: LocalisationUpdate failed (1.19) at Wed Feb 15 19:21:54 UTC 2012
19:21 logmsgbot: LocalisationUpdate completed (1.18) at Wed Feb 15 19:21:53 UTC 2012
19:17 RoanKattouw: Let's try that again: copying /wd/Wikimania\ Edited to /a on cadmium
19:14 RoanKattouw: Aborted copy operation on cadmium, data won't fit
19:13 RoanKattouw: Copying all Wikimania files from the removable HD to cadmium's HD
19:05 RoanKattouw: Rerunning l10nupdate by hand to hopefully fix CDB problems
19:02 RoanKattouw: Fixing permissions for php-1.18/cache/l10n on all apaches as root
18:56 RoanKattouw: Running sync-l10nupdate by hand to see what kind of perms errors I get, first for 1.18 then for 1.19
17:01 RobH: cadmium setup for wikimania video transcoding
16:37 RobH: forgot to log, carbon resumed service normally
16:23 RobH: carbon halted, allows login and freezes on password entry, rebooting
15:53 RobH: os install on cadmium
15:49 RobH: updating dns for cadmium
15:06 logmsgbot: reedy synchronized php-1.19/extensions/SpamBlacklist/SpamBlacklistHooks.php 'r111543'
13:12 logmsgbot: hashar synchronized wmf-config/codereview.php
11:32 mutante: sync-apache / graceful not logged anymore by logmsgbot ?
10:06 mutante: office.wm now forces https (in a less broken way;) (remnant.conf)
09:42 mutante: made a new change to remnant.conf and synced apaches in a fresh attempt to fix office.wm redirect
08:59 Nirvanchik: test
08:13 mutante: all search boxes had /home: Stale NFS file handle.. remounting
08:03 mutante: remounted /home on search6, started lsearchd
02:37 Tim: on kaulen: re-enabled jsonrpc.cgi and reduced MaxClients from 500 to 100
02:18 logmsgbot: LocalisationUpdate failed (1.19) at Wed Feb 15 02:18:04 UTC 2012
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Wed Feb 15 02:18:04 UTC 2012
01:18 logmsgbot: reedy synchronized php-1.19/extensions/Gadgets/Gadgets_body.php
01:09 logmsgbot: reedy synchronized php-1.19/extensions/Gadgets/Gadgets_body.php
00:38 maplebed: deployed new thumb_handler.php with ETag header added in to ms5
00:11 logmsgbot: reedy synchronized php-1.19/extensions/Contest/Contest.php 'revert live hack'

February 14

23:52 logmsgbot: reedy synchronized wmf-config/flaggedrevs.php 'Enable FR like dewiki on test2wiki'
23:43 LeslieCarr: allowed ipv6 pim on edge routers in the US
23:25 logmsgbot: reedy synchronized php-1.18/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r111506'
23:21 LeslieCarr: modifying "martian" blocks on cr2-eqiad to allow newly allocated ip ranges
23:19 logmsgbot: reedy synchronized php-1.18/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r111506'
23:18 logmsgbot: reedy synchronized php-1.19/includes 'r111486'
22:53 logmsgbot: reedy synchronized php-1.19/includes 'r111486'
22:46 logmsgbot: reedy ran sync-common-all
22:45 maplebed: deployed squid config to upload to send all thumbnail traffic to ms5 instead of swift
22:33 Reedy: running ddsh -F5 -cM -g mediawiki-installation 'sudo -u mwdeploy rm -rf /usr/local/apache/common-local/php-1.17'
22:26 Reedy: Removing php-1.17 from fenari
22:10 binasher: restarted the 1.19 schema migration script - it's going to hit the just rotated s3 (db34), s2 (db30), s7 (db16), and s4 (db31) ex-masters before resuming s5 (db55) and all s6/s1 slaves
22:06 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Setting wmfUseRevSha1Columns'
21:55 Ryan_Lane: restarting ircecho on spence
21:48 binasher: new s4 master pos - MASTER_LOG_FILE='db22-bin.000030', MASTER_LOG_POS=964208442
21:48 logmsgbot: asher synchronized wmf-config/db.php 'setting s4 to writeable, new master is db22, db31 still out'
21:47 logmsgbot: asher synchronized wmf-config/db.php 'setting s4 to read-only, switching master to db22'
21:39 logmsgbot: asher synchronized wmf-config/db.php 'returning db30 to s3, going to wait til after schema migrations to upgrade to lucid/new-mysql'
21:36 binasher: new s2-master pos - MASTER_LOG_FILE='db13-bin.000278', MASTER_LOG_POS=599752853
21:36 logmsgbot: asher synchronized wmf-config/db.php 'setting s2 to writeable, db13 is new master'
21:34 logmsgbot: asher synchronized wmf-config/db.php 'setting s2 to read-only, switching master to db13'
21:29 logmsgbot: asher synchronized wmf-config/db.php 'db34 upgrading, returning to s3'
21:15 binasher: new s3 master position - MASTER_LOG_FILE='db39-bin.000550', MASTER_LOG_POS=63238699
21:15 logmsgbot: asher synchronized wmf-config/db.php 'returning s3 to writeable, db39 is the new master'
21:13 logmsgbot: asher synchronized wmf-config/db.php 'setting s3 to read-only, switching master to db39'
21:02 logmsgbot: asher synchronized wmf-config/db.php 'returning db16 after upgrading mysql'
20:58 binasher: new s7 repl position - MASTER_LOG_FILE='db37-bin.000285', MASTER_LOG_POS=865712092
20:52 logmsgbot: asher synchronized wmf-config/db.php 'setting s7 to read-only for master swap, db37 to be new master, db16 still out'
20:50 logmsgbot: asher synchronized wmf-config/db.php 'setting s7 to read-only for master swap, db37 to be new master'
20:45 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Fix hewiki namespace talk typo'
20:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix some mrwikisource aliases'
20:38 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix some mrwikisource aliases'
20:33 logmsgbot: hashar synchronized php-1.19/maintenance/purgeList.php 'r111480 : enable purge of HTTPS URLs'
20:29 mutante: purged https://office link, using modified purgeList.php that accepts https urls, thanks hashar
20:27 logmsgbot: hashar synchronized php-1.18/maintenance/purgeList.php 'r111480 : enable purge of HTTPS URLs'
20:13 Ryan_Lane: built two raid6 arrays per labstore host. raid sets are initializing.
19:53 logmsgbot: reedy synchronized wmf-config/db.php 'Bring db46 back in'
19:21 RoanKattouw: Ran Varnish purge for 'office
19:19 mutante: used purgeList.php on office.wm URLs, but it appears to be in varnish cache (broken redirect)
19:06 AaronSchulz: gracefulled apaches to deal with APC corruption
18:57 mutante: reverting the (circular) office redirect, syncing..
18:49 mutante: running sync-apache to fix office redirect
18:39 logmsgbot: reedy synchronized wmf-config/db.php 'Comment out db46 from s6 due to really high lag from db schema updates'
16:02 mutante: spence lost /home, mount was "Stale NFS file handle", causing outage of stats.wikimedia.org, fixed by remounting
15:48 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34378 - Rename namespaces on mr.wikisource.org'
15:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34342 - Create a new books namespace on he.wiki'
14:05 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: testwiki to 1.19
09:29 logmsgbot: tstarling synchronized php-1.18/extensions/cldr/LanguageNames.body.php 'r111453'
07:19 apergos: symlinkd wikidiff2.so to php_wikidiff2.so on searchidx2
03:17 logmsgbot: reedy synchronized wmf-config/ExtensionMessages-1.19.php 'Fix fr message file locations'
02:48 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild messages
02:48 logmsgbot: tstarling synchronized wmf-config/AdminSettings.php 'remove $wgUseRootUser and $wgUseNormalUser, broken since 1.17 and 1.16 respectively'
02:41 logmsgbot: reedy synchronized php-1.19/extensions/Contest/Contest.php 'Comment out stupid die for the moment'
02:38 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild messages
02:38 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 're-enabling the collection extension'
02:35 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuild messages
02:26 logmsgbot: reedy synchronized php-1.19/extensions/VisualEditor
02:25 logmsgbot: reedy synchronized php-1.19/extensions/FundraiserLandingPage/
02:25 logmsgbot: LocalisationUpdate failed (1.19) at Tue Feb 14 02:25:33 UTC 2012
02:25 logmsgbot: LocalisationUpdate completed (1.18) at Tue Feb 14 02:25:32 UTC 2012
02:19 logmsgbot: reedy synchronized wmf-config/ExtensionMessages-1.19.php 'Remove variablepage'
01:54 Reedy: Make that ddsh -F5
01:53 Ryan_Lane: when rebooting hume I also applied security updates
01:52 Tim: started indexer on searchidx2 with /home/rainman/scripts/search-restart-indexer per docs
01:52 Reedy: running ddsh -F30 -cM -g mediawiki-installation /usr/bin/sync-common
01:47 Tim: rebooting srv193
01:45 Tim: on searchidx2: doing apt-get upgrade and rebooting
01:44 Ryan_Lane: rebooting hume
01:28 binasher: resuming 1.19 schema migrations after fenari reboot (on first s4 commons slave, db22)
01:19 Tim: rebooting fenari for kernel upgrades
01:14 Tim: doing apt-get upgrade on fenari
01:12 Tim: rebooted fenari to fix stale NFS file handle
00:57 LeslieCarr: rebooted nfs1 as it was unresponsive on console and via IP
00:37 Reedy: killed /usr/local/apache/common/php-1.19 from apaches

February 13

23:29 logmsgbot: reedy ran sync-common-all
23:13 logmsgbot: reedy synchronized wmf-config/abusefilter.php 'Hard code wgAbuseFilterStyleVersion as it went away in 1.19'
23:05 logmsgbot: reedy synchronizing Wikimedia installation... : For good measure
23:01 logmsgbot: reedy rebuilt wikiversions.cdb and synchronized wikiversions files: Switch test2wiki to 1.19wmf1
22:57 Tim: increased concurrency on the image scalers from 10 to 15
22:33 logmsgbot: reedy synchronized php-1.19/includes/api/ApiWatch.php 'r111422'
22:29 Tim: on pdf1: killed a convert process that had been running since Jan 6
22:20 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'disabling the collection extension due to image scaler overload'
21:28 LeslieCarr: reloading brewster
21:17 LeslieCarr: copied a resolv.conf to brewster, apt-get upgrade on brewster and restarted lighttpd and squid on brewster
20:48 Ryan_Lane: rebooting brewster
17:37 logmsgbot: reedy synchronizing Wikimedia installation... : Rebuilt trusted-xff.cdb
17:12 mutante: mailman: deleting test-list
16:31 logmsgbot: reedy synchronized php-1.19/extensions/OggHandler/ 'r111385'
16:31 logmsgbot: reedy synchronized php-1.19/extensions/PagedTiffHandler/ 'r111385'
16:20 logmsgbot: reedy synchronized php-1.19/includes/ 'r111382'
16:19 logmsgbot: reedy synchronized php-1.19/extensions/CategoryTree/ 'r111382'
15:11 logmsgbot: reedy synchronized php-1.18//includes/ 'Bringing across 1.18wmf1 livehacks'
15:03 logmsgbot: reedy synchronizing Wikimedia installation... : Reverting roans live hacks for bug 31576
14:29 logmsgbot: reedy synchronized wikimedia.dblist 'Fix double bewikimedia'
14:28 logmsgbot: reedy synchronized s3.dblist 'Fix double bewikimedia'
14:28 logmsgbot: reedy synchronized pmtpa.dblist 'Fix double bewikimedia'
14:28 logmsgbot: reedy synchronized all.dblist 'Fix double bewikimedia'
14:27 logmsgbot: reedy synchronized 1.17.dblist '1.17-phase1.dblist 1.17-phase2.dblist all.dblist big.dblist closed.dblist deleted.dblist fishbowl.dblist flaggedrevs.dblist news.dblist new_wiktionaries.dblist pmtpa.dblist pmtpa-dump1.dblist pmtpa-dump2.dblist pmtpa-dump3.dblist private.dblist readonly.dblist s1.dblist s2.dblist s2-fixed.dblist s3.dblist s3-fixed.dblist s4.dblist s5.dblist s6.dblist s7.dblist small.dblist special.dblist switchover-ju
13:44 logmsgbot: reedy synchronized php-1.19/extensions/MobileFrontend
02:32 Tim: on kaulen: increased MaxClients to 500 to better deal with the connection flood
02:23 Tim: bugzilla is mostly working now, although it's very slow. The DDoS requests are blocked after connection setup using <Location>
02:21 Tim: on kaulen: restored MaxClients
02:17 logmsgbot: LocalisationUpdate completed (1.18) at Mon Feb 13 02:17:50 UTC 2012
01:46 Tim: temporarily moved bugzilla to port 444 until the connection flood (~1k req/s) subsides
01:15 Tim: started apache with MaxClients=30
00:59 Tim: after kaulen came back up, it was immediately overloaded with jsonrpc.cgi. Stopped apache.
00:54 Tim: kaulen is not responding on ssh, web down, rebooting

February 12

12:09 mark: Killed lsearchd processes on search8, restarted
12:07 mark: Rebalanced mw API app servers from load 120 to 150 in pybal list
10:08 mark: Increased MaxClients to 100 on API apaches in Puppet
09:45 mark: Restricted only opensearch API requests to the API squids
09:43 mark: Restricted only opensearch API requests to the API backend apaches, other API requests now hit the main mediawiki cluster
08:44 mark: maximum_forwards change deployed to all squids
08:42 mark: Set maximum_forwards 2 in squid.conf, deployed to the API squids only so far, rest is pending
07:52 binasher: restarted lsearchd on search{3,4,9}
02:19 logmsgbot: LocalisationUpdate completed (1.18) at Sun Feb 12 02:19:17 UTC 2012

February 11

20:31 apergos: restarted lightty on dataset2
17:28 RobH: manual test of each affected service complete, db9 fully online.
17:26 RobH: db9 moved, all systems online
17:08 RobH: db9 shutting down to move racks, offline during this includes: blogs, bugzilla, racktables, rt, survey, etherpad, observium
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Sat Feb 11 02:18:36 UTC 2012
00:17 logmsgbot: reedy synchronizing Wikimedia installation... :

February 10

22:17 LeslieCarr: fixing the labs apache2 puppet groups
21:48 RobH: memory in cp1017 wasnt properly seated as far as i can tell, if it doesnt mess up again it should be ok.
21:41 RobH: cp1017 being tested for bad memory
21:36 RobH: powercycling msw-a2-eqiad resolves all mgmt issues in rack
21:34 RobH: powercycling msw-a1-eqiad.
21:29 RobH: db1001 rebooting, locked up
20:53 RobH: updating dns for new db hosts
19:59 Reedy: Checking out 1.19wmf1 to /tmp on fenari
19:12 RobH: oxygen setup and installed per rt2343, still needs puppet runs and full deployment per rt 2430
17:58 RobH: updating dns for oxygen internal ip
17:21 mutante: labs logging is broken
17:14 RobH: oxygen offline for hard disk upgrade to replace locke
16:50 mutante: running sync-apache, trying to redirect office.wm to https
16:07 mark: Rebalanced appserver load balancing by giving the new mw* pmtpa app servers weight 150 in the pybal server list
15:17 mark: Turned on KeepAlive on apaches for better miss service times from eqiad
13:42 mark: Configured cp1001 and cp1020 to contact backend servers directly instead of via pmtpa squids
12:02 mark: Decommissioning sq38, sq46 and sq47 in squid configurator
11:50 mark: Making cp1001-1005 API squids
05:08 maplebed: deployed squid config to uploads to send 100% of thumbnail traffic to swift
02:49 maplebed: deploying fix for & bug with swift (files with an & in the name wouldn't load properly)
02:18 logmsgbot: LocalisationUpdate completed (1.18) at Fri Feb 10 02:18:37 UTC 2012
00:22 LeslieCarr: increased nagios max concurrent checks on spence and lowered the interval between processing them
00:20 maplebed: deployed squid config to upload squids rolling thumbnails back to 75% handled by swift to test the & bug

February 9

22:22 Jeff_Green: adding community-analytics.wikimedia.org to DNS
21:36 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Increase wgCodeReviewMaxDiffPaths'
20:04 binasher: started 1.19 schema migrations on seconadry db's (mwscript upgrade-1.19wmf1-1.php --secondary)
19:32 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Increase wgCodeReviewMaxDiffPaths'
17:31 maplebed: deployed squid config to upload squids sending 100% of all thumbnail traffic to swift
16:31 mark: Sending ALL non-european wikipedia traffic to eqiad text squids
16:17 mark: Added Brazil traffic to eqiad text squids
15:49 mark: Sending some Asian wikipedia traffic to wikipedia-lb.eqiad
15:13 mark: Sending Canadian wikipedia traffic to wikipedia-lb.eqiad
14:27 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34183 - add alias redirection in ml wikisource'
14:23 mark: Redirecting {wikiquote,wikisource,wikiversity,wiktionary}-lb.pmtpa traffic to .eqiad (geodns)
14:17 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34253 - Change site logo for lb.wiktionary'
14:11 mark: Redirecting {foundation,wikibooks,wikinews}-lb.pmtpa traffic to .eqiad (geodns)
14:10 logmsgbot: hashar synchronized docroot/mediawiki/xml 'update html pages for 0.1 and 0.2 schemes'
14:06 logmsgbot: hashar synchronized docroot/mediawiki/xml/index.html 'Give a home page to our XML namespaces homepage'
13:52 logmsgbot: hashar synchronized docroot/mediawiki/xml
13:45 mark: Redirecting wikimedia-lb.pmtpa traffic to wikimedia-lb.eqiad (geodns)
13:28 mark: Redirecting mediawiki-lb.pmtpa traffic to mediawiki-lb.eqiad (geodns)
12:49 apergos: around 11:05 UTC, increased /proc/sys/vm/min_free_kbytes from 16259 to 262144 to check impact on network alloc issues, on dataset1001
11:56 mark_: Power cycled cp1017
11:50 mark_: Changed topology of eqiad text squids to request from pmtpa (similar to esams)
09:29 mark_: Reactivated term selected-paths in policy-statement BGP_transit_in on cr2-eqiad, making path 14907 3257 1299 43821 active again
02:26 logmsgbot: LocalisationUpdate completed (1.18) at Thu Feb 9 02:26:49 UTC 2012
00:35 logmsgbot: asher synchronized wmf-config/db.php 'returning db38 to s1 post schema migration testing'

February 8

23:52 logmsgbot: asher synchronized wmf-config/db.php 'returning db11'
23:49 logmsgbot: asher synchronized wmf-config/db.php 'pulling db11'
22:01 maplebed: deployed updated to upload squid.conf to send 75% of all thumbnail traffic to swift
17:37 maplebed: deployed new squid config to upload to direct 50% of thumbnail traffic to swift
17:21 apergos: reboot of dataset1001 for testing.
14:21 mutante: srv189 - shut down again, PCI Express Error, -> RT 2413 created
14:15 mutante: powercycling srv189
13:09 RoanKattouw: Batch-deleting 9,760 redirects on cswiktionary, requested by Danny B
02:26 logmsgbot: LocalisationUpdate completed (1.18) at Wed Feb 8 02:26:55 UTC 2012

February 7

22:11 maplebed: increasing traffic to swift from 12.5% to 25%
21:19 apergos: reboot dataset1001, new kernel 2.6.38, should deal with kswap bug we just ran into under heavy i/o load, need to test
20:24 mark: Uncommented sq63 in squid text-settings.php, please DO NOT COMMENT DOWN SQUIDS UNLESS DECOMMISSIONING THEM PERMANENTLY
20:19 RobH: dns update for db59/60 mgmt
20:07 mark: Added manutius's IP to the stats ACL in squid.conf
19:47 Jeff_Green: email aliases adjusted for merchandise@, wikishop@
19:09 mark: Deployed torrus on server manutius, migrated DNS over
18:48 maplebed: deployed squid config to send 1/8th of all thumbnail traffic to swift
18:32 K4-713: Updated the payments cluster to r110857
18:27 RobH: updating dns for mgmt on db59-60 and labstore1-4
17:16 cmjohnson1: power cable swap cr2-pmtpa complete
17:12 cmjohnson1: swapping power cables for cr2-pmtpa
15:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Extra namespaces for mwikisource bug 33907'
15:35 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Extra namespaces for mwikisource bug 33907'
15:30 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'set project namespace for mrwikisource'
13:25 logmsgbot: reedy synchronized php-1.18/extensions/ShortUrl/ 'r110845'
12:43 logmsgbot: reedy synchronized php-1.18/extensions/ShortUrl/ 'r110841'
02:58 logmsgbot: asher synchronized wmf-config/db.php 'adding db35 back'
02:25 logmsgbot: LocalisationUpdate completed (1.18) at Tue Feb 7 02:25:17 UTC 2012
00:04 binasher: streaming hotbackup of db1021 to db35
00:02 binasher: shutting down mysql on db35, rebuild fail
00:00 logmsgbot: asher synchronized wmf-config/db.php 'pulling db35'

February 6

22:10 binasher: pulled db38 from enwiki, running normal "alter table revision add rev_sha1" and on db1043, the pt-online-schema-change equiv (with --chunk-size=1000, --sleep=0.1) to compare timing
22:03 logmsgbot: asher synchronized wmf-config/db.php 'pulling db38 from enwiki for revision alter timing'
21:38 maplebed: initial deploy of swift to serve thumbnails is complete
21:27 maplebed: deployed new squid.conf to enable swift for all thumbs with /a/a2 in the URL
20:04 binasher: running mk-slave-prefetch on db1018 which was down for 5 days to see if it can catch up
19:28 maplebed: swift deploy aborted due to squid config issues
18:42 maplebed: deploying squid config change to put swift in service for all thumbnails with /a/a2 in the URL
16:49 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34043 - Change logo at Indonesian Wikibooks'
16:46 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34009 - Translation of project namespace and site-name for ta.wikiquote'
16:22 Reedy: Updated user_former_groups.ufg_group to 32 characters
16:07 logmsgbot: nikerabbit synchronized wmf-config/CommonSettings.php 'Do not add translation reviewers group'
15:58 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'add r'
15:57 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'unset( $wgGroupPermissions[translate-proof] )'
15:55 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34124 - Create a Project namespace on Russian Wikipedia'
15:19 Nikerabbit: that was Bug 34213 - Enabling of Extension:Translate on the Wikimedia Incubator
15:18 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php
15:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34219 - Disable Babel autocreation of categories at the Portuguese Wiktionary'
14:59 Nikerabbit: creating Translate db tables for incubator
11:32 logmsgbot: demon synchronized wmf-config/InitialiseSettings.php 'Fixing vepwiki logo - bug 34222'
11:28 logmsgbot: demon synchronized wmf-config/InitialiseSettings.php 'Fixing vepwiki logo - bug 34222'
11:04 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/ 'i18ndeploy r110738 Translate fixes'
06:29 binasher: rotated all eqiad + anayltic enwiki slaves to replicate from db1017 after db1001 hardware failure
02:32 logmsgbot: LocalisationUpdate completed (1.18) at Mon Feb 6 02:32:41 UTC 2012

February 5

18:29 logmsgbot: reedy synchronized php-1.18/includes/api
16:58 maplebed: restarted lsearchd on search6
16:38 binasher: running alter table add column on db1017 enwiki.revision to benchmark
16:34 logmsgbot: asher synchronized wmf-config/db.php 'returning db35 to service'
14:23 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
13:10 mutante: stopped and started lsearchd once again on search6
12:51 mutante: restarted lsearchd on search6
02:25 logmsgbot: LocalisationUpdate completed (1.18) at Sun Feb 5 02:25:51 UTC 2012
02:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'touch'

February 4

21:30 pgehres: switched cc-status back to enabled on donate.wikimedia.org per nagios and GC email
21:25 pgehres: switched cc-status on donate.wikimedia.org to disable per GlobalCollect outage notice
02:25 logmsgbot: LocalisationUpdate completed (1.18) at Sat Feb 4 02:25:24 UTC 2012
00:02 LeslieCarr: applying loopback filter on cr2-pmtpa

February 3

23:59 LeslieCarr: applying loopback filter on cr1-sdtpa
23:21 binasher: timing the same operation as a normal alter on db1005. expect db lag to get backed up by hours
23:16 binasher: testing and timing pt-online-schema change to dewiki.revision on db1021 (not in rotation)
23:07 LeslieCarr: applying loopback filter on cr2-eqiad
23:06 binasher: upgrading percona-toolkit to 2.02 on all coredbs
22:43 logmsgbot: asher synchronized wmf-config/db.php 'returning db33, 39, 46 to prod'
22:39 binasher: db35 had an iblogfile size inconsistent with other s5 hosts. streaming a hotbackup of db1034 to db35
22:23 binasher: rebooted db35, db39
21:55 logmsgbot: asher synchronized wmf-config/db.php 'pulling db35, 39, 46 for upgrades'
20:52 K4-713: updated production civicrm to r1295
20:45 logmsgbot: asher synchronized wmf-config/db.php 'adding back dbs 13,18,25'
20:32 binasher: upgraded mysql on dbs 13,18,25,33
20:17 logmsgbot: reedy synchronized php-1.18/extensions/SpamBlacklist/SpamBlacklist_body.php 'r110682'
19:23 logmsgbot: asher synchronized wmf-config/db.php 'pulling dbs 13,18,25,26 for upgrades'
19:11 RobH: manutius installed and ready for use
17:26 RobH: updated dns for manutius.mgmt
17:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'touch'
17:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'touch'
16:08 RobH: db41 being reinstalled, appears down but logging to be safe
15:20 mark: Around 14:50 UTC, removed the 3 remaining esams upload squids in the knsq8-15 range from the config. This made ms5 unhappy.
15:13 logmsgbot: reedy synchronized wmf-config/db.php 'Add comment that db40 is parsercache'
13:53 mutante: resetting stats on new wikis per bz 34184: updateArticleCount.php vepwiki --update; updateArticleCount.php pnbwiktionary --update
13:42 mark: Disabled knsq1-15 in PyBal, preparing for decommissioning
03:53 maplebed: moved all the individual puppet files out of place, stopped nagios, and re-ran puppet (at now minus 1.5hrs)
02:24 logmsgbot: LocalisationUpdate completed (1.18) at Fri Feb 3 02:24:54 UTC 2012
00:57 K4-713: re-enabled the donations queue consumer via Jenkins
00:42 K4-713: updated production civicrm to r1293
00:23 logmsgbot: asher synchronized wmf-config/db.php 'moving watchlist/recentchanges back to db12, returning db24 to s2'
00:09 K4-713: Disabled donations queue consumption on aluminium

February 2

23:51 K4-713: updated production civicrm to r1291
23:44 binasher: db12 back up with lucid + current mysql
23:32 binasher: rebooting db12
23:08 logmsgbot: asher synchronized wmf-config/db.php 'pulling db12 from enwiki, temporarily moving watchlist/recentchanges to db54'
23:02 pgehres: K4-713 synchronized production CiviCRM to r1288 on Aluminium
22:59 binasher: db24 upgraded to lucid and current mysql build
22:52 binasher: rebooted db24
22:44 logmsgbot: reedy synchronized wmf-config/ 'Disable VariablePage completely'
22:26 binasher: pulled db24 from s2, preparing to upgrade to lucid
22:19 logmsgbot: asher synchronized wmf-config/db.php 'pulling db24 from s2 for upgrade'
21:37 apergos: started rsync from dataset2 to dataset1001 in screen session as root on dataset1001
21:07 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Drop FundraiserPortal config'
21:07 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Drop FundraiserPortal config'
21:06 RobH: dataset1001 is alive, mostly
19:15 logmsgbot: asher synchronized wmf-config/db.php 'raising db55 weight'
19:08 logmsgbot: asher synchronized wmf-config/db.php 'add db55 - new s5 slave'
18:06 notpeter: doing initial run of puppet on cp1001-1020
17:33 notpeter: reimaging cp1002 and imaging cp1001 and cp1003-1020
16:06 cmjohnson1: disk 15 swap complete on db11
16:05 cmjohnson1: replacing disk 15 on db11
15:55 mark: Running apt-get update && apt-get dist-upgrade && reboot on lvs1
15:40 mark: Running apt-get update && apt-get dist-upgrade && reboot on lvs2
15:10 logmsgbot: reedy synchronized php-1.18/extensions/CodeReview/api/ 'r110574'
14:21 logmsgbot: hashar synchronized php-1.18/includes/UserMailer.php 'work around bug 34158'
14:19 logmsgbot: catrope synchronized php-1.18/extensions/LocalisationUpdate/LocalisationUpdate.class.php 'r110570'
14:10 RoanKattouw: Finally fixed ownership of cache/l10n on scalers , sync-l10nupdate only throws the expected errors, no more perms errors on the scalers
14:09 RoanKattouw: Scalers now have disk space available because php-1.17-test is gone
13:59 logmsgbot: catrope synchronizing Wikimedia installation... : Deleted php-1.17-test on fenari, running scap to delete it on the Apaches as well
13:49 RoanKattouw: Deleting /home/wikipedia/common/php-1.17-test , has been unused for a long time
13:45 RoanKattouw: Deleting /tmp/mw-cache-1.17 on srv219 and srv223
13:44 RoanKattouw: srv219-224 have a full disk according to rsync
13:38 RoanKattouw: Fixing ownership of /usr/local/apache/common-local/php-1.18/cache/l10n on srv191, srv199, srv219-224
13:35 RoanKattouw: Running sync-l10nupdate again to investigate rsync errosr
13:34 logmsgbot: LocalisationUpdate completed (1.18) at Thu Feb 2 13:34:53 UTC 2012
13:12 RoanKattouw: Running l10nupdate by hand to hopefully fix bug 33768
13:11 logmsgbot: catrope synchronized php-1.18/extensions/LocalisationUpdate/LocalisationUpdate.class.php 'Deploy live-hacked version that will hopefully fix bug 33768'
10:00 Tim: reinserted the deleted site_stats row for plwiki
09:35 Tim: killing statistics queries on all s2 slave servers
09:34 logmsgbot: tstarling synchronized php-1.18/includes/SiteStats.php 'disabling even more'
09:32 Tim: on db54, killed all SiteStats queries
09:28 Tim: restarting all apaches
09:26 Tim: disabling SiteStatsInit::articles
09:26 logmsgbot: tstarling synchronized php-1.18/includes/SiteStats.php
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Thu Feb 2 02:06:48 UTC 2012
01:22 AaronSchulz: running "mwscriptwikiset purgeDeletedFiles.php all.dblist --starttime=20120126000000" in a screen on fenari

February 1

23:20 logmsgbot: aaron synchronized wmf-config/CommonSettings.php 'Enabled swift thumbnail purge code'
23:12 logmsgbot: aaron synchronized wmf-config/swift.php 'actually register the hook handler'
22:43 logmsgbot: aaron synchronized php-1.18/includes/filerepo/LocalFile.php
22:26 binasher: streaming a hotbackup of db35 to db55 (new s5 slave)
22:21 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix timezone typo for mr'
22:19 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Configure rights assignments for AFTv5'
22:03 AaronSchulz: Enabled SwiftCloudFiles extension on all wikis, doesn't do anything yet
21:51 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Asia/Kolata isn't valid'
21:48 Jeff_Green: disabling deprecated apache, lighttpd, haproxy, squid, mysql services on loudon
21:38 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Add additional articles category to $wgArticleFeedbackv5DashboardCategory'
21:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'timezone config for new sties'
21:30 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 34120 - Enable translate extension on Wikimania 2012 wiki'
21:29 Reedy: Created Translate tables on wikimania2012wiki
21:28 Jeff_Green: dist-upgrading and rebooting loudon
21:20 logmsgbot: reedy ran sync-common-all
21:09 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'fix timezone'
21:07 logmsgbot: reedy ran sync-common-all
20:55 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'vepwiki config'
20:53 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Undo temp eot change'
20:53 logmsgbot: reedy ran sync-common-all
20:48 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'wrong file'
20:46 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Temporarily enable eot uploads on amwiki'
20:35 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bewikimedia site config'
20:31 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Set $wgArticleFeedbackv5SelectedCTA = 1'
20:29 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'set bewikimedia to en'
20:21 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying ArticleFeedbackv5 updates
20:15 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
19:58 RobH: labs switch ports connected per rt 1882
19:57 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
19:52 RobH: strontium.mgmt repaired per rt2352
19:50 logmsgbot: catrope synchronized php-1.18/cache/interwiki.cdb 'rebuilt interwiki cache'
19:47 logmsgbot: catrope synchronized php-1.18/cache/interwiki-pr.cdb 'rebuilt interwiki cache'
19:47 logmsgbot: catrope synchronized php-1.18/cache/interwiki.cdb 'rebuilt interwiki cache'
19:43 RobH: lab-ex4200-1 back in rack
19:29 logmsgbot: reedy ran sync-common-all
19:20 RobH: pushing apache changes for reedy
18:27 RobH: ganglia1002 back online ready for install
18:26 RobH: ganglia1002 mgmt offline per rt 2247, system was unplugged... no idea why
18:24 cmjohnson1: pulled drive 2 db47
18:15 RobH: cp1019 memory error repaired, now it is ready for OS install
18:14 RobH: cp1017 memory error repaired
17:54 RobH: updated dns for payments boxen renames in eqiad
17:37 RobH: cp1014 memory was improperly installed (from factory?), installed in supported configuration and system is now ready for OS install per RT2351
17:08 RobH: investigating errors on cp1014
17:04 RobH: cp1019 console redirection fixed per rt2353, ready for OS install
16:57 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
16:54 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
16:48 RobH: dataset1001 controller replaced
16:41 cmjohnson1: reseating drive2 in db47
16:37 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wikilove default on fawikis'
16:22 RobH: dataset1001 down for controller replacement
16:07 mark: Removed now obsolete package wikimedia-task-squid from the karmic-wikimedia and lucid-wikimedia APT repositories, and deleted in svn.wikimedia.org
15:45 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Try a quietened dumpInterwiki script'
14:45 mutante: running authdns-update to remove oldusability
14:30 mutante: shutting down "oldusability" linode instance
13:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Point wgInterwikiCache at interwiki.cdb'
13:41 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache copying protocol relative over interwiki.cdb'
13:36 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Kill wmgHTTPSExperiment'
13:35 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Tidy up cache epoch code'
13:30 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Remove some of the wmgHTTPSExperiment related conditionals'
13:27 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Only use the protocol relative interwiki cdb'
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Wed Feb 1 02:06:20 UTC 2012
00:51 binasher: applied articlefeedback v5 schema changes to enwiki, testwiki, en_labswikimedia
00:33 logmsgbot: reedy synchronized php/cache/interwiki-pr.cdb 'Updating interwiki cache'
00:07 logmsgbot: reedy synchronized php/cache/interwiki-pr.cdb 'Updating interwiki cache'
00:05 logmsgbot: reedy synchronized php/cache/interwiki-pr.cdb 'Updating interwiki cache'

January 31

21:44 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/gateway_common/us-states.i18n.php 'r110433'
21:44 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/gateway_common/interface.i18n.php 'r110433'
21:43 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/gateway_common/countries.i18n.php 'r110433'
21:41 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/globalcollect_gateway/globalcollect_gateway.i18n.php 'r110433'
21:41 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/globalcollect_gateway/globalcollect_gateway.alias.php 'r110433'
21:40 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/payflowpro_gateway/payflowpro_gateway.i18n.php 'r110433'
21:39 logmsgbot: awjrichards synchronized php/extensions/DonationInterface/payflowpro_gateway/payflowpro_gateway.alias.php
21:23 notpeter: restarting puppet on emery
20:57 notpeter: temp stopping puppet on emery for testing
19:58 notpeter: on stafford, that is
19:57 notpeter: restarting puppetmaster proc as it's serving up 500s to all clients (well, 3 randomly selected ones...)
18:55 Ryan_Lane: restarted squid and lighttpd on brewster
18:36 RoanKattouw: IRC breakage postmortem: MediaWiki was configured to send UDP packets to .179 (ekrem-old) instead of .178 (ekrem)
18:32 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Change wgRC2UDPAddress to the new ekrem IP'
16:56 mutante: restarted ircd on ekrem once again because we still cant join channels .. problem remains
16:35 mutante: restarted IRC bot on ekrem (needs dependency to start after ircd)
16:30 mutante: ekrem - gets Error 500 on SERVER when running puppet
16:30 logmsgbot: reedy synchronized php-1.18/extensions/SpamBlacklist/SpamBlacklist_body.php 'r110401'
16:28 mutante: ekrem - su -c /usr/local/ircd-ratbox/bin/ircd irc
16:21 mutante: powercycling ekrem - mgmt just showed "Stopping web" and was frozen completely
16:17 RoanKattouw: ekrem suddenly died around 16:03 UTC, breaking the RC IRC feed
15:06 mutante: changed nameservers for wikimedia.pl per RT:2277/bugzilla:33509
09:28 logmsgbot: catrope synchronized php-1.18/includes/Wiki.php 'r110368'
09:27 logmsgbot: catrope synchronized php-1.18/includes/Exception.php 'r110368'
06:34 Tim: added myself to the gerrit "administrators" group
05:23 Tim: the segfaults didn't stop, so I'm disabling wmerrors entirely for now
05:13 Tim: since puppet is broken, disabled wmerrors backtrace logging by adding a separate configuration file in /etc/php5/conf.d and reloading apache
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Tue Jan 31 02:06:09 UTC 2012

January 30

23:56 awjr: synchronized i18n files for DonationInterface on payments cluster to r110342
23:46 Ryan_Lane: moving instances from virt2 to virt1 to rebalance compute cluster
19:48 logmsgbot: awjrichards synchronizing Wikimedia installation... : Syncing CentralNotice to r110026 of trunk, includes important fix for 1.19 compatibility
19:28 logmsgbot: asher synchronized wmf-config/db.php 'raising db54 weight'
18:57 logmsgbot: asher synchronized wmf-config/db.php 'adding db54 to s2'
18:39 mutante: running authdns-update to activate be.wikimedia.org
18:19 logmsgbot: nikerabbit synchronized php-1.18/extensions/Narayam/resources/ext.narayam.rules.as.js 'I18ndeploy r110311 - bug 33924'
18:17 logmsgbot: nikerabbit synchronized php-1.18/extensions/WebFonts/resources/ext.webfonts.fontlist.js 'I18ndeploy r110311 - bug 33599'
18:16 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/ 'I18ndeploy r110310 - Translate help links'
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Mon Jan 30 02:06:42 UTC 2012

January 29

18:55 logmsgbot: reedy synchronized php-1.18/extensions/SiteMatrix/SiteMatrix_body.php
08:42 mutante: restarted lsearchd on search6
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Sun Jan 29 02:06:09 UTC 2012

January 28

02:06 logmsgbot: LocalisationUpdate completed (1.18) at Sat Jan 28 02:06:56 UTC 2012
00:00 binasher: db54 is now replicating from db30

January 27

21:57 pgehres: updates complete, re-enabling queue consumption on jenkins on aluminium
21:42 pgehres: pausing payments queue consumption in jenkins to backup and then run some db updates
21:34 LeslieCarr: applying loopback filter on cr1-eqiad
21:28 RobH: dns update
20:53 ^demon: gallium: clearing /tmp yet again. Aaron claims he's fixing it now
19:37 RobH: reinstalling sq31
18:32 RobH: dns update for fluorine host
17:15 binasher: s2 dbs are a sad lot. streaming hotback of db1034 to db54 to build a new slave
14:37 Jeff_Green: dist-upgrading storage3
13:36 ^demon: gallium: cleaning up /tmp again, tests really need to clean up after themselves.
11:34 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'th wikilogos'
11:29 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'th wikilogos'
11:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33862 - Request for logo change in Tamil Wikiquote'
11:11 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33960 - Import sources for etwiki, etwikisource and etwiktionary'
11:07 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33864 - Flood flag on sr.wiki'
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Fri Jan 27 02:05:24 UTC 2012
00:17 logmsgbot: py synchronized wmf-config/CommonSettings.php 'changing eqiad cp1001-cp1020 IPs to their new, private IPs'

January 26

23:58 logmsgbot: catrope synchronized php-1.18/extensions/MoodBar/ 'Update MoodBar'
23:56 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Adding $wgMoodbarConfig["feedbackDashboardUrl"]'
23:54 logmsgbot: reedy synchronized php-1.18/includes/specials/SpecialBlockList.php 'r110095'
23:28 mark: Deployed squid configs to all squids
23:26 mark: Deploying modified squid configs of modified squid config generator to text.knams
22:14 RobH: poking at puppet change breaking things on sockpuppet puppet runs
21:33 ^demon: gallium: cleared a bunch of junk from /tmp
21:12 Jeff_Green: upgraded storage3 mysqld from 5.1.47 to mysql-at-facebook-r3753
20:17 logmsgbot: asher synchronized wmf-config/db.php 'db37 back in s7'
20:10 Reedy: Created "spoofuser" AntiSpoof table in the central auth database
19:34 logmsgbot: asher synchronized wmf-config/db.php 'pulling db37 from s7 for upgrades'
19:24 RobH: disregard any flapping by mw1001, its my script testbed
18:04 RobH: forcing puppet run on srv199
17:45 RobH: shutting down srv199 for bios tinkering by chris
02:37 RoanKattouw: Started the udp2log process for the AFT logger manually on emery
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Thu Jan 26 02:06:22 UTC 2012
01:01 logmsgbot: asher synchronized wmf-config/db.php 'adding db32 to s1 at low weight, new enwiki snapshot host'
00:33 logmsgbot: asher synchronized wmf-config/db.php 'returning db18, now replicating heartbeat db'
00:28 logmsgbot: asher synchronized wmf-config/db.php 'temporarily pulling db18'
00:11 pgehres: re-enabling recurring donation module and processing in CiviCRM
00:04 logmsgbot: asher synchronized wmf-config/db.php 'adding db26 to s7'
00:03 RobH: shutting down srv151-srv186 per RT 2318 (confirmed not in pybal pools for apache or api)

January 25

23:59 binasher: removed old external store apaches from pybal config
23:50 logmsgbot: asher synchronized wmf-config/db.php 'pulling db26'
23:49 logmsgbot: asher synchronized wmf-config/db.php 'adding db26 to s7'
23:35 pgehres: re-enabled queue consumption for payments through Jenkins
23:35 pgehres: awjr synchronized CiviCRM on aluminium to r1211
23:05 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Checking if wmgDisplayFeedsInSidebar === false rather than true, since it defaults to true in the install file'
22:50 logmsgbot: awjrichards synchronizing Wikimedia installation... : Enabling FeaturedFeeds everywhere
22:41 RobH: updating dns for bellin/blondel db9/10 replacements
22:32 logmsgbot: awjrichards synchronizing Wikimedia installation... : Dark-deploying FeaturedFeeds
22:07 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'fixed spelling mistake fore FeaturedFeeds configuration'
22:05 logmsgbot: awjrichards synchronized wmf-config/CommonSettings.php 'Setting up FeaturedFeeds config; disabled by default'
22:02 logmsgbot: awjrichards synchronized wmf-config/InitialiseSettings.php 'Setting up FeaturedFeeds config; disabled by default'
21:54 LeslieCarr: deactivated selected-paths policy-statement on cr1-eqiad and cr2-eqiad
21:23 Tim: on srv197: compiled and installed a local version of wmerrors for segfault investigation
21:22 binasher: bits caches: running varnish param.set thread_pool_min, thread_pool_max, where min = 15000 / cores / 4 and max = 15000 / cores
20:57 binasher: running "varnishadm param.set thread_pool_max 1875" on mobile varnish servers
20:05 Tim: on srv197: temporarily disabled puppet and enabled core dumps in apache2.conf for segfault flood investigation
19:57 Jeff_Green: running dist-upgrade on payments* and silicon
19:44 Tim: updating TrustedXFF host list using fenari
19:29 RobH: dns update go!
18:50 LeslieCarr: restarted varnish on niobium
18:49 RoanKattouw: Restarted morebots
13:33 logmsgbot: demon synchronized wmf-config/CommonSettings.php 'Change IP address for bnwiki account creation throttle per bug 33900'
11:43 apergos: formey oom (I guess), unresponsive from mgmt console, powercycling.
06:07 pgehres: disabled queue consumption of payments in jenkins until stuck message can be removed from queue
05:49 pgehres: Disabling the processing of recurring payments in CiviCRM until we can add the appropriate payment_method to the queue msgs
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Wed Jan 25 02:06:17 UTC 2012
00:57 binasher: streaming hotbackup of db53 to db32
00:52 binasher: shutting down mysql on db32, going to reconfigure with lvm and reslave
00:45 logmsgbot: asher synchronized wmf-config/db.php 'pulling db32 - this will be the new enwiki pmtpa snapshot host'
00:30 binasher: streaming hotbackup of db37 to db26, preparing to reprovision db26 in s7

January 24

23:30 pgehres: testing
23:28 awjr: Syncing prod CiviCRM on aluminium to r1209
22:39 mark: Disabled quota support on sanger's IMAP server to make Dovecot work again
22:23 mark: Sanger is upgraded to lucid
21:49 RobH: ms-be1 is online! MAN WE ARE AWESOME
21:20 mark: Starting dist-upgrade of sanger
20:50 binasher: pulled db26, rebooting and re-imaging with lucid
20:48 logmsgbot: asher synchronized wmf-config/db.php 'pulling db26 from s1 to reimage'
20:31 cmjohnson1: restarting ms4 for memory testing
20:19 notpeter: spinning up db54-58 for asher
20:09 logmsgbot: asher synchronized wmf-config/db.php 're-weighting s6 dbs'
20:06 logmsgbot: asher synchronized wmf-config/db.php 'adding db43 back to s6 at a low weight'
20:01 logmsgbot: asher synchronized wmf-config/db.php 'raising db53 weight to 400'
19:37 logmsgbot: asher synchronized wmf-config/db.php 'raising db53 weight to 200'
19:33 logmsgbot: asher synchronized wmf-config/db.php 'adding db53 as an enwiki slave at 1/4 normal weight'
19:18 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'switched to Preprocessor_Hash on ocwiki only'
18:52 LeslieCarr: moved cp1001-1040 to private vlan
18:52 LeslieCarr: restarted morebots
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Tue Jan 24 02:05:48 UTC 2012
01:55 Ryan_Lane: fixed reverse dns for labs instances
01:32 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Add account creation throttle increase for bug 33900'
01:07 LeslieCarr: restarting dhcp3-server on brewster
00:54 logmsgbot: tstarling synchronized wmf-config/InitialiseSettings.php 'new rsvg command line option'
00:54 logmsgbot: tstarling synchronized wmf-config/CommonSettings.php
00:50 Tim: upgraded rsvg on all mediawiki-installation servers, for some reason it is installed on all of them
00:23 binasher: streaming a hotbackup of db1006 to db43
00:19 Tim: running apt-get upgrade on image scalers
00:18 Tim: uploaded new rsvg to apt.wikimedia.org, deploying to image scalers

January 23

23:31 binasher: started slaving db53 from db36 (enwiki)
23:16 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php 'update to Mobile Frontend for custom logo support'
23:14 RobH: dns update for a bunch of things
23:12 preilly: push config change for custom logos
23:11 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add MobileFrontend custom logo support'
23:10 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'add MobileFrontend custom logo support'
23:10 RobH: srv187, srv188, srv189 set to false in pybal for api lvs, old servers that will be decommed soon.
21:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Wrapping more long lines'
21:18 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33864 - Flood flag on sr.wiki'
21:15 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33864 - Flood flag on sr.wiki'
20:55 binasher: rebooting db1029 with proprietary binary only huawei kernel module installed, for short term ssd evaluation
20:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33899 - Request for Narayam in outreach.wikimedia.org'
20:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33841 - Re-point $wgLogo to on zhwiki'
20:34 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33166 - Creation of a new namespace for Malagasy Wiktionary'
19:59 RobH: added dns info for ms-be1 but not pushing change until leslie pushes her
19:19 RobH: locke seems ok
19:07 RobH: locke down
19:06 RobH: going to shutdown locke now for the move
18:05 logmsgbot: nikerabbit synchronized php-1.18/extensions/WebFonts/ 'i18ndeploy r109836'
18:04 logmsgbot: nikerabbit synchronized php-1.18/extensions/CodeReview/ui/CodeRevisionView.php 'i18ndeploy r109836'
17:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Push changes for aswiki by Santhosh'
13:29 rainman-sr: restarted search1, search3, search4 - not sure why they were dead
02:22 Ryan_Lane: installing new version of nginx in eqiad
02:22 Ryan_Lane: restarted nginx in pmtpa and esams
02:21 Ryan_Lane: installed new version of nginx in pmtpa and esams
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Mon Jan 23 02:06:35 UTC 2012

January 22

23:46 Ryan_Lane: changing nginx config to use the escaped useragent
23:45 Ryan_Lane: repooling ssl4
23:39 Ryan_Lane: restarting nginx servers
23:31 Tim: pushed out unstripped version of wmerrors
23:26 Ryan_Lane: testing new nginx package on ssl4
22:41 mark: cp1042 stuck on disk i/o, rebooting
22:28 mark: Restarted varnish backend on cp1041 and cp1042
21:38 preilly: remove SOPA banner
21:38 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php 'update to mobile frontend to remove sopa banner'
18:42 Tim: running apt-get upgrade on searchidx2
18:39 Tim: running apt-get upgrade on snapshot2 and snapshot4
18:32 Tim: running apt-get upgrade on snapshot1 to get wikimedia version of php-wikidiff2
06:36 Ryan_Lane: repooling ssl1004, depooling ssl4
05:44 Ryan_Lane: depooling ssl1004
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Sun Jan 22 02:05:22 UTC 2012
00:51 Reedy: Resent mediawiki-cvs commit emails from r109549 through r109704

January 21

21:45 logmsgbot: reedy synchronized php-1.18/includes/api/ApiParse.php 'r109695'
19:19 logmsgbot: reedy synchronized wmf-config/flaggedrevs.php 'More for bug 29742'
19:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33215 - Enabling transwiki import on sa.wiktionary+'
18:55 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Setting wgNamespaceRobotPolicies for th projects'
18:35 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Cleanup wgEnableDnsBlacklist, enable for th projects'
18:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Setting timezone for th projects'
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Sat Jan 21 02:05:53 UTC 2012
00:27 logmsgbot: asher synchronized wmf-config/db.php 'raising db52 load to 400'
00:22 logmsgbot: asher synchronized wmf-config/db.php 'raising db52 load to 200'
00:17 logmsgbot: asher synchronized wmf-config/db.php 'adding db52 to enwiki, load 100'

January 20

23:45 LeslieCarr: ms6 sdc is undergoing fsck due to wrong fs type, bad option, bad superblock, or other on /dev/sdc1,
23:44 binasher: moving north america bits back to eqiad
23:34 binasher: moved bits eqiad to pmtpa (via scenarios/normal/bits-geo.wikimedia.org)
23:32 LeslieCarr: killed carnish on niobium , cpu load seems to be going down
23:30 LeslieCarr: reloading arsenic
23:10 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Wrap some stupidly long lines'
22:52 LeslieCarr: rebooting cp3001
22:30 LeslieCarr: reloading cp3001
22:24 LeslieCarr: restarting networking on cp3001
22:03 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33789 - Enable botadmin usergroup on ml.wikipedia'
22:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33789 - Enable botadmin usergroup on ml.wikipedia'
21:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33789 - Enable botadmin usergroup on ml.wikipedia'
20:40 RobH: dns servers all still online after update
20:40 RobH: dns update for dataset1001
18:33 LeslieCarr: knsq9 has recovered post-reboot
18:21 LeslieCarr: knsq9 will be rebooted as it is dead, dead, dead
17:48 LeslieCarr: knsq9 is dead/overloaded
15:11 mutante: knsq30 still has bad disk, powering down again
15:07 mutante: powercycling knsq30 after replacing cable
15:04 ^demon: fixed post-commit hook on formey email notifs to point to correct smtp server
14:41 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
14:27 logmsgbot: reedy synchronized php/cache/interwiki.cdb 'Updating interwiki cache'
13:03 mutante: reinstalling srv199
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Fri Jan 20 02:05:21 UTC 2012
01:17 awjr: updated representative/zipcode mapping and some contact info for a handful of reps/senators for CongressLookup r109598
01:12 binasher: started another hotbackup of db38 to db52
01:08 logmsgbot: asher synchronized wmf-config/db.php 'pulling db52'
00:45 logmsgbot: asher synchronized wmf-config/db.php 'doubling db52 weight'
00:38 logmsgbot: asher synchronized wmf-config/db.php 'lowering db52 weight'
00:32 binasher: deployed new enwiki slave, db52
00:32 logmsgbot: asher synchronized wmf-config/db.php 'setting db52 to full weight'
00:19 logmsgbot: asher synchronized wmf-config/db.php 'adding new enwiki slave db52, with a low weight'
00:08 preilly: push weekly mobile frontend update
00:08 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php 'weekly update to Mobile Frontend'

January 19

23:56 Ryan_Lane: changed global roles netadmins and sysadmins to be virtual static groups in ldap that autopopulate with any user that has objectclass=novauser
23:15 Tim: rebuilt wikidiff2 with package name php-wikidiff2, removed lucid package php5-wikidiff2 from apt using "reprepro remove"
22:52 Tim: recompiled wikidiff2 and put the new version up on apt.wikimedia.org
21:51 Jeff_Green: starting conversion of fundraisingdb 'faulkner' tables from myisam to innodb, expect replication delays
21:12 binasher: starting slaving db52 from db36, running hotbackup of db32 to db53
20:36 RobH: dataset1001 shut down for later use
20:27 RobH: dataset1001 mgmt online
20:15 RobH: dataset1001.mgmt even
20:15 RobH: updating dns for dataset1mgmt
20:03 LeslieCarr: testing
20:00 LeslieCarr: testing
05:16 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ 'new sopa banner'
05:14 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Enabling anon editing for enwiki'
05:12 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Enabling page creation for users'
05:08 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ 'new sopa banner'
05:06 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ 'new sopa banner'
05:00 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Removing all SOPA changes, excluding editing for anons, and page creation'
04:57 binasher: flushing mobile varnish caches
04:53 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ 'new sopa banner'
04:47 Ryan_Lane: Preparing InitialiseSettings for renabling Wikipedia. DO NOT SCAP, DO NOT PUSH InitializeSettings
04:32 logmsgbot: awjrichards synchronizing Wikimedia installation... : Deploying CongressLookup changes for the lifting of the blackout
03:11 Ryan_Lane: bringing virt1 back up
03:01 Ryan_Lane: rebooting virt1 to ensure hardware virtualization is enabled in the bios
02:30 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109477'
02:29 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/CongressLookup.i18n.php 'r109477'
02:06 Ryan_Lane: rebalance of gluster volume completed
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Thu Jan 19 02:05:55 UTC 2012
02:05 Ryan_Lane: rebalancing instance gluster volume. network may get saturated for a while.
01:55 Ryan_Lane: added virt1 and virt4 to instance volume for gluster
01:17 Reedy: Leaving cleanupUploadStash.php running against commonswiki in a screen session as me on hume
01:16 binasher: removing extra mobile varnish capacity - it wasn't needed
01:13 awjr: updated zip code/representative data on enwiki to r109465
01:01 Ryan_Lane: installed python-argparse on stat1
00:54 binasher: running a hot backup of db32, streaming to db52
00:22 Ryan_Lane: removing virt1 cname
00:21 Ryan_Lane: rebuilding virt1 as a nova compute node
00:20 LeslieCarr: changed vlan for virt1 eth0
00:18 Ryan_Lane: cleared lighttpd logs on brewster and restarted squid and lighttpd
00:05 logmsgbot: asher synchronized wmf-config/db.php 'returning db32 to normal weight'

January 18

23:59 logmsgbot: asher synchronized wmf-config/db.php 'returning db32 at a low weight'
23:50 binasher: rebooting db32 for mysql/kernel upgrades
23:49 logmsgbot: asher synchronized wmf-config/db.php 'pulling db32 from s1 for mysql/kernel upgrades'
23:44 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109457'
23:02 maplebed: increased the size of db11's logical volume for /a from 500G to 800G.
22:27 binasher: enwiki master changed to db36 - MASTER_LOG_FILE='db36-bin.000599', MASTER_LOG_POS=15773827
22:26 logmsgbot: asher synchronized wmf-config/db.php 'done swapping s1 master to db36'
22:25 binasher: swapping s1 master to db36
22:24 logmsgbot: asher synchronized wmf-config/db.php 'starting swap of s1 master to db36, s1 in read-only'
22:13 logmsgbot: asher synchronized wmf-config/db.php 'returning db36 to normal weight'
22:07 logmsgbot: asher synchronized wmf-config/db.php 'returning db36 at a low weight'
21:59 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109440'
21:58 binasher: rebooting db36, upgrading kernel + mysql
21:56 logmsgbot: asher synchronized wmf-config/db.php 'pulling db36 from s1 for mysql/kernel upgrades'
21:54 Ryan_Lane: installing python-wurfl on stat1
21:35 Ryan_Lane: installing geoip-bin geoip-database libgeoip1 python-geoip on stat1
21:13 logmsgbot: asher synchronized wmf-config/db.php 'returning db38 at prior weight'
21:05 Reedy: Run patch-ug_group-length-increase.sql on all wikis
21:04 Reedy: Run patch-uploadstash_chunk.sql on all wikis
21:03 Reedy: Run patch-jobs-add-timestamp.sql on all wikis
20:55 awjr: update cl_zip5 table for CongressLookup to data in r 109408
20:43 Reedy: Manually running cleanupUploadStash.php against commonswiki
20:42 Reedy: Manually ran cleanupUploadStash.php against enwiki
20:31 binasher: db38 in service at a low weight with new lucid kernel and current mysql build
20:30 RobH: shutting down db17, confirmed not in db rotation and has no mysql instance active
20:30 logmsgbot: asher synchronized wmf-config/db.php 'returning db38 at a lower weight'
20:28 logmsgbot: asher synchronized wmf-config/db.php 'pulling db38 again'
20:26 logmsgbot: asher synchronized wmf-config/db.php 'returning db38 to service'
20:17 LeslieCarr: rebooting spence as it's once again gone crazy
20:11 binasher: pulled db38, rebooting for kernel and mysql upgrades
20:11 logmsgbot: asher synchronized wmf-config/db.php 'pulling db38 from s1 for upgrade'
20:04 RobH: mw1102 coming down for mainboard replacement
20:03 LeslieCarr: killing puppet processes on spence
19:28 Reedy: Run patch-jobs-add-timestamp.sql on enwiki (jobs table is empty!)
19:01 mutante: mw1108 - OS installed, added to puppet, finished catalog run, free for use
18:37 mutante: pxe booting mw1108, OS install
18:36 mutante: fixed DHCP config for mw1108 on brewster, had the string "Failed to connect to 10.65.1.108." where the MAC address should have been.
18:27 RobH: searchidx1001 memory replaced per rt 2208
18:20 mutante: tried to PXE boot mw1108 but no DHCP offers received
18:15 RobH: searchidx1001 memory being replaced
18:14 LeslieCarr: re-preffing tele2 routes
18:11 RobH: db1004 hard disk replaced per rt#2140, rebuilding
17:40 LeslieCarr: Draining HE to perform maintenance on the physical port
16:57 logmsgbot: reedy synchronized php-1.18/extensions/CongressLookup/SpecialCongressLookup.php 'r109395'
14:45 mark: Changed service IP addresses of lists.wikimedia.org in DNS to US prefixes
14:40 mark: Disabled hold_domains on sodium and lily
14:28 mark: Setup lily to route lists.wikimedia.org mails to sodium
14:21 mark: rsync complete. Running dpkg-reconfigure mailman on sodium
13:43 logmsgbot: demon synchronized php-1.18/extensions/CongressLookup/SpecialCongressLookup.php 'r109362'
13:38 mark: Started rsync of selected mailman directories under /var/lib/mailman from lily to sodium
13:37 mark: Removed all test messages on the exim4 queue on sodium
13:37 mark: Created /var LVM snapshot on lily
13:35 mark: Stopped lighttpd on lily
13:34 mark: Stopped mailman on lily and sodium
13:30 mark: Set hold_domains = lists.wikimedia.org on lily, to hold new lists mails on the queue
13:30 mark: Starting mailman migration
13:03 mutante: restarted pdns on ns0
08:49 logmsgbot: neilk synchronizing Wikimedia installation... :
07:45 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109336'
06:28 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109324'
06:24 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109322'
06:08 logmsgbot: awjrichards synchronized php/extensions/CongressLookup/SpecialCongressLookup.php 'r109319'
05:32 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Disabling CentralNotice for simplewiki'
05:25 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Disabling moodbar on enwiki'
04:59 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Disabling all editing for enwiki for SOPA blackout'
04:50 Ryan_Lane: queuing up changes for totally disabling edits. DO NOT SCAP! DO NOT SYNC InitialiseSettings!
04:47 Ryan_Lane: Editing enwiki's MediaWiki:Robots.txt to disallow BannerController for SOPA blackout
04:45 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php
04:44 Ryan_Lane: Disabling anon editing and page creation by users on enwiki for SOPA blackout
04:33 logmsgbot: neilk synchronized wmf-config/InitialiseSettings.php 'enable CongressLookup on enwiki'
04:01 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ApplicationTemplate.php 'update version'
03:56 logmsgbot: neilk synchronizing Wikimedia installation... : deploying CongressLookup (for i18n reasons, not deploying to enwiki)
03:28 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Removing restriction of display title for SOPA landing pages'
02:35 binasher: cp1039-40 are now in service for mobile wikipedia
02:04 logmsgbot: LocalisationUpdate completed (1.18) at Wed Jan 18 02:04:57 UTC 2012
01:35 RobH: cp1040 and cp1036 ready for use
01:33 RobH: cp1037, cp1038, cp1039 os installed, varnish partitions mounted, and puppet run

January 17

22:47 binasher: ram only varnish instance now running on marmontel in front of apache/wordpress
22:07 Ryan_Lane: installing memcache on marmontel
20:54 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33769 - Allow bureaucrats to remove sysop rights at Bashkir Wikipedia'
20:13 mutante: en.planet updates were stuck. reason was corrupted cache causing "bsddb.db.DBPageNotFoundError" which broke update script. solution was to kill stuck updates, delete files in cache dir and run update manually
19:59 logmsgbot: reedy synchronized php-1.18/includes/Feed.php 'r109197'
19:36 Ryan_Lane: added Cite extension to labscosnole
19:29 logmsgbot: reedy synchronized php-1.18/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r109186'
18:05 RobH: blog is instantly faster
18:05 RobH: theme updated on blog along with settting limit back to 20 comments per page
17:46 RobH: aware of blog slowdowns, work is being done
17:35 mutante: also upgraded drac firmware on mw1081 & mw1099 (fixes mgmt console problem)
16:45 mutante: upgrading drac firmware on mw1108
15:54 RobH: db43 rebooting
15:27 RobH: db7 shutting down for decom, not listed in db for any clusters, load .01
11:05 logmsgbot: neilk synchronized wmf-config/ExtensionMessages-1.18.php 'added CongressLookup to ExtensionMessages-1.18 for i18n'
11:04 logmsgbot: neilk synchronized wmf-config/extension-list 'added CongressLookup to extension-list for i18n'
10:30 logmsgbot: neilk synchronizing Wikimedia installation... : deploying CongressLookup. We are not deploying to any live wiki, just test, but this is to make i18n work
10:28 logmsgbot: neilk synchronized wmf-config/InitialiseSettings.php 'added CongressLookup to InitialiseSettings'
10:25 logmsgbot: neilk synchronized wmf-config/CommonSettings.php 'added CongressLookup require'
05:47 maplebed: marmontel has now replaced hooper as blog.wikimedia.org
05:26 maplebed: installing the mysql client on marmontel to test connectivity to the DB
05:16 Ryan_Lane: installing php-apc on marmontel
04:52 RobH: another dns update for servermgmt
04:18 Ryan_Lane: installing varnish on hooper
02:29 Ryan_Lane1: that last message was in regards to hooper
02:29 Ryan_Lane1: temporarily disabled puppet, since the apache configuration was manually modified
02:06 logmsgbot: LocalisationUpdate completed (1.18) at Tue Jan 17 02:06:00 UTC 2012
01:52 Ryan_Lane: installed w3 total cache in wordpress on hooper
01:51 Ryan_Lane: installing tidy on hooper
01:51 Ryan_Lane: installing php-apc on hooper
01:31 Ryan_Lane: powercycling hooper
00:44 neilk_: neilk just added config change to set caching for banners on testwiki to 0. Should have no effect anywhere else.
00:39 logmsgbot: neilk synchronized wmf-config/CommonSettings.php

January 16

23:59 logmsgbot: tstarling synchronized docroot/bits/robots.txt 'removed rule intended for the itwiki protest, left in accidentally'
22:06 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Switching permissions back to normal on testwiki'
22:02 logmsgbot: laner synchronized wmf-config/InitialiseSettings.php 'Testing disabling edits on testwiki'
22:02 Ryan_Lane: testing disabling edits for testwiki
21:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bug 33763 DoubleWiki on frwikiversity'
20:33 Ryan_Lane: pushing floating address changes to virt0
19:23 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 32182 - enable articlefeedback extension on spanish wikipedia'
19:14 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33759 - Change sitename for lb.wiktionary'
18:48 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/ 'i18ndeploy'
18:47 logmsgbot: nikerabbit synchronized php-1.18/extensions/WebFonts/ 'i18ndeploy'
18:45 logmsgbot: nikerabbit synchronized php-1.18/extensions/Narayam/resources/ext.narayam.rules.as.js 'i19ndeploy'
18:44 logmsgbot: nikerabbit synchronized php-1.18/skins/common/shared.css 'i18ndeploy'
17:17 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33508 - Enable Rollback group on id.wiki'
17:05 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable autoconfirmed reupload on incubatorwiki'
17:00 Reedy: srv221 and srv222 are out of space on /
17:00 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Disable autoconfirmed reupload on incubatorwiki'
16:41 logmsgbot: reedy synchronized wmf-config/flaggedrevs.php 'Kill labs setting stuff'
16:40 logmsgbot: reedy synchronized wmf-config/flaggedrevs.php 'Kill labs setting stuff'
16:37 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33662 - Change project namespace for lb.wiktionary'
16:35 Reedy: Ran namespaceDupes on fawikisource
16:34 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33662 - Change project namespace for lb.wiktionary'
16:28 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33708 - Add alias to fa wikisource'
16:26 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33708 - Add alias to fa wikisource'
16:06 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33758 - Arabic numerals on Arabic Wiktionary'
15:55 cmjohnson1: shutting down srv199 for main board replacement
15:09 mutante: torrus was broken (RT:2279) and did not start due to corrupted berkeleydb, used db4.8_recover, service started again
12:38 mark: Running puppet on freshly installed sodium
09:15 apergos: cleaned up /tmp on srv223... seems like cleanup once an hour by cron isn't often enough any more, scalers are doing too much work
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Mon Jan 16 02:05:05 UTC 2012

January 15

02:45 logmsgbot: reedy synchronized php-1.18/extensions/CentralNotice/CentralNotice.db.php 'r108949'
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Sun Jan 15 02:05:26 UTC 2012

January 14

23:37 Ryan_Lane: shutting down virt1
20:29 Ryan_Lane: stopping opendj on virt1
20:07 Ryan_Lane: stopping pdns on virt1
02:15 Reedy: That was me testing something
02:14 logmsgbot: LocalisationUpdate failed: SVN update of extensions failed
02:04 logmsgbot: LocalisationUpdate completed (1.18) at Sat Jan 14 02:04:46 UTC 2012
01:00 Ryan_Lane: brought pdns on virt1 back up
00:53 Ryan_Lane: stopping pdns on virt1 again to test dns
00:19 Ryan_Lane: powercycling formey

January 13

22:40 Ryan_Lane: force running puppet on all instances
22:40 Ryan_Lane: re-generated certificates for all instances
22:40 Ryan_Lane: deleted all puppet certificates on all instances
22:31 LeslieCarr: restarting ganglia1001
21:40 Ryan_Lane: changing virt1 to be a cname of virt0
21:17 Ryan_Lane: killed pdns on virt1
20:40 Ryan_Lane: stopped pdns on virt1
20:32 logmsgbot: reedy synchronized php-1.18/extensions/Contest/specials/ 'r108843'
20:28 Ryan_Lane: changed NS records for wmflabs.org and wmflabs to point to virt0
20:28 Ryan_Lane: changed recursor to point wmflabs domain to virt0
20:16 K4-713: synchronized payments cluster to r108833
19:30 Ryan_Lane: shutting down virt1 to ensure migration was completed
19:12 LeslieCarr: restarting gmond on cp1043
18:55 LeslieCarr: hard powercycling ms1002
18:48 LeslieCarr: rebooting ms1002 due to kswapd 100% cpu bug https://bugs.launchpad.net/ubuntu/+bug/721896
16:44 mutante: added alswiktionary & alswikibooks to closed.dblist
16:43 logmsgbot: dzahn synchronized closed.dblist
16:33 mutante: syncing InitialiseSettings.php after changing as wiki namespace per bug 33507
16:33 logmsgbot: dzahn synchronized ./wmf-config/InitialiseSettings.php
16:07 mutante: updated blog theme and installed a plugin per RT:2271
14:34 mutante: srv191 - has now fresh OS, re-issued puppet certs, ran puppet, restart memcached, etc. - all back in monitoring
12:59 mutante: PXE booting srv191, installing OS
07:45 Ryan_Lane: disassociated and reassociated some floating IP addresses, to fix NAT issues. Some NAT rules went missing.
07:43 Ryan_Lane: added a grant for mediawiki in the database to fix labsconsole mediawiki outage
07:42 Ryan_Lane: fixed memcached port in mediawiki configuration on labsconsole to fix slowness issue
03:29 Ryan_Lane: switching labsconsole.wikimedia.org address to point to virt0
02:58 Ryan_Lane: dns server is up on virt0
02:57 Ryan_Lane: switched active ldap server in labs to virt0, for nova itself. instances still need to be re-pointed
02:56 Ryan_Lane: switched rabbitmq server in labs to virt0
02:56 Ryan_Lane: switched mysql masters for labs to virt0
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Fri Jan 13 02:05:29 UTC 2012
01:47 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bug 33469 - Enable rollback function for editor group kawiki'
01:39 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bug 33507 for aswiki'
01:09 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bug 33556 - ArticleFeedback settings on Chinese wikipedia'
01:02 logmsgbot: reedy synchronized closed.dblist 'Closing en_labswikimedia, de_labswikimedia, liquidthreads_labswikimedia (resync)'
01:02 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'bug 33468 - Email notifications for eswikibooks'
00:33 Reedy: That was only touch
00:33 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php
00:32 Reedy: srv223 is also out of diskspace
00:29 Reedy: srv219 is out of diskspace
00:28 logmsgbot: reedy synchronized closed.dblist 'Closing en_labswikimedia, de_labswikimedia, liquidthreads_labswikimedia'
00:17 Ryan_Lane: stopping puppet on all virt nodes

January 12

21:54 Ryan_Lane: relabeled port at virt0
21:54 Ryan_Lane: moved new virt0 from squid vlan to public-services2
21:43 Ryan_Lane: rebuilding mobile2 as virt0
21:43 Ryan_Lane: Adding back mgmt info for mobile1, changing mobile2 to virt0
21:11 Ryan_Lane: rebuilding mobile1 as virt0
21:08 Ryan_Lane: renaming mobile1 to virt0
20:54 binasher: installing percona-toolkit on few remaining hardy dbs
20:26 cmjohnson1: shutting down srv178-189 for decommissioning
20:14 binasher: granted the "process" priv to nagios@localhost on all production db clusters
20:07 logmsgbot: reedy synchronized php-1.18/includes/specials/SpecialSearch.php 'r108751'
20:07 LeslieCarr: reassigning ports on asw-b-sdtpa
17:00 notpeter: stop sodium to do manual reinstall
16:33 RobH: adjusting all power strip humidity sensor 2 (floor level) to 12% humidity, as the center rack has the proper levels, floor levels always are low in humidity.
16:17 mutante: after a config change to nrpe_local.cfg and puppet applying the change, the service was not resrted but for some reason all nagios-nrpe-server caught SIGTERM. manually applying the same config change does not cause problems. that caused a Nagios outage until nrpe servers were started again (via dsh)
16:04 mutante: starting nagios-nrpe-server on ALL via dsh to speed up nagios recovery
15:33 mutante: starting nagios-nrpe-server on srv's via dsh
02:04 logmsgbot: LocalisationUpdate completed (1.18) at Thu Jan 12 02:04:31 UTC 2012
00:48 preilly: pushing quick fix for special random
00:48 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php 'update to mobile frontend to fix random link'
00:41 LeslieCarr: added ganglia1002 and ganglia1001 to dns

January 11

23:18 RobH: searchidx1001 offline and powered down until replacement memory arrives (2012-01-13) rt 2208
22:56 RobH: poking searchidx1001 for memory error
22:45 RobH: mw1108 online and ready for install per rt2253
22:42 RobH: mw1099 repaired, ready for os install per rt2252
22:39 RobH: mw1081 ready for install rt2251
22:32 RobH: no its not ;]
22:16 Reedy: lists.wikimedia.org is down
21:53 logmsgbot: reedy synchronized php-1.18/includes/api/ 'r108683'
21:36 RobH: psw1-eqiad mgmt connected
21:24 RobH: leslie is handling the ganglia not starting back up issue even though i caused it to die, yay me
21:22 RobH: updated dns for neon/cobalt to ganglia1001/1002
21:17 RobH: ganglia offline for a moment, sorry folks
21:17 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying MoodBar changes
21:16 RobH: i just took nickel offline by mistake
20:58 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Change shorturl preefix default'
20:57 logmsgbot: reedy synchronized php-1.18/extensions/ShortUrl/ 'r108680'
20:50 RoanKattouw: Applying MoodBar schema changes (index addition and column addition) on all wikis
20:44 logmsgbot: catrope synchronized php-1.18/extensions/ArticleFeedbackv5/ 'Updating AFTv5 to trunk staet'
20:14 Jeff_Green: adjusted firewall rules on payments* to restore ganglia reporting since we switched to nickel
20:11 RoanKattouw: Created AFTv5 tables on testwiki
20:09 logmsgbot: catrope synchronized php-1.18/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r108666'
19:53 cmjohnson1: shutting down srv191 for new install
19:52 cmjohnson1: replaced HDD srv191
19:47 logmsgbot: catrope synchronized php-1.18/resources/startup.js 'touch'
19:46 logmsgbot: catrope synchronized wmf-config/InitialiseSettings.php 'Enable AFTv5 on testwiki'
19:40 RobH: mw1103 hardware issues, disregard nagios flapping
19:37 RobH: mw1102 offline due to bad mainboard until replacement arrives tomorrow or next
19:30 RobH: working on mw1102, disregard flapping
18:40 notpeter: running authdns-update on dobson to pick up new dns temps
18:28 RobH: lvs1003 repaired, now needs install and setup. rt1549 and rt 2241
18:26 notpeter: gracefulling apache on spence to deactivate nmis.w.o (abandoned install of nedi)
16:23 mark: Started rsync of lily:/var/lib/mailman/archives to sodium (in a screen on sodium)
15:49 mark: Started rsync of lily:/var/lib/mailman/data to sodium (in a screen on sodium)
15:39 logmsgbot: reedy synchronized php-1.18/includes/ 'r108626'
15:34 logmsgbot: reedy synchronized php-1.18/includes/ 'revert r108625'
15:32 logmsgbot: reedy synchronized php-1.18/includes/ 'r108625'
15:23 logmsgbot: reedy synchronized php-1.18/extensions/CodeReview/ 'r108623'
15:22 logmsgbot: reedy synchronized php-1.18/extensions/ArticleFeedbackv5/ 'r108623'
15:22 logmsgbot: reedy synchronized php-1.18/extensions/ApiSandbox/ 'r108623'
15:20 logmsgbot: reedy synchronized php-1.18/resources/mediawiki.action/ 'r108622'
15:19 logmsgbot: reedy synchronized php-1.18/includes/ 'r108622'
14:11 mutante: nagios https now serves real SSL cert
14:09 mutante: fixed Apache VirtualHost warnings on spence, NameVirtualHost *:443 in ports.conf, <VirtualHost *:443> in sites-available,..
02:04 logmsgbot: LocalisationUpdate completed (1.18) at Wed Jan 11 02:04:39 UTC 2012
01:42 preilly: pushing updates to Zero Rated Mobile Access extension
01:41 logmsgbot: preilly synchronized php-1.18/extensions/ZeroRatedMobileAccess/ 'push updates to ZeroRatedMobileAccess extension'
01:18 preilly: only activate Zero Rated Mobile Access Extension for test wiki
00:38 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add ZeroRatedMobileAccess extension only on test'
00:34 preilly: pushing ZeroRatedMobileAccess extension to production
00:34 logmsgbot: preilly synchronized wmf-config/InitialiseSettings.php 'add ZeroRatedMobileAccess extension'
00:34 logmsgbot: preilly synchronized wmf-config/CommonSettings.php 'add ZeroRatedMobileAccess extension'
00:33 logmsgbot: preilly synchronized wmf-config/extension-list 'add ZeroRatedMobileAccess extension'
00:30 preilly: push ZeroRatedMobileAccess extension
00:30 logmsgbot: preilly synchronized php-1.18/extensions/ZeroRatedMobileAccess/ 'initial push of ZeroRatedMobileAccess extension'
00:23 LeslieCarr: applying new loopback filter to cr1-eqiad - higher risk of issues

January 10

23:58 preilly: weekly mobile frontend push
23:58 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ 'weekly update to mobile frontend'
22:10 lcarr: broke ganglia redirect on nickel, fixing with next push
19:45 LeslieCarr: stopping gmetad on spence and unmounting the tmpfs drive
18:32 LeslieCarr: restarting gmetad on nickel
11:39 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/MessageGroups.php 'Translate bugfix r108500'
07:20 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/TranslateEditAddons.php 'r108497'
02:05 logmsgbot: LocalisationUpdate completed (1.18) at Tue Jan 10 02:05:15 UTC 2012
01:31 binasher: all varnish servers have been upgraded to 3.0.2
00:38 RobH: db1004 pd8 set to offline per rt 2140, will place call to dell for replacement
00:23 RobH: correction for typo, mw1102, not mw1002
00:23 binasher: repooled cp3002
00:23 RobH: mw1002 coming down for hw testing rt 1656
00:19 binasher: depooling cp3002, upgrading varnish

January 9

23:57 binasher: testing varnish 3.0.2 upgrade on cp3001 (bits)
23:42 binasher: adding two new mobile cache servers running varnish 3.0.2 (cp104[12]) to the m.wiki eqiad vip
22:26 LeslieCarr: ganglia moved to new nickel server
21:36 LeslieCarr: changing gmond source for ganglia3-tip
21:19 RobH: snapshot1001-1004 mgmt online
21:05 RobH: updating dns with snapshot1001-1004 primary ip info
20:56 RobH: updating dns with snapshot1001-1004 mgmt
20:30 logmsgbot: catrope synchronized php-1.18/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r108470'
20:29 logmsgbot: catrope synchronized php-1.18/extensions/ArticleFeedbackv5/api/ApiArticleFeedbackv5.php 'r108470'
20:25 LeslieCarr: replacing the ops@ alias with the new ops list on mchenry as people keep forgetting to email the new list
19:57 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/Translate.php 'Deploy r108469 - bugfix for Translate'
19:43 RobH: torrus dead, kicking
19:32 logmsgbot: nikerabbit synchronized wmf-config/CommonSettings.php 'Updating Translate config 2/2'
19:32 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Updating Translate config 1/2'
19:16 logmsgbot: nikerabbit synchronized php-1.18/includes/parser/Parser.php 'Deploying r108461'
19:07 logmsgbot: catrope synchronized php-1.18/extensions/UploadWizard/resources/mw.UploadWizardLicenseInput.js 'r108459'
18:55 logmsgbot: nikerabbit synchronized php-1.18/extensions/Translate/ 'Deploying translate r108451'
18:54 logmsgbot: nikerabbit synchronized php-1.18/languages/messages/MessagesEn.php 'Updating messagesEn'
18:42 logmsgbot: nikerabbit synchronized php-1.18/extensions/ParserFunctions/ParserFunctions.i18n.magic.php 'Deploying r108449'
18:34 logmsgbot: nikerabbit synchronized php-1.18/extensions/WikimediaMessages/WikimediaGrammarForms.php 'Deploying r108433'
18:30 logmsgbot: nikerabbit synchronized p/extensions/WebFonts/ 'Updating WebFonts r108447'
18:21 logmsgbot: nikerabbit synchronized php-1.18/extensions/Narayam/ 'Syncing Narayam'
18:18 Nikerabbit: running Narayam preference migration script
17:30 jeremyb: the time is now 17:30:30 UTC
17:20 RoanKattouw: Installing (!) NTP on wikitech
17:16 jeremyb: the time is now 17:19:30 UTC
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Mon Jan 9 02:04:47 UTC 2012

January 8

23:20 Reedy: For some reason cp1001-1042 weren't listed in CommonSettings.php XFF, but (at least) 1042 was in service, meaning edits were attributed to it
23:18 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Add cp1001-cp1041'
23:10 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'Add cp1042 to XFF'
21:47 rainman-sr: killed broken search indexer thread on searchidx1 (please note searchidx1 is no longer in use!), and restarted incremental indexing on searchidx2 which was somehow broken
21:43 rainman-sr: someone started incremental updating on searchidx1 ??!!
14:54 apergos: removed old puppet lockfile on brewster, ran by hand
14:47 apergos: cleared out some very large squid logs on brewster, (basically all of them) plus lighty logs, disk was full. restarted squid manually
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Sun Jan 8 02:05:11 UTC 2012
00:43 tfinc: killing long running show_bug.cgi procs on kaulen

January 7

22:30 Reedy: Users reporting slowness while editing. dberror.log shows a few mysql errors for enwiki master and slaves. Few errors on other wikis, mainly enwiki
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Sat Jan 7 02:05:09 UTC 2012

January 6

23:22 RobH: working rt1549 lvs1003 may flap, it is presently not in service due to possible hdd failure
22:55 binasher: db22 is back in s4
22:55 logmsgbot: asher synchronized wmf-config/db.php 'adding db22 back to s4'
21:41 RobH: db1029 powering back up with ssd testing hardware installed
21:35 RobH: db1029 coming down for ssd testing
21:26 RobH: cp1014 and cp1019 hdd controller cables replaced (removed for testing controllers), both can be used normally
21:19 binasher: restoring db22 from a live hotbackup of db1038
21:18 RobH: es1002 back ready for service use per #2220: replace original RAID card in es1002
21:05 binasher: putting db51 into production as an s4 slave
21:05 logmsgbot: asher synchronized wmf-config/db.php 'adding db51 as an s4 slave'
20:57 binasher: started slaving db51 off of db31
20:21 RobH: rt2226 - redeploy db22 for asher
20:19 RobH: db22 reinstalled and booting into OS. No puppet runs yet, now its Asher's problem ;]
20:04 RobH: db22 reinstalling
19:24 binasher: started innodb hot backup of db1038 to db51
18:43 maplebed: s4 database rotation complete. outage duration 36 minutes.
18:37 maplebed: pushed out new db.php setting s4 to read-write
18:37 logmsgbot: ben synchronized wmf-config/db.php
18:35 maplebed: db31 made read-write as the new master for s4
18:31 maplebed: old master for s4 log file db22-bin.000106 log pos 631618956
18:30 maplebed: new master for s4: db31, log file db31-bin.000213 log pos is 205612709
18:24 logmsgbot: asher synchronized wmf-config/db.php 'setting s4 to read only, preparing to make db31 master'
18:21 Reedy: Commons having db issues, db22 (s4 master) has a disk issue
16:02 apergos: restarted lilghty on dataset2
16:01 Reedy: HTTP server (lighttpd?) seems to be down on dataset2
15:46 RoanKattouw: Removing gs_* files in /tmp on srv220 that are >30 min old
15:44 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33556 - ArticleFeedback settings on Chinese wikipedia'
15:43 RoanKattouw: Removed /tmp/mw-cache-1.17 and /tmp/mw-cache-1.17-test on srv220
15:41 Reedy: srv220 / is at 100% usage
15:41 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33556 - ArticleFeedback settings on Chinese wikipedia'
14:34 mutante: saw the log about cp1043/44 being deliberately left broken, but requirement in varnish.pp also broke others, fixed on sq67,68,69 (gerrit change 1802)
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Fri Jan 6 02:05:01 UTC 2012
01:25 binasher: puppet is being deliberately left broken on cp1043 and 1044 until tomorrow
01:23 binasher: backend varnish instance on cp1042 running 3.0.2 is in production for 1/3 of mobile requests

January 5

22:15 preilly: small fix for iPhone vary support
22:15 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php
21:39 Ryan_Lane: rebooting virt1
21:01 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wmgShortUrlPrefix'
21:01 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wmgShortUrlPrefix'
20:08 Reedy: Created ShortUrl tables on testwiki
20:07 logmsgbot: reedy synchronizing Wikimedia installation... : Update extensionmessages
20:05 logmsgbot: reedy synchronized wmf-config/CommonSettings.php 'wmgUseShortUrl'
20:04 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'wmgUseShortUrl'
20:02 logmsgbot: reedy synchronized php-1.18/extensions/ShortUrl 'Pushing ShortUrl files out'
19:08 notpeter: restarting dhcpd on brewster
18:45 preilly: pushing fix for js error on production
18:45 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/ApplicationTemplate.php
18:45 logmsgbot: preilly synchronized php-1.18/extensions/MobileFrontend/javascripts/application.js
18:00 mutante: tarin - added "#includedir /etc/sudoers.d" to sudo config, needs to read /etc/sudoers.d/nrpe for Nagios RAID check
17:49 logmsgbot_: hashar: gallium: cleaned /tmp . Our test suites leak a large amount of files :D
17:49 ^demon: removed chuck norris plugin from jenkins, restarted
16:48 mutante: payments4 - 25 running nginx procs cause a warning - but normal and just raise limit?
16:15 mutante: people claim it was "completely resolved with "2.6.38-10 backport from PPA." (add-apt-repository ppa:kernel-ppa/ppa ...). wanna try that? (or just reboot ms1002 pls)
15:49 mutante: quotes on kswapd problem (that also appeared on other servers): "has nothing to do with swap space or memory".."the kernel process which swaps tasks".."means the kernel is spending more time context switching tasks than it is actually executing the tasks".."you're chasing a ghost if you're trying to tune your swap/memory environment"
15:45 mutante: ms1002 - kswapd 100% CPU - but no swap used and free memory left - this looks like https://bugs.launchpad.net/ubuntu/+bug/721896 again
15:39 mutante: Nagios check_ntp does stuff like: overall average offset: 0 -> NTP OK: Offset unknown| -> NTP CRITICAL: Offset unknown (even though this bug was supposed to be fixed in a version before the one we use)..sigh
15:34 mutante: dataset1 - date was off by ~ 27 hours. known issues RT 216 & 1345 with hardware clock, additionally though Nagios NTP check is still buggy (possibly due to leap seconds ;P) -> http://tech.akom.net/archives/27-Nagios-check_ntp-quits-working-in-2009-with-Offset-unknown.html)
15:14 mutante: lvs1004 - puppet didnt run since 12 hours, looked stuck, "already in progress" on every run. rm /var/lib/puppet/state/puppetdlock, restart puppet agent, finished fine in a few seconds. maybe puppet bug 2888,5246 or related
14:57 mutante: magnesium - memcached runs on default port 11211, but we run all the others on 11000, this causes Nagios CRIT. Is it supposed to run here? (was also on -l 127.0.0.1 only, but init script starts it on all)
14:55 Jeff_Green: searchidx1 /a reached 100%, did the "space issues" maintenance procedure from wikitech search documentation
14:39 mutante: same on srv193
14:35 mutante: srv290 - before restart memcached was running with -m 64 and -l 127.0.0.1 for some reason, causing Nagios CRIT, now it looks like others and recovered
14:32 mutante: restarting memcached on srv290
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Thu Jan 5 02:05:03 UTC 2012

January 4

23:27 logmsgbot: catrope synchronizing Wikimedia installation... : Deploying MoodBar and MarkAsHelpful changes
22:39 Tim: taking srv280 for action=purge slowness investigation
21:20 Ryan_Lane: deploying LdapAuthentication 2.0a and OpenStackmanager 1.3 to virt1
21:13 RoanKattouw: Applying schema changes to moodbar_feedback_response on all wikis (drop index, create index, add column)
19:36 notpeter: restarting dhcpd on brewster
19:13 RobH: dns update successful and none of them fell over
19:12 Reedy: r108070 even
19:12 logmsgbot: reedy synchronized php-1.18/extensions/CentralAuth/specials/ 'r107070'
19:11 RobH: updating dns for mgmt of ms-fe1/2 and other new servers in tampa, as well as search boxen in eqiad
19:04 mutante: srv199 boots but without eth0, NIC1 is Enabled in BIOS but MAC Address "Not Present" - creating hardware ticket
18:55 logmsgbot: catrope synchronized php-1.18/extensions/ArticleFeedbackv5/modules/jquery.articleFeedbackv5/jquery.articleFeedbackv5.js 'r108064'
18:43 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Disable AFTv5 bucketing tracking again'
18:38 mutante: powercycling srv199
18:33 logmsgbot: catrope synchronized php-1.18/resources/startup.js 'touch'
18:30 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Actually bump version number'
18:28 logmsgbot: catrope synchronized php-1.18/resources/mediawiki/mediawiki.user.js 'Revert live hack'
18:24 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'and bump the version number too'
18:22 logmsgbot: catrope synchronized wmf-config/CommonSettings.php 'Enable tracking for AFTv5 bucketing'
18:06 mutante: duplicate nagios-wm instances on spence (/home/wikipedia/bin/ircecho vs. /usr/ircecho/bin/ircecho) killed them both, restarted with init.d/ircecho
18:00 logmsgbot: catrope synchronized php-1.18/resources/mediawiki/mediawiki.user.js 'Live hack for tracking a percentage of bucketing events'
17:52 mutante: knsq11 is broken. boots into installer, then "Dazed and confused" at hardware detection (NMI received for unknown reason 21 on CPU 0). -> RT 2206
17:38 mutante: powercycling knsq11
11:31 logmsgbot: catrope synchronized php-1.18/extensions/ClickTracking/ClickTracking.hooks.php 'r108017'
08:44 logmsgbot: nikerabbit synchronized php-1.18/includes/specials/SpecialAllmessages.php 'r107998'
07:40 Tim: fixed puppet by re-running the post-merge hook with key forwarding enabled, and then started puppet on ms6
07:32 Tim: on ms6.esams: fixed proxy IP address and stopped puppet while I figure out how to fix it
03:25 Tim: experimentally raised max_concurrent_checks to 128
03:17 Tim: on spence in nagios.cfg, reduced service_reaper_frequency from 10 to 1, to avoid having a massive process count spike every 10 seconds as checks are started. Locally only as a test.
02:27 Ryan_Lane: I should clarify that I removed 10.2.1.13 from /etc/network/interfaces, it's still properly bound to lo
02:24 Tim: on spence: setting up logrotate for nagios.log and removing nagios-bloated-log.log
02:22 Ryan_Lane: removing manually added 10.2.1.13 address from lvs4
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Wed Jan 4 02:04:57 UTC 2012
01:43 Nemo_bis: Last week slowness: job queue backlog now cleared on !Wikimedia Commons and (almost) English !Wikipedia http://ur1.ca/77q9b
01:02 logmsgbot: reedy synchronized php-1.18/includes/ 'r107978'
00:45 logmsgbot: reedy synchronized php-1.18/extensions 'r107977, r107976'
00:39 Tim: running purgeParserCache.php on hume, deleting objects older than 3 months
00:38 logmsgbot: reedy synchronized php-1.18/includes/specials/ 'r107975'
00:29 logmsgbot: tstarling synchronizing Wikimedia installation... :
00:27 logmsgbot: reedy synchronized php-1.18/extensions/Nuke/ 'r107974'
00:25 logmsgbot: reedy synchronized php-1.18/extensions/ 'r107970'

January 3

23:00 Tim: on spence: restarting gmetad
22:58 logmsgbot: reedy synchronizing Wikimedia installation... : Pushing r107953, r107955, r107956, r107957
22:47 LeslieCarr: stopping and then starting apache2 on spence to try and lower load
22:29 RobH: added in the lo addres to lvs4, now its working and generating thumbnails
22:09 logmsgbot: reedy synchronizing Wikimedia installation... : Push r107938 r107948
21:45 RobH: ganglia graphs will have missing data for past 30 to 40 minutes
21:45 RobH: spence back online, ganglia and nagios confirmed operational
21:38 RobH: resetting spence and dropping to serial to try to fix it
21:25 RobH: nagios and ganglia down due to spence reboot, system still coming back online
21:21 RobH: spence is unresponsive to ssh and serial console, rebooting
21:14 LeslieCarr: resetting DRAC 5 on spence for management connectivity
21:05 binasher: that fixed it. but how did that happen?
21:05 binasher: ran ip addr add 10.2.1.22/32 label "lo:LVS" dev lo on lvs4
19:36 logmsgbot: reedy synchronized php-1.18/skins/common/images/ 'r107930'
17:36 mutante: killing more runJobs.php / nextJobDB.php processes on a bunch of servers (/home/catrope/badjobrunners)
17:26 RoanKattouw: Stopping job runners on the following DECOMMISSIONED servers: srv151 srv152 srv153 srv158 srv160 srv164 srv165 srv166 srv167 srv168 srv170 srv176 srv177 srv178 srv181 srv184 srv185
15:55 RobH: torrus back, took forever to recompile
15:53 logmsgbot: reedy synchronized wmf-config/InitialiseSettings.php 'Bug 33485 - Enable WikiLove in si.wikipedia'
15:52 Reedy: Created wikilove tables on siwiki
15:46 RobH: torrus deadlocked, kicking
14:00 RoanKattouw: Restarting job runners on srv242 and mw25, those are the last ones that are stuck
13:57 RoanKattouw: Restarting all job runners that are stuck
13:48 RoanKattouw: Restarting job runner on srv236, seems to be stuck
02:02 logmsgbot: LocalisationUpdate completed (1.18) at Tue Jan 3 02:05:21 UTC 2012

January 2

23:36 Reedy: Seems to potentially be an issue with job runners, enwiki backed up to over 90k over the last week or so. Needs investigating
23:18 logmsgbot: tstarling synchronized php-1.18/includes/parser/Parser.php 'r107856'
22:58 logmsgbot: tstarling synchronizing Wikimedia installation... :
18:08 logmsgbot: nikerabbit synchronized wmf-config/InitialiseSettings.php 'Bug 33368: WebFonts on bpywiki'
18:05 logmsgbot: nikerabbit synchronized php-1.18/languages/messages/ 'i18ndeploy r107843'
18:04 logmsgbot: nikerabbit synchronized php-1.18/extensions/WebFonts/WebFonts.i18n.php 'i18ndeploy r107843'
16:58 mutante: installed SiteMap extension on Bugzilla - soon bugs should be googleable (per BZ:33406)
16:33 mutante: upgraded Bugzilla from 4.0.2 to 4.0.3 (http://www.bugzilla.org/releases/4.0.3/release-notes.html#v40_point) (RT #2194)
14:47 mutante: cleaned out gammu spool to stop sms bomb - sorry. deamon runs again now though..
14:36 mutante: fixed gammu-smsd on spence per wikitech "Nagios#Fixing_the_USB_dongle" (sending out queued SMS now )
14:30 mutante: puppet ran on spence, ganglia also seems ok despite the errors i logged before. gammu-smsd cant find device again though
14:03 mutante: spence / gmetad - RRD_update .. illegal attempt to update using time .. last update time is .. (minimum one second step)
13:57 mutante: gmond complains about missing kernel modules on spence when trying to start on boot
13:54 mutante: spence down, no ssh, no mgmt output, powercycling it ..
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Mon Jan 2 02:04:47 UTC 2012
00:08 logmsgbot: tstarling synchronized php-1.18/includes/media/SVGMetadataExtractor.php 'r107792'

January 1

21:28 Ryan_Lane: restarted pdns-recursor on dobson
21:26 Ryan_Lane: restarted pdns on ns2 about an hour ago
09:46 apergos: restarted lucene search on srch 10, 11, then later on 3,4,9,1
09:35 apergos: removed log.1 from /a/search/logs on search6, it was 35gb
03:55 Tim: fixed broken package on search7 and search11
02:01 logmsgbot: LocalisationUpdate completed (1.18) at Sun Jan 1 02:04:30 UTC 2012
01:36 Tim: adjusted FD limit in /etc/init.d/lsearchd on all search servers with sed
01:34 Tim: increased FD limit on search6 and restarted lsearchd
00:46 Tim: removed some logs on search6 to fix /a disk space exhaustion