Re: Site stability status

Thanks for your hard work, Renoir!


On Tue, Sep 2, 2014 at 12:58 PM, Renoir Boulanger <renoir@w3.org> wrote:

> Hi all,
>
> I realized after a talk with Doug that I forgot to give a proper site
> status. Here’s the summary.
>
> During the last month we had a few intermittent down times [0][1] and
> the cause was a set of software performance problems.
>
> The heart of the problem was because we were using software that were in
> their end-of-life. They had known memory leaks problems and weren’t
> patched anymore.
>
> My work of the last two weeks was then to rebuild server configuration
> to use up to date versions. I had a few quirks with proper set of
> software versions, but it should be fixed now.
>
> * MediaWiki from 1.22wmf5 to 1.24wmf16
> * Ubuntu server VMs from 10.04 LTS to 14.04 LTS for both app and db
> nodes (6 VMs in total)
> * MySQL 5.1 to Percona "XTraDB" Cluster MySQL 5.6
>
> There are a few things to tidy up, but the overall stability should be
> improved.
>
> The next steps i see for our server setup are:
>
> * Upgrade Piwik and Bug Genie versions
> * Create a mirror of our complete setup[2]
> * Upgrade Ubuntu 12.04 LTS to 14.04 LTS nodes
> * Finish up removing hardcoded passwords, publish on github
> * Improve logging system
>
>   [0]: http://status.webplatform.org/post/94249032910
>   [1]: http://status.webplatform.org/post/94144481705
>   [2]:
> http://lists.w3.org/Archives/Public/public-webplatform/2014Aug/0058.html
>
> --
> Regards,
>
> Renoir Boulanger  |  Developer operations engineer
> W3C  |  Web Platform Project
>
> http://w3.org/people/#renoirbhttps://renoirboulanger.com/  ✪
> @renoirb
> ~
>
>

Received on Tuesday, 2 September 2014 17:42:53 UTC