Thank you for your patience.
November 2011 Archives
Thank you for your patience.
No new registrations or updates will be possible during this window.
Existing .fm domain names will not be impacted
The downtime should be only about 10 minutes
pemvzwin08
pemvzwin11
pemvzwin13
Affected VPS's:
pemvzwin08:
78.153.208.25
78.153.208.106
78.153.208.113
78.153.209.121
78.153.209.208
78.153.210.105
78.153.210.116
78.153.211.6
78.153.211.18
78.153.211.20
78.153.211.25
78.153.211.34
78.153.209.204
78.153.208.139
pemvzwin11:
78.153.196.51
78.153.196.53
78.153.196.54
78.153.196.55
78.153.196.56
78.153.196.59
78.153.196.61
78.153.196.64
78.153.196.67
78.153.196.72
78.153.196.73
78.153.196.76
78.153.196.79
78.153.196.83
78.153.196.146
78.153.196.181
78.153.196.182
78.153.196.183
78.153.196.184
78.153.196.189
78.153.196.190
78.153.196.196
pemvzwin13:
78.153.196.130
78.153.196.131
78.153.196.133
78.153.196.135
78.153.196.138
78.153.196.144
78.153.196.145
78.153.196.152
78.153.196.155
78.153.196.153
78.153.196.161
78.153.196.167
78.153.196.170
78.153.196.171
78.153.196.172
78.153.196.173
78.153.196.177
78.153.196.178
78.153.196.180
78.153.196.62
78.153.197.33
This will affect the following shared hosting servers:
pemlinweb01.blacknight.com
pemlinweb07.blacknight.com
Update: Updated expected window to more accurately reflect time required to perform the maintenance tasks.
Firstly the idea of webspaces can now really be forgotten. They have moved all ftp, website settings (php, error logging etc) has been moved to Sites & Domains > $Domainname we've updated the knowledge base article for this, please see:
https://support.blacknight.ie/index.php?_m=knowledgebase&_a=viewarticle&kbarticleid=274
Secondly the Linux web hosting now defaults to adding all domains to one webspace. It will now automatically populate the location. e.g.
if you are adding the domainname "myidea.ie" the location will be set to /myidea.ie - this means that you should not be able to accidentally overwrite another website on the system. It also means that for new domains you should be reliably able to guess its location within your ftp location.

pemlinweb32 is currently offline due to a ddos attack.
Update 15:00 The server is back online
Update 19:20 The server has been attacked again, we have removed access to the server again. We will update again once we have restored access
The server is back online
What's affected:
Sites on pemlinweb59 or the following IPs:
78.153.214.54 78.153.214.141 78.153.214.171 78.153.214.193
Sites on pemlinweb60 or the following IPs:
78.153.214.55 78.153.214.128 78.153.214.172
We'll have more information when it's available.
Update 15:00 Both servers are back online now
mysql873.cp.blacknight.com
mysql870.cp.blacknight.com
We anticipate they will return in 5-10 minutes, but will update this post should there be any further developments.
Update @ 09:38: Both servers are now fully back up. Apologies for any inconvenience.
This was resolved at 4am
We've noticed some errors in the logs that indicated a vzfs (virtuozzo file system) issue on this server. These inconsistancies could cause issues if they're not fixed immediately. So we've taken this server offline and we're running a utility against it to resolve the problem.
We endeavour to bring the server back online as soon as possible.
The server is back online now
The downtime should be only about 10 minutes
pemvzwin03
pemvzwin08
pemvzwin09
pemvzwin11
pemvzwin13
Affected VPS's:
pemvzwin03:
78.153.208.229
78.153.208.230
78.153.208.248
78.153.209.12
78.153.208.96
78.153.208.185
78.153.209.23
78.153.209.26
78.153.209.29
78.153.209.43
78.153.209.44
78.153.209.48
78.153.209.88
78.153.209.98
78.153.209.97
78.153.209.117
78.153.209.123
78.153.209.153
78.153.208.19
pemvzwin08:
78.153.208.25
78.153.208.106
78.153.208.113
78.153.209.121
78.153.209.208
78.153.210.105
78.153.210.116
78.153.211.6
78.153.211.18
78.153.211.20
78.153.211.25
78.153.211.34
78.153.209.204
78.153.208.139
pemvzwin09:
78.153.211.74
78.153.211.77
78.153.211.80
78.153.211.81
78.153.211.83
78.153.211.84
78.153.211.94
78.153.211.92
78.153.211.113
78.153.211.121
78.153.211.125
78.153.211.127
78.153.211.132
78.153.211.141
pemvzwin11:
78.153.196.51
78.153.196.53
78.153.196.54
78.153.196.55
78.153.196.56
78.153.196.59
78.153.196.61
78.153.196.64
78.153.196.67
78.153.196.72
78.153.196.73
78.153.196.76
78.153.196.79
78.153.196.83
78.153.196.146
78.153.196.181
78.153.196.182
78.153.196.183
78.153.196.184
78.153.196.189
78.153.196.190
78.153.196.196
pemvzwin13:
78.153.196.130
78.153.196.131
78.153.196.133
78.153.196.135
78.153.196.138
78.153.196.144
78.153.196.145
78.153.196.152
78.153.196.155
78.153.196.153
78.153.196.161
78.153.196.167
78.153.196.170
78.153.196.171
78.153.196.172
78.153.196.173
78.153.196.177
78.153.196.178
78.153.196.180
78.153.196.62
78.153.197.33
Update 07:55
All servers are back online and VPS's started
Update @ 10:55: The container has had to be taken offline for a raid resync. We will update with more information shortly.
Update 11:25 We are going to migrate the server to another hardware node as this will provide the quickest solution. This will take 45 mins approximately
Update 13:25 The server is coming back online on the original hardware now, we will look at moving it to new hardware in the coming days.
Once it is back up we will update this page.
Update: This post previously referenced the wrong server. Updated to reflect correct MySQL server affected.
Update @ 15:10: This server is now back up.
When: Tuesday 22/11/11 18:00
Whats Affected:
pemlinweb31.blacknight.com 81.17.254.38
pemlinweb01.blacknight.com 81.17.254.70
pemlinweb07.blacknight.com 81.17.254.88
The replacments should be completed within 45 minutes.
We'll update here once the servers are back online
When: 16:20
VPS's Affected:
78.153.210.17
78.153.210.60
78.153.210.115
78.153.210.144
78.153.210.153
78.153.210.168
78.153.211.145
78.153.211.146
78.153.211.149
78.153.211.150
78.153.211.151
78.153.211.158
78.153.211.162
78.153.211.173
78.153.211.180
78.153.208.27
78.153.208.134
78.153.208.194
78.153.208.214
78.153.209.16
78.153.209.77
78.153.209.113
78.153.209.192
78.153.209.235
78.153.211.65
We are migrating PEMVZMPS19 to new hardware tonight to allow for both drives to be replaced.
When: 21st November at 20:00. The maintenance window will be up to 2 hours for each node. pemlinweb67 from 20:00 and pemlinweb68 from 21:30
What's affected?
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
Whats affected:
- PEMVZMPS33
- pemlinweb32.blacknight.com (81.17.254.44)
- pemlinweb33.blacknight.com (81.17.254.48)
Update 13:50
The server is booting and running an fscheck
Update 14:00
Both pemlinweb32 and pemlinweb33 are back online now
When: 09:45
VPS's Affected:
78.153.210.17
78.153.210.60
78.153.210.115
78.153.210.144
78.153.210.153
78.153.210.168
78.153.211.145
78.153.211.146
78.153.211.149
78.153.211.150
78.153.211.151
78.153.211.158
78.153.211.162
78.153.211.173
78.153.211.180
78.153.208.27
78.153.208.134
78.153.208.194
78.153.208.214
78.153.209.16
78.153.209.77
78.153.209.113
78.153.209.192
78.153.209.235
78.153.211.65
Update 10:10
The drive has been replaced and the server has booted, the VPS's are now starting, they should all be started shortly.
Update 10:35
All VPS's are now started
We have taken both facilities offline as there are some database issues and inconsistencies that need to be repaired. Since new orders and renewals etc., use the database we have disabled everything while working on the issue
If you need to change nameservers on a domain name please contact our support desk.
UPDATE 0921 - we brought the system back online around midnight last night
We are currently experiencing difficulties with one of our Linux Shared Hosting servers. We have dispatched an Engineer to investigate.
Update - 19:40: This server has returned to normal, and we apologise for the unexpected downtime.
Our store and billing are currenly unavailable. We are investigating, and hope to have these back online as soon as possible.
Update: 13:50 - these services have now returned to normal.
We are migrating PEMVZMPS19 to new hardware tonight to allow for both drives to be replaced.
When: 17th November at 20:00. The maintenance window will be 1 hour.
What's affected?
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
We will update this blog post once everything is completed.
UPDATE 19:48 - This has been rescheduled due to current issues that have arisen.
Update: Friday 18th @ 11:00: This issue was fully resolved last night at 21:00.
As part of our ongoing commitment to bring you the latest in technology improvement, we are performing a major upgrade to our Windows VPS platform. The next server will be upgraded next Thursday, 17/11/11, at 23:00 GMT.
The following VPSs are on this server, and will be offline for the period of this upgrade, which should take no more than 90-120 minutes. We apologise for this downtime, and hope no inconvenience will be caused.
Effected VPSs
78.153.208.160
78.153.209.232
78.153.209.161
78.153.210.20
78.153.210.160
78.153.209.166
78.153.210.203
78.153.210.81
78.153.208.217
78.153.208.163
78.153.210.28
78.153.210.88
78.153.209.142
78.153.210.126
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
We'll update this post once completed. Thanks as always for your understanding.
UPDATE: This is completed.
What's affected?
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
Update 09:40
The host server pemvzmps19 is back up but we are still working on finding what is causing the issue so pemlinweb67 and pemlinweb68 are not yet back online fully. We will update once they are.
Update 10:40
Both servers have been online now for an hour without any issues, we will keep a close eye on this server.
What's affected?
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
Update 15:05
Both servers are back online, we will investigate the issue.
As a result of this work the registry will be offline for new registrations and updates between 1300 and 1500 UTC on Saturday November 12th 2011
Existing domain names will continue to resolve as normal
As part of our ongoing commitment to bring you the latest in technology improvement, we are performing a major upgrade to our Windows VPS platform. The next server will be upgraded next Monday, 14/11/11, at 23:00 GMT.
The following VPSs are on this server, and will be offline for the period of this upgrade, which should take no more than 90-120 minutes. We apologise for this downtime, and hope no inconvenience will be caused.
Effected VPSs
78.153.208.229
78.153.208.230
78.153.208.248
78.153.209.12
78.153.208.96
78.153.208.185
78.153.209.23
78.153.209.26
78.153.209.29
78.153.209.43
78.153.209.44
78.153.209.48
78.153.209.88
78.153.209.98
78.153.209.97
78.153.209.117
78.153.209.123
78.153.209.153
78.153.208.19
There is a slight drawback - you can no longer send e-mail without explicitly turning on SMTP authentication in Outlook, Apple Mail etc.
We have gathered a few articles for you to help you out on this:
1) https://support.blacknight.ie/index.php?_m=knowledgebase&_a=viewarticle&kbarticleid=472
- this gives you instructions on how to enable this in most common e-mail clients.
OR
2) http://wiki.blacknight.com/index.php/SMTP
- this gives you a list of SMTP servers for most Irish ISPs, down the bottom is a link to a page that has a list of ISPs for many ISPs outside of Ireland.
An ISP is the company that provides your broadband, cable, dsl, dial up, satellite net connect etc.
I've requested that a newsletter go out to all customers with the above information in it as a technical notice for the next few runs.
Update: 09:35
Webmail users using Firefox 4 or IE 7 or above can use our "alternative" webmail application here: https://altmail.blacknight.com
We are migrating PEMVZMPS67 to new hardware tonight to allow for both drives to be replaced.
When: Tuesday 8th November at 21:00. The maintenance window will be 1 hour.
What's affected?
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
We will update this blog post once everything is completed.
Update 21:55
pemlinweb68 is complete and back online, pemlinweb67 is still ongoing but should complete shortly.
Update 22:30
The migration has completed now and both servers are online, sorry for the delay but it was important to get it completed.
During this period updates, registrations and whois will not be available.
Existing .eu domain names will continue to resolve as normal.
Update: 10:45: We have been busy working away trying to resolve this issue. At the moment however the cause of the issue isn't at all clear and as such it's proving difficult to get a fix for it. This system has been stable since the last round of hardware updates we put in place a couple of weeks ago. The only thing that has changed is that Compellent our SAN vendor swapped out the iSCSI cards in the two SAN controllers yesterday. This should not have had a negative impact on the system however it appears that it has. So we're working with them to find the cause of the problem.
Update: 11:40: We are still working on this issue. It's the top most priority for our engineering and support teams this morning.
Update: 13:00: This issue is still on going. Unfortunately we've not had made any progress in finding a cause for the slow down.
Update: 13:25: We're having people getting abusive to our helpdesk staff. This is not helpful for anyone. The issue at the moment is that while they are working properly they're not fulfilling their duties and thus causing this service issue for you all. We are still working on this issue and we are investigating all avenues currently including blocking certain services to see if it's some sort of inbound attack on the mail servers.
Update: 14:30: I've removed some of the previous commentary from this thread as it was causing people issues. I'm sorry about that. Right now we're on the phone to Compellent and we're hoping that their escalation time has found something in the logs we sent them.
Update: 14:55: We have currently taken the entire system offline completely and we're examining each part. The Qmail cluster is made up of 4 service groups.
1) SAN + NFS server
2) POP/IMAP/SMTP servers
3) Authentication - LDAP and WHOSOND
4) Mail Scanning / Anti Spam prevention.
We're fairly confident that Groups 3 and 4 are functioning perfectly as we're not seeing the type of issues you would see if they were having issues. So that leaves the pop/imap and SAN systems. The SAN system had some cards replaced yesterday by Compellent so we immediately thought that this was the cause of the problem and asked them to give us back the old cards. They're told this isn't possible. We've had 2 x 1hour long phone calls with them so far today where we went over all the metrics on the SAN. Disk latency, network latency, volume latency, IO throughput etc. Everything on the SAN looks normal. So that leaves the NFS server + NFS clients. We would normally see upward of 300Mbit/s of traffic between the clients and the server, today this is showing as 10-20Mbit/s so it's fairly obvious that the problem is entered around NFS. This is where we are now concentrating all of our efforts. To figure out what is causing this and to fix it.
Update: 15:45: A number of people who forward their email onto gmail / hotmail etc have been getting their email all day. This is expected. SMTP inbound i.e. mail delivery from others into us is working ok. The issue is the pop/imap connections from your e-mail clients and are problematic. For those that asked, all the servers are back online now. We're still seeing the performance issue after the tweaks / changes we've made but forwarding should be working ok right now. Again please accept our sincerest apologies for the issues this is causing you all.
Update: 16:50: Sorry about the previous comment. It was a direct response to some customers having issues with forwarding. E-mail is still down but no e-mail will be lost. Again sorry about this outage, it's the single longest outage we have ever had. It is the number 1 priority and has been all day.
Day Changed to November 8th:
Update: 08:45: POP and IMAP have been switched back on. During the night we moved back to mailstore1 and we also converted the mail system away from Courier-IMAP to Dovecot. This change we hope brings significant performance improvements through better indexing and logging. SMTP will take a while to turn back on unfortunately. ETA for smtp is now 11am.
Update: 09:15: People are saying to our helpdesk that they're having problems with IMAP connections. They can't sync folders. We're investigating this now.
Update: 09:45: POP3 seems to be working ok for most customers. IMAP is intermittent and we're trying to figure that out. Webmail relies heavily on IMAP, so when IMAP is fully working so will Webmail.
Update: 10:44: We are working our way through some file permission issues. Once we get these sorted we'll have everything backup. The main issue right now is e-mail delivery and IMAP/Webmail access. We are not going to make the 11am Deadline on this unfortunately. The ETA is being pushed onto Midday.
Update: 11:55: Right now we have e-mail flowing from the general internet and our inbound scanning boxes into Qmail. So people who are able to get onto POP3 will begin receiving email in the next while. We estimate around 1,000,000 or so e-mails are queued for delivery, a lot of which will bounce because they're spam messages. So far we've seen around 250k of these go into the local delivery queues on the mail servers. So things are progressing all beit slower than you would like. The reason for this is that we have an abnormally large number of users trying to get their e-mail because of the prolonged outage.
Update: 12:45: We have been working with Parallels to get Dovecot working properly. Dovecot is built to work with NFS storage and is programmed in such a way that it is NFS friendly. We have got it working on 2 of the 4 mail servers currently and we've processed well over 500k mails and delivered them to your inboxes. Some of you may also have noticed that SMTP is working but it's still a little patchy due to the high volume of inbound e-mail however it's not as bad as it was at 11:30. There is still a fair bit of e-mail to get through right now but the system is handling it very well.
Update: 14:10: All e-mail has been delivered to their respective mailboxes at this stage. POP3 is working but not on SSL. IMAP and SMTP are intermittent still but we're close to having those resolved. Also as mentioned earlier IMAP being offline or not working fully means webmail isn't working yet. ETA for full restoration is another 2 hours unfortunately.
Calling support to look for an update is futile as the engineering team are putting the updates here first and passing the url onto support. They do not know more than the information is being put here.
Update: 16:15: We believe we have nailed down the right combination of limits for IMAP to be stable. We made some changes about 15 minutes ago and we're monitoring connections to it right now. Once we deem it stable we'll turn webmail back on as we're acutely aware that a number of customers only use Webmail.
Update: 17:10: We turned webmail back on at 16:20 this evening. We've been monitoring it closely and so far we're happy with the performance. As of now this issue is finally resolved.
A few points to note:
1) if you used to pop mail and leave it on the server, you'll have to re-download all your e-mail. This is unfortunately unavoidable.
2) we have moved away from Couier-IMAP to Dovecot. Dovecot does some very smart caching on the mail server and this appears to be doing great things for performance.
3) pop before smtp is no longer supported. We appreciate that this might cause issues for customers but unfortunately we can't turn it back on.
We will post an update on our main company blog and here on the status blog with further information about this issue once we've had time to diagnose it fully and produce a report for the management team here.
All Services should be functioning normal as of 16:20 this evening.
While the issue at hand isn't causing a lot of problems it could do in the coming hours. Compellent have dispatched an engineer with all new cards for both controllers. The SAN in this instance provides the storage for all of the Qmail services that we currently operate. The connections from the san to the mailstore and other services are working ok. i.e. all 8 logical paths to the storage network are working but we are seeing a high error rate on the ports in the controllers which could cause performance issues.
At approx 17:00 today we'll be replacing all the cards in the SAN while the it is online. This can be done because it's built in a resilient fashion. We'll post an update in a few hours once we have firm timelines from Compellent.
This affected inbound e-mail into our spam scanning server that sits in the Cloud. No e-mail was lost during this window. It was resolved around 00:00 last night.
VPS affected:
78.153.208.024
78.153.208.183
78.153.209.035
78.153.209.104
78.153.209.186
78.153.209.221
78.153.210.019
78.153.210.059
78.153.210.102
78.153.208.061
78.153.210.208
78.153.210.131
78.153.211.022
78.153.209.236
78.153.208.055
78.153.211.045
78.153.211.050
78.153.209.220
78.153.208.253
78.153.211.176
78.153.209.171
78.153.211.036
78.153.208.227
78.153.209.101
78.153.210.013
78.153.210.252
78.153.211.026
78.153.211.058
78.153.211.124
78.153.211.129
Update: 08:50: It looks like a drive failure caused the raid array to go offline on this host. The machine is backup now and we'll replace the drive today.
All the VPS are booting now, quite a number are already back online.
Update: 09:30: There are 9 VMs left to boot right now. Their VEIDs are:
2279
2258
2254
3073
2484
3062
2877
2221
2311
The reason for this is that a quota check / fsck equivalent is required to boot the VMs after the node went offline.
Update: 10:25: There are 2 containers still running disk checks, they are:
2484
2877
We believe they should be back at the latest 11am.
Update 11:45 All containers have now started.
As part of our ongoing commitment to bring you the latest in technology improvement, we are performing a major upgrade to our Windows VPS platform. The second server will be upgraded next Monday, 07/11/11, at 23:00 GMT.
The following VPSs are on this server, and will be offline for the period of this upgrade, which should take no more than 90-120 minutes. We apologise for this downtime, and hope no inconvenience will be caused.
Effected VPSs
78.153.208.116
78.153.208.127
78.153.208.28
78.153.209.211
78.153.208.168
78.153.208.172
78.153.208.169
78.153.208.171
78.153.209.151
78.153.208.62
78.153.208.176
78.153.208.179
78.153.208.184
78.153.210.75
78.153.208.222
78.153.208.223
78.153.209.118
78.153.209.160
78.153.209.164
78.153.209.107
We been alerted to a failed disk within the RAID array of PEMVZMPS67. Because of this we are going to bring this nodes offline tonight to replace the dead disk.
When: Wednesday 2nd November at 18:10. The maintenance window will be 1 hour.
What's affected?
78.153.215.160 pemlinweb67.blacknight.com
78.153.215.161 pemlinweb68.blacknight.com
We will update this blog post once everything is completed.
Update 18:20
This completed at 18:20
Services affected: Some access to MySQL servers, some access to some websites, Intermittant e-mail access.
Symptoms were but not limited to: Some shared hosts not being able to connect to mysql servers with unknown host errors. Some dedicated and colocated servers were inaccessible because the IP space they were on predates this data centre. Various oddities within the network that people might have observed. DNS related lookup problems which may have caused slow downs for login attempts to various systems.
Update: 12:20: this issue is fully resolved now.
As part of our ongoing commitment to bring you the latest in technology improvement, we are performing a major upgrade to our Windows VPS platform. The first server will be upgraded tomorrow (Wednesday 02/11/11) at 23:00 GMT.
The following VPSs are on this server, and will be offline for the period of this upgrade, which should take no more than 90 minutes. We apologise for this downtime, and hope no inconvenience will be caused.
Effected VPSs
78.153.208.57
78.153.208.86
78.153.208.91
78.153.208.94
78.153.208.111
78.153.208.67
78.153.209.136
78.153.208.155
78.153.211.69
Servers Affected:
pemlinweb95 78.153.215.233
pemlinweb96 78.153.215.234
pemlinweb97 78.153.215.235
pemlinweb98 78.153.215.236
Update @ 12:04pm: Servers are back up and we will investigate the cause further.