#1  
Old 08-15-2007, 09:32 PM
matta matta is offline
TekTonic Principal
 
Join Date: Aug 2006
Posts: 873
Default Outage: hsvz16.dal

Today a drive in the hsvz16.dal.tektonic.net's RAID-10 array went bad. We asked our datacenter to swap out the bad drive with a new one. What happened next is that the ports were mis-labeled by the datacenter and they swapped out the other drive of the RAID-1 subunit, causing a server crash.

We are attempting what we can to salvage the data and get hsvz16.dal.tektonic.net back up and operational.

UPDATE: It was the SuperMicro drive backplane that had the wires twisted. This will be checked on all setups in the future.
Reply With Quote
  #2  
Old 08-15-2007, 09:51 PM
psalzman psalzman is offline
Junior Member
 
Join Date: Aug 2007
Posts: 4
Default

The backplane had wires twisted and this caused the harddrive to short? Or did the drive fail and someone ignore which one had an amber light and which one was green?

I assume the RAID-10 volume is fine, but what was on the RAID-1 stripe that was lost?
Reply With Quote
  #3  
Old 08-15-2007, 09:56 PM
matta matta is offline
TekTonic Principal
 
Join Date: Aug 2006
Posts: 873
Default

This setup (3Ware 9550SX) doesn't do an amber led, but it does do drive activity and that should have looked for (the one that isn't blinking).

The wires were crossed inside the chassis and that caused the confusion in the mapping of ports.

1 RAID-1 set of a RAID-10 set is completely gone which has destroyed the RAID-0 stripe. Even putting all the drives back in exactly as they were lists it as degraded and unusable.
Reply With Quote
  #4  
Old 08-15-2007, 10:03 PM
psalzman psalzman is offline
Junior Member
 
Join Date: Aug 2007
Posts: 4
Default

Crap, OK, I misread RAID-1 for RAID-0. I see what you're talking about now. I'm not familiar with the 9550SX cards, but I've found in the past with some Dell Perc and Adaptec you occasionally have to do a power cycle to get them to find the valid information w/ the failed drive removed. It'll be degraded until its finished rebuilding, but not corrupted.

I just performed a server-side backup a few days ago, but I'm guessing those were stored on the local disks too?
Reply With Quote
  #5  
Old 08-15-2007, 10:08 PM
matta matta is offline
TekTonic Principal
 
Join Date: Aug 2006
Posts: 873
Default

We are performing a lot of different configurations to try to find one that will boot. If we don't within 15-20 minutes we've exhausted what we can try and everything is gone. We will then at least attempt to have the VPS's re-created as quickly as possible so those with backups can start restoring.

Server side backups are stored on the server, yes. This is the primary difference between the server-side and client-side backups in the HSPc CP.

To those who do not have a backup strategy in place please contact us and we can discuss various options and help you to develop your own backup strategy for any possible future issues similar to this.
Reply With Quote
  #6  
Old 08-16-2007, 03:19 AM
matta matta is offline
TekTonic Principal
 
Join Date: Aug 2006
Posts: 873
Default

All VPS's have been re-created based off the information contained in HSPComplete. If you cannot access your VPS please contact support to determine the cause. We are very sorry this this problem.
Reply With Quote
  #7  
Old 08-16-2007, 07:36 AM
apn3a apn3a is offline
Junior Member
 
Join Date: Dec 2006
Posts: 15
Default

Hello
You mean we lost everything? I can't access virtuosso, plesk and all my sites are dead!
Reply With Quote
  #8  
Old 08-16-2007, 09:20 AM
gluek gluek is offline
Junior Member
 
Join Date: Aug 2007
Location: Moscow, Russia
Posts: 3
Send a message via Skype™ to gluek
Default

Quote:
Originally Posted by apn3a View Post
Hello
You mean we lost everything? I can't access virtuosso, plesk and all my sites are dead!
My Control Panel work, but ssh password wasn't accepted. And restore of backup procedure doen't work!
Reply With Quote
  #9  
Old 08-16-2007, 01:57 PM
matta matta is offline
TekTonic Principal
 
Join Date: Aug 2006
Posts: 873
Default

Regarding access to the server the root passwords may not be what you think they are. If you contact our support via support@tektonic.net or Live Chat @ www.tektonic.net they can reset your root password.

HSPComplete is still showing the server-side backups are being there -- this is a database issue only, the actual backups are not there and cannot be restored.
Reply With Quote
  #10  
Old 08-16-2007, 02:30 PM
apn3a apn3a is offline
Junior Member
 
Join Date: Dec 2006
Posts: 15
Default

Quote:
Originally Posted by matta View Post
Regarding access to the server the root passwords may not be what you think they are. If you contact our support via support@tektonic.net or Live Chat @ www.tektonic.net they can reset your root password.

HSPComplete is still showing the server-side backups are being there -- this is a database issue only, the actual backups are not there and cannot be restored.
I did already contact (twice live chat) and 2 emails and still i'm in waiting list in order to know my password. Plesk not installed yet as well. I have a DEAD forum with 13.000 member because of YOUR fault and not the opportunity to setup a domain with a index page to inform them
Reply With Quote
Reply
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 06:09 PM.

Powered by vBulletin® Version 3.8.2
Copyright ©2000 - 2010, Jelsoft Enterprises Ltd.