[ale] Linux HA

James P. Kinney III jkinney at localnetsolutions.com
Wed Oct 31 16:35:56 EDT 2007


On Wed, 2007-10-31 at 15:28 -0400, Charles Shapiro wrote:
> I believe that what you are looking for is what one ALE lecturer
> called a "STONITH Interface" ("Shoot the Other Node in the Head").  

And the poor man's STONITH interface - the X10 appliance module.
> 
> -- CHS
> 
> 
> On 10/31/07, Christopher Fowler <cfowler at outpostsentinel.com> wrote:
>         I've been testing some stuff in regards to Linux HA
>         today.  Normally we
>         sell 2 servers.  One is a "master" and the other is a
>         "slave".  I've
>         been testing today the capability to use a floating IP address
>         and allow 
>         the slave to take over for the master.  I have a few issues
>         that do need
>         to be resolved before I can roll this out.  In my lab and colo
>         I
>         experienced 2 issues that HA could not have saved me from.
>         
>         #1.  Kernel not responding. 
>         
>         In this case I can ping the server.  All connect()'s from
>         clients
>         seem to hang until they timeout.  In this scenario my slave
>         will take
>         the IP address but the master will still have it and still
>         answer pings. 
>         Also he will still answer arp requests.  HA can't save me
>         here.
>         
>         #2.  Kernel and programs still respond but disks are off
>         
>         In this case I/O to drives was hosed.  Apache would serve up
>         pages that
>         were in memory but any request in a page on disk would result
>         in that 
>         connection hanging forever.  No I/O possible.  In this
>         scenario the
>         heartbeat agent will probably still see a server that is
>         working but the
>         reality would be a DoS condition.  Also upon seeing this issue
>         I'm still 
>         left with a server who will not relinquish his IP address.
>         
>         In both cases it seems my only recourse is to allow my slave
>         to also
>         control the power of the master.  If #1 and #2 exist the slave
>         can
>         simply take the floating IP and make a determination if he
>         needs to kill 
>         power.  If so he can kill power and then the master can be
>         repaired.
>         
>         Ideas?
>         
>         Chris
>         
>         
>         
>         _______________________________________________
>         Ale mailing list
>         Ale at ale.org
>         http://www.ale.org/mailman/listinfo/ale
> 
> 
> -- 
> This message has been scanned for viruses and 
> dangerous content by MailScanner, and is 
> believed to be clean. 
> _______________________________________________
> Ale mailing list
> Ale at ale.org
> http://www.ale.org/mailman/listinfo/ale
-- 
James P. Kinney III          
CEO & Director of Engineering 
Local Net Solutions,LLC        
770-493-8244                    
http://www.localnetsolutions.com

GPG ID: 829C6CA7 James P. Kinney III (M.S. Physics)
<jkinney at localnetsolutions.com>
Fingerprint = 3C9E 6366 54FC A3FE BA4D 0659 6190 ADC3 829C 6CA7
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part




More information about the Ale mailing list