
Re: beowulf type cluster for larc
Well, I tried for several hours on Wednesday with the help of a few other people to do automated installs onto the other 7 nodes of the cluster using FAI. One node is acting as the head node; one of its NICs goes to UC's network, and the other goes to the crappy old 10 megabit switch, which the other 7 nodes plug into. Yes, I know 10 megabit is very slow.
Getting the nodes to netboot was, surprisingly, the easy part. The hard (so far impossible) part was getting them to do something useful once netbooting was done. The problem was that FAI was having them netboot using eth1 into some configuration which then proceeded to try to get an address from eth0 and refused to proceed further as there was no cable on eth0, and none of our attempts to make it use the other card were successful. I tried switching the cable from eth1 to eth0 once netbooting had finished, and then it went one step further and halted yet again.
I suppose we could just remove the non-netbooting NIC from each node, but if we are going to remove each node individually anyway, a number of other methods for cloning an install become easy.
It was looking like I also could do a Debian netboot and use a feature called "preseed" in which I could pass some options to the kernel at startup and the install would then run totally non-interactively using all the settings from another install (basically like Kickstart from RH). I couldn't get too far with this, as apparently preseed isn't a standard feature so I'd need some branched/forked version.
But... I was reading today about
OpenSSI. The "SSI" there stands for Single-System Image, which means that the view the software presents of the cluster is like that of a single computer, which differs from MPI and PVM and the explicit message-passing between nodes.
One node acts as the master node, and it netboots the other nodes (which don't even need hard drives). From there, as I understand it, it will automatically migrate processes (but not threads) between nodes as needed, and every node sees a very similar filesystem. So most software runs unmodified, but performance benefits will only be seen if there are enough processes to distribute. I wonder if this could speed up long builds...
Problem is, OpenSSI seems to run best on older distros and probably kernel 2.4. I think it involves a modified kernel - hopefully one that the Myrinet drivers will run on.
So perhaps I will try this. Maybe I can convince Adam that one of the workstations we'll soon acquire/get working can be a head node instead of one of the P2s.