wiki:Documentation/CGettingStarted

Version 50 (modified by nilanjan, 13 years ago) ( diff )

Orbit > HowToGetStarted

How to get started

First, you need an account. Please check the UsagePolicy if you are eligible. In order to get an account please register here.

A typical experiment requires the following three steps:

Running the Experiment

During your approved time slot, you will be able to ssh into the console of the respective grid. A console is a dedicated machine that allows access to all resources on that grid.

During your approved time slot, you can then log into the console corresponding to the following table using SSH:

Main grid (400 nodes) console.grid.orbit-lab.org
Sandbox 1 (2 nodes) console.sb1.orbit-lab.org
Sandbox 2 (2 nodes) console.sb2.orbit-lab.org
Sandbox 5 (2 nodes) console.sb5.orbit-lab.org
Sandbox 6 (2 nodes) console.sb6.orbit-lab.org
Sandbox 7 (2 nodes) console.sb7.orbit-lab.org
Sandbox 8 (2 nodes) console.sb8.orbit-lab.org

For e.g to access the sandbox2,

yourhost>ssh username@console.sb2.orbit-lab.org

When you have successfully logged in, you can start an experiment using the nodehandler. First time users are highly encouraged to reserve time on a sandbox instead of the main grid, and start with the built-in Hello World experiment.

  • Before we begin using the nodes, it's a good idea to check their status first. This is done with the omf stat command.This will typically produce a result like:

No image "newhowto1.jpg" attached to Documentation/CGettingStarted

  • The node can be in 1 of 3 states:
POWEROFF Node is Available for use but turned off
POWERON Node is Available and is on
NODE NOT AVAILABLE Node is not Available for use
  • NOTE: It is recommended that the node be in the POWEROFF state prior to any imaging process. If the node is in the POWERON state you can use the omf tell command to get the node into the off state.
username@console.sb2:~$ omf tell offh all

If the node is in the NODE NOT AVAILABLE state, you may need to wait for it to recover the POWEROFF state (it some times requires a few moments for the service to sync up). If the node never comes out of the NODE NOT AVAILABLE state please contact an administrator.

  • Prior to the experiment, users need to install the baseline image on the hard disks of the nodes. This is done with the omf load command
    username@console.sb2:~$ omf load Topology ImageName
    
    Where Topology specifies the set of nodes you wish to image, and ImageName is the name of the image you with to load. The most common sandbox starting image command would look like
    username@console.sb2:~$ omf load all baseline.ndz
    
    which will load all the nodes of sandbox 2 (toatling 2) with the baseline image.
  • The process start should look like:

No image "newhowto2.jpg" attached to Documentation/CGettingStarted

  • A key line to look for is INFO whenAll: *: 'status[@value='UP']' fires. This line indicates that all the nodes have come up and imaging has begun:

No image "newhowto3.jpg" attached to Documentation/CGettingStarted

  • The final result should look like:

No image "newhowto4.jpg" attached to Documentation/CGettingStarted

  • By default the driver modules for the wireless interfaces are disabled. It is up to the experimenter to decide which interface to use. For this tutorial experiment the ath_pci module will be used. So before running the tutorial experiment ssh into each node (ie. node1-1 & node 1-2) and load the driver modules.
    username@console.sb2:~$ ssh root@node1-1
    
    From the node load the driver module:
    root@node1-1:~# modprobe ath_pci
    
    Verify that the module has been loaded into the kernel
    root@node1-1:~# lsmod
    

No image "newhowto_lsmod.jpg" attached to Documentation/CGettingStarted

Now ssh into node1-2 and do the same.

  • To run a tutorial experiment that involves one UDP traffic sender and one receiver. Make sure you're back at the console.
    username@console.sb2:~$ omf-5.2 exec --tutorial -- --tutorialName tutorial-1a
    
    Once this experiment is started, the output should look similar to:

No image "newhowto5.jpg" attached to Documentation/CGettingStarted

  • Both, sender and receiver, report measurements to a database, using the OML measurement framework. The file is saved as a sqlite3 file (extension .sq3); the file name for the experiment is shown in the last line of the tutorial's output and saved in the console under /var/lib/oml2

To dump the database file for this experiment:

 username@console.sb2:~$ sqlite3 /var/lib/oml2/sb8.orbit-lab.org_2011_07_12_16_00_33.sq3 ".dump"

The experiment can be started with:

user#> nodehandler -t

No image "howto4.PNG" attached to Documentation/CGettingStarted

  • This experiment will send UDP datagrams of 1024 bytes from node 1-1 to node 1-2 at 300 kbps CBR traffic.
  • Both, sender and receiver, report measurements to a database, using our OML measurement framework.
  • As shown below, the experiment controller will power on the nodes involved in the experiment and will issue experiment commands to each node.
  • Each experiment has a unique experiment ID as shown in the figure, that can be used later to view the results from the database

No image "howto5.PNG" attached to Documentation/CGettingStarted

Alternatively, a specific script can be run as follows:

The experiment can be started with:

user#> nodehandler <full-path/script-name>

For e.g., if my script is called orbit-test.rb and it resides in /home/joenull/Ruby-Scripts/ (ORBIT home directory), I would execute it as follows:

user#>pwd
/home/joenull
user#>nodehandler ~/Ruby-Scripts/orbit-test 

Note that I leave out the ".rb" at the end. This will execute the scripts and turn the nodes OFF at the end of the experiment. If you want to leave them ON after the experiment, use the "-k" flag. For e.g.

user#>pwd
/home/joenull
user#>nodehandler -k ~/Ruby-Scripts/orbit-test 

The experimenter can also move to where the script resides and execute it (without giving the full path) since nodehandler will look for the script in the current directory.

More information on writing experiment scripts can be found in the Tutorial.

Analyzing Results

Orbit provides a sophisticated framework to efficiently collect measurements at runtime into a database. This database is accessible to the experimenter during the experiment from the console. At the end of an experiment, the database is copied to an external machine and is accessible without a reservation. More information can be found here.

Where to go from here

If you are still unsure what Orbit is, please read the FAQ and the Tutorial, otherwise go ahead and register.

Attachments (5)

Download all attachments as: .zip

Note: See TracWiki for help on using the wiki.