Sun Grid Engine

From Wikipedia, the free encyclopedia

Jump to: navigation, search
Sun Grid Engine
GridEngine Logo
Developed by Sun Microsystems in association with the community
Latest release 6.2u2_1 / 2009-3-31; 14 days ago
Operating system Cross-platform
Type Grid computing
License SISSL
Website http://gridengine.sunsource.net

Sun Grid Engine (SGE), previously known as CODINE (COmputing in DIstributed Networked Environments) or GRD (Global Resource Director),[1] is an open source batch-queuing system, developed and supported by Sun Microsystems. Sun also sells a commercial product based on SGE, also known as N1 Grid Engine (N1GE).

SGE is typically used on a computer farm or high-performance computing (HPC) cluster and is responsible for accepting, scheduling, dispatching, and managing the remote and distributed execution of large numbers of standalone, parallel or interactive user jobs. It also manages and schedules the allocation of distributed resources such as processors, memory, disk space, and software licenses.

SGE is the foundation of the Sun Grid utility computing system, made available over the Internet in the United States in 2006,[2] later becoming available in many other countries.

Contents

[edit] Features

A screenshot of the xml-qstat web interface.

[edit] Features new in version 6.2

  • Advance reservation
  • Array job interdependencies
  • Enhanced remote execution (without using external rshd/rlogind/sshd processes)
  • Multi-clustering [3]
  • Daemons managed by the Service Management Facility on Solaris
  • Pseudo TTY (pty) support for interactive jobs

Other features of SGE include:

  • Multiple advanced scheduling algorithms allow powerful policy-based resource allocation
  • Cluster queues
  • Job and scheduler fault tolerance - Grid Engine continues to operate as long as there is one or more hosts available
  • Job checkpointing
  • Job arrays and job tasks
  • DRMAA (Job API)
  • Resource reservation
  • XML status reporting (qstat and qhost), and the xml-qstat web interface
  • Parallel jobs (MPI, PVM, OpenMP), and scalable parallel job startup with qrsh [4]
  • Usage accounting
  • Accounting and Reporting COnsole (ARCO)
  • parallel make: distmake, dmake (Sun Studio), and SGE's own qmake
  • FLEXlm integration [5] and multi-cluster software license management with LicenseJuggler [6]

[edit] Platforms

SGE runs on multiple platforms, including:

[edit] Cluster architecture

A typical Grid Engine cluster consists of a master host, and one or more execution hosts. Moreover, multiple shadow masters can be configured as hot spares, which take over the role of the master when the original master host crashes.

[edit] Support and training

Sun provides support contracts [7] for the commercial version of Grid Engine on most UNIX platforms and Windows. Professional services, consulting, training, and support are also provided by Sun Partners. [8] Sun partners with Georgetown University to deliver Grid Engine administration classes.[9] The Bioteam runs short SGE training workshops that are 1 or 2 days long.[10]

Users can get community support on the Grid Engine mailing lists.[11]

Grid Engine Workshops were held in 2002, 2003, and 2007 in Regensburg, Germany.[12]

[edit] Prominent users

Notable deployments of SGE include:

[edit] History

In 2000, Sun acquired Gridware, Inc. a privately owned commercial vendor of advanced computing resource management software with offices in San Jose, Calif., and Regensburg, Germany.[17] Later that year, Sun offered a free version of Gridware for Solaris and Linux, and renamed the product Sun Grid Engine.

In 2001, Sun made the source code available,[18] and adopted the open source development model. Ports for Mac OS X and *BSD were contributed by the non-Sun open source developers.

[edit] Other Grid Engine based products

[edit] Add-on software

A number of SGE add-ons are available:

[edit] References

  1. ^ "A Little History Lesson". Sun Microsystems. 2006-06-23. http://blogs.sun.com/templedf/entry/a_little_history_lesson. 
  2. ^ "World's First Utility Grid Comes Alive on the Internet". Sun Microsystems. 2006-03-22. http://www.sun.com/smi/Press/sunflash/2006-03/sunflash.20060322.1.xml. 
  3. ^ "Hedeby Project home". Sun Microsystems. http://hedeby.sunsource.net. Retrieved on 2008-01-25. 
  4. ^ "Long delay when submitting large jobs (mailing list message)". Sun Microsystems. http://gridengine.sunsource.net/servlets/ReadMsg?listName=users&msgNo=9446. Retrieved on 2007-12-25. 
  5. ^ "Olesen-FLEXlm-Integration". wiki.gridengine.info. http://wiki.gridengine.info/wiki/index.php/Olesen-FLEXlm-Integration. Retrieved on 2007-12-25. 
  6. ^ "LicenseJuggler". wiki.gridengine.info. http://wiki.gridengine.info/wiki/index.php/LicenseJuggler. Retrieved on 2007-12-26. 
  7. ^ "Sun Store Grid Engine Entitlement Purchase". Sun Microsystems. http://store.sun.com/CMTemplate/CEServlet?process=SunStore&cmdViewProduct_CP&catid=115672. Retrieved on 2008-03-03. 
  8. ^ "Sun Grid Engine 6 Partners". Sun Microsystems. http://www.sun.com/software/gridware/partners/index.xml. Retrieved on 2007-12-14. 
  9. ^ "Advanced Sun Grid Engine Configuration and Administration Class". Sun Microsystems. http://blogs.sun.com/templedf/entry/advanced_sun_grid_engine_configuration. Retrieved on 2007-12-14. 
  10. ^ "Training". The Bioteam Inc.. http://blog.bioteam.net/category/training/. Retrieved on 2008-03-24. 
  11. ^ "Grid Engine Mail Lists". Sun Microsystems. http://gridengine.sunsource.net/maillist.html. Retrieved on 2008-01-23. 
  12. ^ "Grid Engine Workshops". Sun Microsystems. http://gridengine.sunsource.net/workshop.html. Retrieved on 2007-12-14. 
  13. ^ "Sun N1 Grid Engine Software and the Tokyo Institute of Technology Super Computer Grid". Sun Microsystems. http://www.sun.com/blueprints/0607/820-1695.html. Retrieved on 2007-11-16. 
  14. ^ "TACC > HPC Systems". The University of Texas at Austin. http://www.tacc.utexas.edu/resources/hpcsystems/#ranger. Retrieved on 2007-12-13. 
  15. ^ "More Ranger Facts and Figures". Sun Microsystems. http://blogs.sun.com/marchamilton/entry/more_ranger_facts_and_figures. Retrieved on 2008-02-12. 
  16. ^ "TOP500 List - June 2008". TOP500.Org. 2006-06-18. http://top500.org/list/2008/06/100. 
  17. ^ "Gridware's resource management software increases efficiency and productivity in compute-intensive technical computing environments". Sun Microsystems. 2000-07-24. http://www.sun.com/smi/Press/sunflash/2000-07/sunflash.20000724.3.xml. 
  18. ^ "Sun Microsystems makes SUN GRID ENGINE software available to open source community". Sun Microsystems. 2001-07-23. http://www.sun.com/smi/Press/sunflash/2001-07/sunflash.20010723.1.xml. 
  19. ^ "Sun Compute Cluster Solution". Sun Microsystems. http://www.sun.com/servers/hpc/computecluster/index.jsp. 
  20. ^ "Sun Grid Engine, a new scheduler for EGEE middleware". Imperial College. 2000-12-29. http://pubs.doc.ic.ac.uk/egee-sge-integration/. 
  21. ^ "Installing and Configuring Sun Cluster HA for Sun Grid Engine". Sun Microsystems. 2008-02-15. http://docs.sun.com/app/docs/doc/819-3064/cacjgdbc?a=view. 

[edit] See also

[edit] External links

Personal tools