start
Differences
This shows you the differences between two versions of the page.
start [2019/08/30 13:28] – deadline | start [2022/03/31 15:51] (current) – deadline | ||
---|---|---|---|
Line 1: | Line 1: | ||
=====Welcome to the Data Science Lab Cluster===== | =====Welcome to the Data Science Lab Cluster===== | ||
- | This computation resource is a cluster of workstations that can work together as one big systems. Currently, the system can run large Hadoop and Spark jobs. There are also three GPU equipped nodes that are configured to run TensorFlow. | + | ===== DSL LAB IS OPEN ===== |
- | FOR HELP CLICK ON THE "How Do I" LINK BELOW | + | You will need an account to get started (contact the Lab Admin). This wiki will be updated as new capabilities are added. |
+ | |||
+ | * [[how_do_i# | ||
+ | * [[how_do_i# | ||
+ | * [[how_do_i# | ||
+ | * [[how_do_i# | ||
+ | * [[how_do_i# | ||
+ | **Watch this space for updates.** | ||
+ | |||
+ | ====About The System==== | ||
+ | |||
+ | This computation resource is a collection of nine individual workstations that can work together as a scalable data science cluster for Big Data processing. The system can run large Hadoop and Spark jobs using the 10 TByte Hadoop Distributed File System (HDFS) on up to 120 cores. There are also three GPU equipped nodes that are configured to run TensorFlow. Total system memory is 600 GBytes spread across 30 separate motherboards. | ||
+ | |||
+ | Each workstation provides a Linux desktop environment that supports Anaconda Navigator (Python), RStudio, and the Zeppelin web notebook (Spark, PySpark, Hadoop Hive, HBase, Python). | ||
+ | |||
+ | ====FOR HELP CLICK ON THE "How Do I" LINK BELOW==== | ||
* [[System Description|System Description]] | * [[System Description|System Description]] | ||
Line 9: | Line 25: | ||
* [[How Do I|How Do I]] | * [[How Do I|How Do I]] | ||
* [[Adminstration|Administration]] | * [[Adminstration|Administration]] | ||
+ | |||
+ | **HINT:** To get back to this main page from any page in the wiki, click on the **Data Science Lab** in the upper left corner. | ||
**System News:** | **System News:** | ||
+ | Feb-18-2022 | ||
+ | Feb-14-2022 | ||
+ | Feb-07-2022 | ||
+ | Jan-21-2022 | ||
+ | Jan-20-2022 | ||
+ | Nov-11-2012 | ||
+ | ---- OLD SYSTEM ---- | ||
+ | Aug-30-2019 | ||
+ | | ||
Feb-20-2019 | Feb-20-2019 | ||
Nov-27-2018 | Nov-27-2018 | ||
Line 26: | Line 53: | ||
| | ||
!!! Be sure to run "scl enable devtoolset-6 python27 bash" to use Python 2.7 | !!! Be sure to run "scl enable devtoolset-6 python27 bash" to use Python 2.7 | ||
- | Feb-09-2018: | + | |
- | ssh connections (machine to machine) will close after 1 hour of inactivity. | + | |
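
The System News entry above says to run "scl enable devtoolset-6 python27 bash" before using Python 2.7. A minimal sketch of one way to confirm which interpreter is active in the resulting shell follows. The function names `python_label` and `is_python27` are hypothetical helpers introduced here for illustration; they are not part of the cluster setup, and the script only assumes that some `python` is on the path.

```python
import sys


def python_label(version_info=sys.version_info):
    """Return a short label such as 'Python 2.7' for a version tuple."""
    return "Python %d.%d" % (version_info[0], version_info[1])


def is_python27(version_info=sys.version_info):
    """Return True when the major and minor version are exactly 2.7."""
    return tuple(version_info[:2]) == (2, 7)


if __name__ == "__main__":
    # Report the interpreter that is actually running this script.
    print(python_label())
    if not is_python27():
        # Hypothetical reminder text; the command itself is quoted from the news entry.
        print("Not Python 2.7; try: scl enable devtoolset-6 python27 bash")
```

Saving this as a file and running it with `python` inside the shell opened by the `scl` command should report the 2.7 interpreter if the software collection was enabled. The script uses only syntax that is valid in both Python 2.7 and Python 3, so it runs either way.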
start.1567171735.txt.gz · Last modified: 2019/08/30 13:28 by deadline