Thinking in Rsh
To help you understand - and get the most out of - rsh, we've put together this section on "thinking in rsh". By learning to think in rsh and use the patterns it provides, you'll hit fewer issues getting started and be better set up for success.
So what does it mean to think in rsh? Here are some common topics that come up with new users of rsh.
rsh isn't bash
rsh is both a programming language and a shell, and because of this it has its own way of working with files, directories, websites, and more. We've modeled this to work closely with what you may be familiar with from other shells. Pipelines work by attaching two commands together:
> ls | length
rsh also supports other common shell capabilities, like getting the exit code of previously run commands.
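As a sketch of what that can look like, the example below checks the result of an external command; the $env.LAST_EXIT_CODE name is an assumption used here for illustration, not something defined on this page:
> make
> if $env.LAST_EXIT_CODE != 0 { "make failed" }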
While it does have these amenities, rsh isn't bash. The bash way of working, and the POSIX style in general, is not one that rsh supports. For example, in bash, you might use:
> echo "hello" > output.txt
In rsh, we use > as the greater-than operator. This fits better with the language aspect of rsh. Instead, you pipe to a command that has the job of saving content:
> "hello" | save output.txt
Thinking in rsh: The way rsh views data is that data flows through the pipeline until it reaches the user or is handled by a final command. You can simply type data, from strings to JSON-style lists and records, and follow it with | to send it through the pipeline. rsh uses commands to do work and produce more data. Learning these commands and when to use them helps you compose many kinds of pipelines.
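For example, here is a minimal sketch that types a bit of data and pipes it to a command, using only the length command shown above:
> [one, two, three] | length
The list is data you typed directly, and length is the command that does the work on it.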
Think of rsh as a compiled language
An important part of rsh's design, and specifically where it differs from many dynamic languages, is that rsh converts the source you give it into something to run, and then runs the result. It doesn't have an eval feature that allows you to continue pulling in new source during runtime. This means that tasks like including files to be part of your project need to be known paths, much like includes in compiled languages like C++ or Rust.
For example, the following doesn't make sense in rsh, and will fail to execute if run as a script:
"def abc [] { 1 + 2 }" | save output.rsh
source "output.rsh"
abc
The source command will grow the source that is compiled, but the save from the earlier line won't have had a chance to run. rsh runs the whole block as if it were a single file, rather than running one line at a time. In the example, since the output.rsh file is not created until after the 'compilation' step, the source command is unable to read definitions from it during parse time.
Another common issue is trying to dynamically create the filename to source from:
> source $"($my_path)/common.rsh"
This would require the evaluator to run and evaluate the string, but unfortunately rsh needs this information at compile-time.
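What does work is giving source a path that is already known at compile-time, such as a literal path. The filename below is just a placeholder:
> source "my_project/common.rsh"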
Thinking in rsh: rsh is designed to use a single compile step for all the source you send it, and this is separate from evaluation. This will allow for strong IDE support, accurate error messages, an easier language for third-party tools to work with, and in the future even fancier output like being able to compile rsh directly to a binary file.
For a more in-depth explanation, check out How rsh Code Gets Run.
Variables are immutable
Another common surprise for folks coming from other languages is that rsh variables are immutable (and indeed, some people have started to call them "constants" to reflect this). Coming to rsh, you'll want to spend some time becoming familiar with working in a more functional style, as this tends to help you write code that works best with immutable variables.
You might wonder why rsh uses immutable variables. Early on in rsh's development, we decided to see how long we could go using a more data-focused, functional style in the language. More recently, we added a key bit of functionality into rsh that made these early experiments show their value: parallelism. By switching from each to par-each in any rsh script, you're able to run the corresponding block of code in parallel over the input. This is possible because rsh's design leans heavily on immutability, composition, and pipelining.
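As a sketch of that switch, here is the same block driven first by each and then by par-each; it uses only commands and columns that appear elsewhere on this page:
> ls | each { |it| $it.name }
> ls | par-each { |it| $it.name }
The block itself doesn't change; only the command that runs it does.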
Just because rsh variables are immutable doesn't mean things don't change. rsh makes heavy use of the technique of "shadowing". Shadowing means creating a new variable with the same name as a previously declared variable. For example, say you had an $x in scope, and you wanted a new $x that was one greater:
let x = $x + 1
This new x is visible to any code that follows this line. Careful use of shadowing can make for an easier time working with variables, though it's not required.
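Putting that together, a small sketch of shadowing in a script might read:
let x = 8
let x = $x + 1
After the second line, any code that mentions $x sees the shadowed value, 9.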
Loop counters are another common pattern for mutable variables, and they're built into most iterating commands. For example, you can get both each item and its index by combining enumerate with each:
> ls | enumerate | each { |it| $"Number ($it.index) is size ($it.item.size)" }
You can also use the reduce command to work in the same way you might mutate a variable in a loop. For example, if you wanted to find the largest string in a list of strings, you might do:
> [one, two, three, four, five, six] | reduce {|curr, max|
    if ($curr | str length) > ($max | str length) {
        $curr
    } else {
        $max
    }
}
Thinking in rsh: If you're used to using mutable variables for different tasks, it will take some time to learn how to do each task in a more functional style. rsh has a set of built-in capabilities to help with many of these patterns, and learning them will help you write more rsh-style code. The added benefit of speeding up your scripts by running parts of your code in parallel is a nice bonus.
rsh's environment is scoped
rsh takes multiple design cues from compiled languages. One such cue is that languages should avoid global mutable state. Shells have commonly used global mutation to update the environment, but rsh steers clear of this approach.
In rsh, blocks control their own environment. Changes to the environment are scoped to the block where they happen.
In practice, this lets you write some concise code for working with subdirectories. For example, if you wanted to build each sub-project in the current directory, you could run:
> ls | each { |it|
    cd $it.name
    make
}
The cd command changes the PWD environment variable, and this variable change does not escape the block, allowing each iteration to start from the current directory and enter the next subdirectory.
Having the environment scoped like this makes commands more predictable, easier to read, and, when the time comes, easier to debug. rsh also provides helper commands like def --env and load-env as convenient ways of doing batches of updates to the environment.
There is one exception here, where def --env allows you to create a command that participates in the caller's environment.
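As an example, here is a minimal sketch of a def --env command that changes the caller's directory; the enter-src name and the src directory are placeholders for illustration:
def --env enter-src [] {
    cd src
}
Because the command is defined with --env, the cd inside it updates the caller's PWD instead of being dropped when the block ends.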
Thinking in rsh: The coding best practice of avoiding global mutable variables extends to the environment in rsh. Using the built-in helper commands will let you more easily work with the environment in rsh. Taking advantage of the fact that environments are scoped to blocks can also help you write more concise scripts and interact with external commands without adding things you don't need into a global environment.