SDS BASIC

From HandWiki
Short description: Dialect of the BASIC programming language
SDS BASIC
Paradigmimperative
First appeared1967; 57 years ago (1967)
OSseveral
Influenced by
Dartmouth BASIC
Influenced
HP Time-Sharing BASIC, SCELBAL many others

SDS BASIC, also known as CP-V BASIC, Batch BASIC or Sigma BASIC depending on the version, is a BASIC programming language compiler for Scientific Data Systems's (SDS) Sigma series mainframe computers, originally released in 1967. Xerox purchased SDS in 1969 and began rebranding it as Xerox Data Systems, and finally just Xerox, at which time the language became known as Xerox BASIC.

The original versions did not include support for string variables, although this was added for the version running under the CP-V operating system when it was released in 1971. The string library allowed manipulation of strings using array slicing syntax, similar to the system found in HP Time-Shared BASIC and a number of microcomputer BASICs like Integer BASIC and Atari BASIC.

This style of string manipulation differs from the model introduced in DEC's BASIC-PLUS, which used string functions. Altair BASIC was patterned on BASIC-PLUS, and this style became the de facto standard after the cross-platform version, Microsoft BASIC, became almost universal during the home computer era. This makes conversion from SDS to modern dialects somewhat difficult.

SDS BASIC is historically notable as the platform on which the original Star Trek computer game was written in the summer of 1971.[1]

Syntax

In keeping with the original Dartmouth BASIC model, and in common with most mainframe-based BASICs, SDS BASIC was a compiler, not an interpreter, and used separate but tightly integrated tools for editing and running. The editor was dedicated to BASIC; as lines are entered they are analyzed for correct syntax and then stored in tokenized form. If a line is entered with a number at the start, it is placed into the appropriate location in the program based on the number, lines without numbers were immediately processed and then forgotten. SDS allowed line numbers in the range 1 to 99999.[2] A simple example is:

10 PRINT "HELLO WORLD"

Like many versions of BASIC, SDS allowed a single line of code to contain multiple statements. In most dialects this is indicated using the colon, but SDS used either the backslash, \ or ampersand, &.[3] An equivalent line in SDS would be:

10 PRINT "HELLO" \ PRINT "WORLD"

Because SDS was a compiler, and the user's source code was stored separately, it preserved leading spaces in the code. They suggested using this to more clearly indicate the structure of loops:[4]

10 LET J=0, K=1
20 FOR I=K TO 8
30    PRINT J
40    M=J, J=K, K=K+1
50 NEXT I

This example also includes the ability to set multiple values in a single LET statement, as seen in lines 10 and 40. This feature was also found in HP BASIC. As in most versions of BASIC, the keyword LET is optional, and is left out of line 40. Line 40 also illustrates the alternate form of assignment seen in a number of early BASICs, where multiple assignments can be separated with commas.[5]

Common mathematical operations included +, -, * and /, using ^ or ** for exponents.[2] Logical comparisons, like HP, could be written in any order, so <= was equivalent to =<, and >< was the same as <>.[3] Most dialects only allow the later syntax in both cases. Unlike HP BASIC, SDS did not include the ability to use the hash, #, as an alternate form of <>.

SDS supported computed GOTO using ON, but also allowed the alternate syntax with the ON at the end of the line instead of the front:

GOTO 140,160,180 ON Y

An alternate form of REM was a leading star, *,[6] similar in purpose to the short-form found in MS-derived BASICs, the single quote, '.[7]

String handling

The early versions of SDS BASIC, sometimes known as Batch BASIC or BTM BASIC, did not support string variables. However, string values could be assigned to numeric variables which could store six characters as their EBCDIC numeric values.[8] The later CP-V BASIC added true string variables containing up to 132 characters that could be indicated with a trailing dollar sign, $, or uncommonly, a leading $ — A$ and $A are equivalent. The manual notes that the latter format was added for compatibility with A00 BASIC.[9][lower-alpha 1]

Internally, CP-V used a fixed-length string layout consisting of an array of bytes that could not be changed in size at runtime. This was similar to the other BASICs that used slicing notation, like HP and Atari. It contrasts with the DEC/Microsoft style, where strings were dynamically allocated on a heap. Unlike the other fixed-length BASIC dialects, string variables did not have to be DIMmed prior to use, instead, all strings defaulted to a maximum length of 72 characters. This was a global setting that could be changed to better manage memory by setting it to a smaller value, or allow longer strings up to a maximum of 132 characters. This was accomplished using SET $=132. Confusingly, SET was also used for the entirely unrelated task of allowing variables to be used in DIM statements.[10]

Because strings in CP-V did not present themselves publicly as arrays of characters, the slicing syntax was not based on array syntax. This meant CP-V also allowed the construction of arrays of strings, something dialects like HP and Atari lacked because they used array notation for slicing. One could thus make a matrix of strings using DIM:[9]

10 DIM A$(5,10)

would define a matrix containing a total of 50 72-character strings in a 5 by 10 matrix. Because the system supported arrays, slicing syntax was somewhat more complex than the system found in other versions. Examples include:[9]

A$(1)

selects the first string in an array of strings, while:

A$(1,5)

selects a string from location 1,5 in a 2D array (matrix). Slicing was handled by adding a colon within the parens:[9]

A$(:4)

returned everything from the 4th character on:

A$(:4,5)

selects five characters starting with the 4th character, while:

A$(1,5:4,5)

selects five characters starting with the 4th character of the string in locations 1,5 in a 2D array.[9]

String constants could be delimited with either single or double quotes, making it easier to enclose quotes within lines. Valid examples include:[11]

PRINT 'THIS IS A STRING LITERAL'
PRINT "THIS IS ANOTHER ONE"
PRINT 'AND THIS IS "A THIRD"'
PRINT "AND 'A FOURTH'"

CP-V also includes the CHANGE command from Dartmouth BASIC, which converts a string into a series of EBCDIC numeric values and places them into a numeric array:[12]

10 DIM B(6)
20 CHANGE "ABCDEF" TO B

B will now contain a series of numbers corresponding to the EBCDIC values for each character.[12]

Input/output

SDS included a robust system for input/output based on "stream" numbers that could be specified in PRINT and INPUT statements. For instance, data could be read from a file using:

10 OPEN 'FILE1' TO :1,INPUT
20 INPUT :1,A

The first line opens a disk file with the name "FILE1", assigns it to stream 1, and indicates it will be used for input. The second line reads the next data from stream 1, in this case the first value, and assigns the resulting data to variable A. The system also included a unique "key" system that allowed data to be numbered and then accessed randomly. This was accomplished by adding the key number after a semicolon:

INPUT :1;9999,A

will attempt to read record 9999. Valid keys were in the range .001 to 9999.999, and if no keys were assigned in the files, lines were assigned numbers 1.000, 2.000, etc. The highest-valued key in a file could read using:[13]

KEY(1)

where the 1 is the stream number.[13] Unstructured binary data could be read and written byte-at-a-time using GET and PUT. [14]

MAT commands

Later versions of Dartmouth BASIC included a suite of MAT commands that allowed operations on entire arrays (matrices) with a single statement. These were also available in SDS BASIC. In their simplest form, the MAT is used like an alternate form of LET, applying an expression to all the elements in an array. For instance:

100 DIM A(20),B(20)
...
200 MAT A=A+B

Will add the value of every value in B to every entry in A, in the same fashion as:

100 DIM A(20),B(20)
...
200 FOR I=1 TO 20
210 A(I)=A(I)+B(I)
220 NEXT I

As well as making the code shorter and more obvious, these commands also have the advantage of being highly optimized, easily outperforming the use of FOR/NEXT.[15] Additional functions and statements modify PRINT and INPUT, invert arrays, and build identity matrixes and such in a single statement.[16]

Notes

  1. A00 appears to be another name for Batch BASIC, where the leading $ was a macro for converting the values to EBCDIC.

References

Citations

  1. Markowitz, Maury; Mayfield, Mike (2000). "Star Trek". https://gamesoffame.wordpress.com/star-trek/. Retrieved 2018-11-27. 
  2. 2.0 2.1 Xerox 1974, p. 2.
  3. 3.0 3.1 Xerox 1974, p. 19.
  4. Xerox 1974, p. 3.
  5. Xerox 1974, p. 9.
  6. Xerox 1974, p. 17.
  7. "REM Statement (Visual Basic)". 20 July 2015. https://docs.microsoft.com/en-us/dotnet/visual-basic/language-reference/statements/rem-statement. 
  8. Xerox 1974, p. 23.
  9. 9.0 9.1 9.2 9.3 9.4 Xerox 1974, p. 21.
  10. Xerox 1974, p. 40.
  11. Xerox 1974, p. 8.
  12. 12.0 12.1 Xerox 1974, p. 25.
  13. 13.0 13.1 Xerox 1974, p. 26.
  14. Xerox 1974, p. 29.
  15. Ref 1976, p. 11-50.
  16. Xerox 1974, p. 34.

Bibliography