Metaprogramming: Difference between revisions

From Rosetta Code
Content added Content deleted
m (PARI actually doesn't need prec in this case)
Line 144: Line 144:
how numbers are compared (for equalness, greater or equal to, less or equal
how numbers are compared (for equalness, greater or equal to, less or equal
to, etc).
to, etc).

=={{header|SNOBOL4}}==
There are several features of SNOBOL4 which could be considered meta-programming. The first of these is the ability to define synonyms for existing operators or functions, a feature which can help in creating DSLs of sorts in SNOBOL4 programs. For example the following code will alias the built-in function <code>IDENT</code> to <code>SAME</code> and the unary operator <code>*</code> to <code>$</code>:
<lang snobol4>
OPSYN('SAME','IDENT')
OPSYN('$','*',1)
</lang>

This is a simplistic use of <code>OPSYN</code>, however. More interesting is the aliasing of a function to an operator:
<lang snobol4>
OPSYN('F','*',1)
</lang>

In this setup, calling <code>F(X)</code> is the same as using the sequence <code>*X</code> which, in more complicated expressions, could result in better readability.

Other metaprogramming features supported would include the use of unevaluated expressions. If, in code, <code>E</code> is an expression it has a value as soon as it is defined and/or assigned to. <code>*E</code>, on the other hand, has a value only when it is evaluated either in the context of a pattern or in the context of an <code>EVAL</code>. The following example shows the motivation for unevaluated expressions in pattern matching contexts:
<lang snobol4>
&ANCHOR = 0 ; &TRIM = 1
WORD = BREAK(' .,') . W SPAN(' .,')
STRING1 = INPUT :F(ERROR)
STRING2 = INPUT :F(ERROR)
LOOP STRING1 WORD = :F(OUTPUT)
STRING2 ' ' W ANY(' .,') :F(LOOP)
LIST = LIST W ', ' :(LOOP)
OUTPUT OUTPUT = LIST
END
</lang>

In this code, two strings are input and a list of words appearing in both strings is generated. The problem with this code is that the pattern structure <code>' ' W ANY(' .,')</code> is built on each iteration. Since pattern building is expensive, putting it in a loop like this is bad form. It cannot be moved outside of the loop, however, since the value of W changes for each iteration. The solution to this is to defer the evaluation of the variable <code>W</code> until it is needed while keeping the rest of the pattern intact:
<lang snobol4>
&ANCHOR = 0 ; &TRIM = 1
WORD = BREAK(' .,') . W SPAN(' .,')
FINDW = ' ' *W ANY(' .,')
STRING1 = INPUT :F(ERROR)
STRING2 = INPUT :F(ERROR)
LOOP STRING1 WORD = :F(OUTPUT)
STRING2 FINDW :F(LOOP)
LIST = LIST W ', ' :(LOOP)
OUTPUT OUTPUT = LIST
END
</lang>
In this code, the pattern is constructed only once in the line <code>FINDW = ' ' *W ANY(' .,')</code>. The value of the variable <code>W</code>, however, is only provided when FINDW is used in a pattern match. In this case it is given its value from the line before when <code>STRING1</code> is matched against the pattern <code>WORD</code>. In this way the expense of building the pattern is paid only once, but the flexibility of matching a sequence of values is retained.

The final example of metaprogramming that's available lies in the idiosyncratic way that user-defined functions work in SNOBOL4. The fact that the <code>DEFINE</code> function can be recalled at any point to redefine any function is a powerful feature that can lead to very efficient code. (It can also lead to very unreadable code, of course, if not properly used.)

Consider this hand-waving example for the motivation:
<lang snobol4>
* This example provides a bizarrely-expensive addition operation.
* It assumes the existence of an expensive procedure—say a database
* lookup—to extract the value to be added. This version uses the
* typical initialize-on-definition approach to implementation.
DEFINE('XADD(X)','XADD')
ADDVALUE = CALL_SOME_EXPENSIVE_OPERATION() :(XADD.END)
XADD XADD = X + ADDVALUE :(RETURN)
XADD.END
</lang>

In normal operation the interpreter will execute the <code>DEFINE</code> function and then execute the <code>ADDVALUE = ...</code> line, branching *past* the actual body of the function to the label <code>XADD.END</code>. If, however, there are many such functions, and especially if there's the possibility that these functions are never actually called, this could render program startup very slow. For purposes of amortizing initialization time, or for purposes of saving unnecessary initialization, the following code is better:
<lang snobol4>
DEFINE('XADD(X)','XADD.INIT') :(XADD.END)
XADD.INIT ADDVALUE = CALL_SOME_EXPENSIVE_OPERATION()
DEFINE('XADD(X)','XADD')
XADD XADD = X + ADDVALUE :(RETURN)
XADD.END
</lang>

The code now defines the <code>XADD</code> function and immediately, without doing initialization, jumps to the <code>XADD.END</code> label, bypassing both initialization and the function body. The trick here is that it defines the entry point of the function to be the <code>XADD.INIT</code> label. Now the first time <code>XADD</code> is called, control is transferred to <code>XADD.INIT</code>, the expensive initialization is performed, then the <code>XADD</code> function is *redefined* to point to the <code>XADD</code> label as the entry point. From this point onward all calls to <code>XADD</code> only perform the calculation, not the expensive initialization while the expensive initialization isn't paid at all unless the function is used at least once.

There are, of course, many other uses for function redefinition in this style which are useful for metaprogramming efforts. Indeed such features are used prominently in the debugging subsystems of SNOBOL4 implementations.


=={{header|Tcl}}==
=={{header|Tcl}}==

Revision as of 14:06, 4 July 2011

Task
Metaprogramming
You are encouraged to solve this task according to the task description, using any language you may know.

Name and briefly demonstrate any support your language has for metaprogramming. Your demonstration may take the form of cross-references to other tasks on Rosetta Code. When possible, provide links to relevant documentation.

For the purposes of this task, "support for metaprogramming" means any way the user can effectively modify the language's syntax that's built into the language (like Lisp macros) or that's conventionally used with the language (like the C preprocessor). Such facilities need not be very powerful: even user-defined infix operators count. On the other hand, in general, neither operator overloading nor eval count. The task author acknowledges that what qualifies as metaprogramming is largely a judgment call.

E

Forms of metaprogramming existant in E:

  • Quasi-literals provide convenient notation for data structures and values for which there is not literal syntax, as discussed in String#E.
  • E program fragments may be quoted, manipulated as an AST, and evaluated, similarly to Lisp; lexical environments are first-class objects (though static with respect to the evaluated code). Demonstrated in Runtime evaluation#E and Eval in environment#E.
  • Control structures may be defined, as demonstrated in Extend your language#E.

J

J names have one of four different grammatic types: noun, verb, adverb, conjunction. Nouns and verbs are nothing special from a metaprogramming point of view. However, adverbs and conjunctions are evaluated at "parse time" and can be used to introduce expression variants. (The result of an adverb, or of a conjunction may be either a noun, a verb, an adverb or a conjunction.)

Additionally, J script blocks (a block of text terminated by a right parenthesis on a single line) can and are used with differing interpreters (which may be built into the language or user written).

The J implementation of the Y combinator could be considered an example of metaprogramming.

J's explicit definitions might also be considered an example of metaprogramming, since explicit definitions have data dependent syntactic types.

Lua

Due to the way Lua's syntactic sugar is designed, metatables can make some Lua code look like a Domain-Specific Language, despite technically being (mostly) just specialized operator overloading.

For example: <lang lua> class "foo" : inherits "bar" {

} </lang>

is perfectly valid syntax. (Lua does not having a built in concept of classes or inheritance.)

PARI/GP

The primary means of metaprogramming for GP is to extend it with PARI. As an example, defining an infix @ operator could work like

In src/functions/symbolic_operators/min0:

Function: _@_
Help: x@y: compute the lesser of x and y, or 0, whichever is larger.
Section: symbolic_operators
C-Name: gmin0
Prototype: GG
Description:
 (small, small):small	 smin0ss($1, $2)
 (mp, mp):mp            gmin0($1, $2)
 (gen, gen):gen     	 gmin0($1, $2)

In (e.g.) basemath/somefile.c: <lang C>long smin0ss(long a, long b) {

 long c = a < b ? a : b;
 return c > 0 ? c : 0;

}


GEN gmin0(GEN a, GEN b) {

 GEN c = gcmp(a, b) < 1 ? a : b; /* copy pointer */
 return signe(c) > 0 ? gcopy(c) : gen_0;

}</lang>


Perl

You can textually transform code with a source filter, a module that when used modifies the following lines of source. Filter::Util::Call provides a general means of writing source filters. Filter::Simple is an interface to Filter::Util::Call that lets you elide a lot of boilerplate code. More important, Filter::Simple can hide the contents of quoting constructs from your filter, obviating the biggest dangers of textual metaprogramming. For example, given the following module:

<lang perl>package UnicodeEllipsis;

use Filter::Simple;

FILTER_ONLY code => sub { s/…/../g };</lang>

this program:

<lang perl>use UnicodeEllipsis;

print join(' … ', 1 … 5), "\n";</lang>

prints:

 1 … 2 … 3 … 4 … 5

Devel::Declare lets you define a new keyword by setting up a hook to be run whenever the parser encounters a bareword of your choice. Devel::Declare is powerful, but it has a reputation for being difficult to understand. See this blog post for a relatively simple usage example.

PicoLisp

As in any Lisp, metaprogramming is an essential aspect of PicoLisp. In most cases normal functions are used to extend the language (see Extend your language#PicoLisp), read-macros operate on the source level, and also runtime macros are used occasionally.

PostScript

PostScrpt allows the reification of stack, scoping (Dynamic scoping is default, but lexical scoping can be implemented using immediate loading), bindings using dictionaries, and even control flow. Here is an example of implementation of if statement

Library: initlib

<lang postscript>

/ift { 4 dict begin

   [/.if /.then] let*
   count array astore /.stack exch def
   /_restore {clear .stack aload pop}.
   .stack aload pop .if {
      _restore .then
   } {
      _restore
   } ifelse

end}. </lang> The standard if expression in postscript does not take a predicate. Instead it acts on the boolean value on top of the stack. This newly created word allows us to do <lang postscript> >| 2 {1 gt} {=} ift 2 </lang> Instead of

<lang postscript> >| 2 dup 1 gt {=} ift 2 </lang>

Note that even the let expression was implemented using meta programming <lang postscript> /let* {reverse {exch def} forall}. </lang>

Python

Metaprogramming is frowned on in Python and considered un-pythonic. The only widely known example of metaprogramming in Python was an implementation of a goto (and comefrom) keyword done as an April-fools joke.

REXX

REXX doesn't allow for the changing or overriding of syntax per se, but any of built-in functions can be overided by just specifying your own.

REXX allows the programmer to set the precision of the numbers it uses in calculations and expressions, as well as the FUZZ settings, which effects how numbers are compared (for equalness, greater or equal to, less or equal to, etc).

SNOBOL4

There are several features of SNOBOL4 which could be considered meta-programming. The first of these is the ability to define synonyms for existing operators or functions, a feature which can help in creating DSLs of sorts in SNOBOL4 programs. For example the following code will alias the built-in function IDENT to SAME and the unary operator * to $: <lang snobol4>

       OPSYN('SAME','IDENT')
       OPSYN('$','*',1)

</lang>

This is a simplistic use of OPSYN, however. More interesting is the aliasing of a function to an operator: <lang snobol4>

       OPSYN('F','*',1)

</lang>

In this setup, calling F(X) is the same as using the sequence *X which, in more complicated expressions, could result in better readability.

Other metaprogramming features supported would include the use of unevaluated expressions. If, in code, E is an expression it has a value as soon as it is defined and/or assigned to. *E, on the other hand, has a value only when it is evaluated either in the context of a pattern or in the context of an EVAL. The following example shows the motivation for unevaluated expressions in pattern matching contexts: <lang snobol4>

       &ANCHOR = 0 ; &TRIM = 1
       WORD = BREAK(' .,') . W SPAN(' .,')
       STRING1 = INPUT                       :F(ERROR)
       STRING2 = INPUT                       :F(ERROR)

LOOP STRING1 WORD = :F(OUTPUT)

       STRING2 ' ' W ANY(' .,')              :F(LOOP)
       LIST = LIST W ', '                    :(LOOP)

OUTPUT OUTPUT = LIST END </lang>

In this code, two strings are input and a list of words appearing in both strings is generated. The problem with this code is that the pattern structure ' ' W ANY(' .,') is built on each iteration. Since pattern building is expensive, putting it in a loop like this is bad form. It cannot be moved outside of the loop, however, since the value of W changes for each iteration. The solution to this is to defer the evaluation of the variable W until it is needed while keeping the rest of the pattern intact: <lang snobol4>

       &ANCHOR = 0 ; &TRIM = 1
       WORD = BREAK(' .,') . W SPAN(' .,')
       FINDW = ' ' *W ANY(' .,')
       STRING1 = INPUT                       :F(ERROR)
       STRING2 = INPUT                       :F(ERROR)

LOOP STRING1 WORD = :F(OUTPUT)

       STRING2 FINDW                         :F(LOOP)
       LIST = LIST W ', '                    :(LOOP)

OUTPUT OUTPUT = LIST END </lang> In this code, the pattern is constructed only once in the line FINDW = ' ' *W ANY(' .,'). The value of the variable W, however, is only provided when FINDW is used in a pattern match. In this case it is given its value from the line before when STRING1 is matched against the pattern WORD. In this way the expense of building the pattern is paid only once, but the flexibility of matching a sequence of values is retained.

The final example of metaprogramming that's available lies in the idiosyncratic way that user-defined functions work in SNOBOL4. The fact that the DEFINE function can be recalled at any point to redefine any function is a powerful feature that can lead to very efficient code. (It can also lead to very unreadable code, of course, if not properly used.)

Consider this hand-waving example for the motivation: <lang snobol4>

  • This example provides a bizarrely-expensive addition operation.
  • It assumes the existence of an expensive procedure—say a database
  • lookup—to extract the value to be added. This version uses the
  • typical initialize-on-definition approach to implementation.
        DEFINE('XADD(X)','XADD')
        ADDVALUE = CALL_SOME_EXPENSIVE_OPERATION()        :(XADD.END)

XADD XADD = X + ADDVALUE  :(RETURN) XADD.END </lang>

In normal operation the interpreter will execute the DEFINE function and then execute the ADDVALUE = ... line, branching *past* the actual body of the function to the label XADD.END. If, however, there are many such functions, and especially if there's the possibility that these functions are never actually called, this could render program startup very slow. For purposes of amortizing initialization time, or for purposes of saving unnecessary initialization, the following code is better: <lang snobol4>

         DEFINE('XADD(X)','XADD.INIT')                    :(XADD.END)

XADD.INIT ADDVALUE = CALL_SOME_EXPENSIVE_OPERATION()

         DEFINE('XADD(X)','XADD')

XADD XADD = X + ADDVALUE  :(RETURN) XADD.END </lang>

The code now defines the XADD function and immediately, without doing initialization, jumps to the XADD.END label, bypassing both initialization and the function body. The trick here is that it defines the entry point of the function to be the XADD.INIT label. Now the first time XADD is called, control is transferred to XADD.INIT, the expensive initialization is performed, then the XADD function is *redefined* to point to the XADD label as the entry point. From this point onward all calls to XADD only perform the calculation, not the expensive initialization while the expensive initialization isn't paid at all unless the function is used at least once.

There are, of course, many other uses for function redefinition in this style which are useful for metaprogramming efforts. Indeed such features are used prominently in the debugging subsystems of SNOBOL4 implementations.

Tcl

Metaprogramming is considered to be normal in Tcl; the whole language was designed to support new operations that work with a similar level of integration to existing commands (and indeed, the standard commands are not syntactically special in any way), and the upvar and uplevel commands are specifically for this sort of use. Moreover, there are no language keywords that need to be worked around; words/tokens can be used to mean anything necessary. For example: <lang tcl>proc loopVar {var from lower to upper script} {

   if {$from ne "from" || $to ne "to"} {error "syntax error"}
   upvar 1 $var v
   if {$lower <= $upper} {
       for {set v $lower} {$v <= $upper} {incr v} {
           uplevel 1 $script
       }
   } else {
       # $upper and $lower really the other way round here
       for {set v $lower} {$v >= $upper} {incr v -1} {
           uplevel 1 $script
       }
   }

}</lang> The above creates a new loopVar command that might be used like this: <lang tcl>loopVar x from 1 to 4 {

   loopVar y from $x to 6 {
       puts "pair is ($x,$y)"
       if {$y >= 4} break
   }

}</lang> Which prints this:

pair is (1,1)
pair is (1,2)
pair is (1,3)
pair is (1,4)
pair is (2,2)
pair is (2,3)
pair is (2,4)
pair is (3,3)
pair is (3,4)
pair is (4,4)

As you can see, the new looping command is wholly integrated with the rest of the Tcl language.

Code generation is also possible. The easiest approach is to use the list command to generate substitution-free command, leaving all substitutions to places that are written by the programmer directly.

Finally, the total lack of keywords is exemplified by this classic fragment of Tcl: <lang tcl>set set set</lang> In this, the first set is a command (that updates a variable), the second is the name of a variable in the current namespace, and the third is a string that will be placed in a variable.