The PGF API is available as Web Service through the built-in HTTP server in the main GF executable. It is activated by starting GF with the -server
flag:
$ gf -server
This is GF version 3.8.
Document root = /usr/share/gf-3.8/www
Starting HTTP server, open http://localhost:41296/ in your web browser.
A compiled GF grammar (a .pgf
file) can be used in web applications by placing it somewhere under the document root. When there is a request for access to a .pgf
file, the GF web server will load and cache the grammar and interpret any parameters included in the URL (in the url-encoded-query format). The response you get back is usually a data structure in JSON format, but it could also be an image or plain text.
For example, if my_grammar.pgf
is a grammar placed directly under the document root, then the grammar could be accessed using this URL:
http://localhost:41296/my_grammar.pgf
The default when no parameters are included in the URL is a response with some general information about the grammar, encoded in JSON format. To perform specific command you have to tell what command you want to perform. The command is encoded in the parameter command
, i.e.:
http://localhost/my_grammar.pgf?command=
cmd
where cmd is the name of the command. Most commands require additional arguments, which are encoded as parameters as well. The supported commands and their arguments are described below.
This command provides some general information about the grammar. This command is also executed if no `command` parameter is given.
Parameter | Description | Default |
---|---|---|
command | should be grammar |
- |
A JSON object including the following fields:
Field | Description |
---|---|
name |
the name of the abstract syntax in the grammar |
userLanguage |
the concrete language in the grammar which best matches the default language set in the user's browser |
categories |
list of all abstract syntax categories defined in the grammar |
functions |
list of all abstract syntax functions defined in the grammar |
languages |
list of concrete languages available in the grammar |
Every language is described with object having this two fields:
Field | Description |
---|---|
name |
the name of the concrete syntax for the language |
languageCode |
the two-character language code according to the ISO standard i.e. en for English, bg for Bulgarian, etc. |
The language codes need to be specified in the grammar with flags language=...
. The web service receives the code of the language set in the browser and compares it with the codes defined in the grammar. If there is a match then the service returns the corresponding concrete syntax name. If no match is found then the first language in alphabetical order is returned.
This command parses a string and returns a list of abstract syntax trees.
Parameter | Description | Default |
---|---|---|
command |
should be parse |
- |
cat |
the start category for the parser | the default start category for the grammar |
input |
the string to be parsed | empty string |
from |
the name of the concrete syntax to use for parsing | all languages in the grammar will be tried |
limit |
limit how many trees are returned (gf>3.3.3) | no limit is applied |
List of objects where every object represents the analyzes for every input language. The objects have three fields:
Field | Description |
---|---|
from |
the concrete language used in the parsing |
brackets |
the bracketed string from the parser |
trees |
list of abstract syntax trees |
typeErrors |
list of errors from the type checker |
The abstract syntax trees are sent as plain strings. The type errors are objects with two fields:
Field | Description |
---|---|
fid |
forest id which points to a bracket in the bracketed string where the error occurs |
msg |
the text message for the error |
The current implementation either returns a list of abstract syntax trees or a list of type errors. By checking whether the field trees is not null we check whether the type checking was successful.
The command takes an abstract syntax tree and produces string in the specified language(s).
Parameter | Description | Default |
---|---|---|
command |
should be linearize |
- |
tree |
the abstract syntax tree to linearize | - |
to |
the name of the concrete syntax to use in the linearization | linearizations for all languages in the grammar will be generated |
Field | Description |
---|---|
to |
the concrete language used for the linearization |
tree |
the output text |
The translation is a two step process. First the input sentence is parsed with the source language and after that the output sentence(s) are produced via linearization with the target language(s). For that reason the input and the output for this command is the union of the input/output of the commands for parsing and the one for linearization.
Parameter | Description | Default |
---|---|---|
command |
should be translate |
- |
cat |
the start category for the parser | the default start category for the grammar |
input |
the input string to be translated | empty string |
from |
the source language | all languages in the grammar will be tried |
to |
the target language | linearizations for all languages in the grammar will be generated |
limit |
limit how many parse trees are used (gf>3.3.3) | no limit is applied |
The output is a list of objects with these fields:
Field | Description |
---|---|
from |
the concrete language used in the parsing |
brackets |
the bracketed string from the parser |
translations |
list of translations |
typeErrors |
list of errors from the type checker |
Every translation is an object with two fields:
tree |
abstract syntax tree |
linearizations |
list of linearizations |
Every linearization is an object with two fields:
Field | Description |
to |
the concrete language used in the linearization |
text |
the sentence produced |
The type errors are objects with two fields:
Field | Description |
---|---|
fid |
forest id which points to a bracket in the bracketed string where the error occurs |
msg |
the text message for the error |
The current implementation either returns a list of translations or a list of type errors. By checking whether the field translations is not null we check whether the type checking was successful.
This command generates random abstract syntax tree where the top-level function will be of the specified category. The categories for the sub-trees will be determined by the type signatures of the parent function.
Parameter | Description | Default |
---|---|---|
command |
should be random |
- |
cat |
the start category for the generator | the default start category for the grammar |
limit |
maximal number of trees generated | 1 |
The output is a list of objects with only one field:
Field | Description |
---|---|
tree |
the generated abstract syntax tree |
The length of the list is limited by the limit parameter.
Word completion is a special case of parsing. If there is an incomplete sentence then it is first parsed and after that the state of the parse chart is used to predict the set of words that could follow in a grammatically correct sentence.
Parameter | Description | Default |
---|---|---|
command |
should be complete |
- |
cat |
the start category for the parser | the default start category for the grammar |
input |
the string to the left of the cursor that is already typed | empty string |
from |
the name of the concrete syntax to use for parsing | all languages in the grammar will be tried |
limit |
maximal number of trees generated | all words will be returned |
The output is a list of objects with two fields which describe the completions.
Field | Description |
---|---|
from |
the concrete syntax for this word |
text |
the word itself |
This command renders an abstract syntax tree into an image. Several image formats are supported.
Parameter | Description | Default |
---|---|---|
command |
should be abstrtree |
- |
tree |
the abstract syntax tree to render | - |
format |
output format (gf>3.3.3) | png |
By default, the output is an image in PNG format. The Content-Type is set to image/png
, so the easiest way to visualize the generated image is to add HTML element <img/>
which points to URL for the visualization command i.e.:
<img src="http://localhost/my_grammar.pgf?command=abstrtree&tree=..."/>
The format
parameter can also be gif
, svg
or gv
, for GIF (image/gif
), SVG (image/svg+xml
) or graphviz (text/plain
) format, respectively.
This command renders the parse tree that corresponds to a specific abstract syntax tree. The generated image is in PNG format.
Parameter | Description | Default |
---|---|---|
command |
should be parsetree |
- |
tree |
the abstract syntax tree to render | - |
from |
the name of the concrete syntax to use in the rendering | - |
format |
output format (gf>3.3.3) | png |
options | additional rendering options (gf>3.4) | - |
The additioal rendering options are: noleaves
, nofun
and nocat
(booleans, false by default); nodefont
, leaffont
, nodecolor
, leafcolor
, nodeedgestyle
and leafedgestyle
(strings, have builtin defaults).
By default, the output is an image in PNG format. The Content-Type is set to image/png
, so the easiest way to visualize the generated image is to add HTML element <img/>
which points to URL for the visualization command i.e.:
<img src="http://localhost/my_grammar.pgf?command=parsetree&tree=..."/>
The format
parameter can also be gif
, svg
or gv
, for GIF (image/gif
), SVG (image/svg+xml
) or graphviz (text/plain
) format, respectively.
This command renders the word alignment diagram for some sentence and all languages in the grammar. The sentence is generated from a given abstract syntax tree.
Parameter | Description | Default |
---|---|---|
command |
should be alignment |
- |
tree |
the abstract syntax tree to render | - |
format |
output format (gf>3.3.3) | png |
to |
list of languages to include in the diagram (gf>3.4) | all languages supported by the grammar |
By default, the output is an image in PNG format. The Content-Ttype is set to image/png
, so the easiest way to visualize the generated image is to add HTML element <img/>
which points to URL for the visualization command i.e.:
<img src="http://localhost/my_grammar.pgf?command=alignment&tree=..."/>
The format
parameter can also be gif
, svg
or gv
, for GIF (image/gif
), SVG (image/svg+xml
) or graphviz (text/plain
) format, respectively.
This command (available in GF>=3.8) outputs word dependency diagrams in various format.
Parameter | Description | Default |
---|---|---|
command |
should be deptree |
- |
tree |
the abstract syntax tree to render | - |
format |
output format, see below | dot |
to |
name of the concrete syntax to use in the diagram | - |
The format
is one of the following:
png
, gif
, gv
: rendered with graphviz,svg
, latex
: universal dependency diagrams, in SVG format for use in web pages or as LaTeX Picture code for use in LaTeX documents,conll
, malt_tab
and malt_input
: text formats
There a few additional commands that lack proper documentation:
abstrjson
, browse
, download
, generate
, linearizeAll
, linearizeTable
, lookupmorpho
, translategroup
.See the source code for details.
GF includes two implementations of the PGF API: the traditional Haskell implementation and the newer C implementation. The commands documented above all use the Haskell implementation. The following commands use the C implementation instead:
c-parse
, c-linearize
, c-linearizeAll
, c-translate
, c-lookupmorpho
, c-flush
, c-grammar
, c-abstrtree
, c-parsetree
, c-wordforword
.They implement the same functionality as the corresponding commands without the c-
prefix, although there are some restrictions in what parameters they support, and some differences in the JSON data structures they output.
When using these commands, the grammar will be loaded and cached by the C run-time system. If you use commands from both the Haskell and C implementations with the same grammar, the grammar will be loaded twice.