vectorlib_files.dox 12 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314
  1. /*! \page vlibFormat Vector format description
  2. by GRASS Development Team (http://grass.osgeo.org)
  3. \tableofcontents
  4. \section vlibDirectoryStructure Directory structure
  5. Vector map is stored in a number of data files. Vector map directory
  6. structure and file names were changed in GRASS 6 with respect to
  7. previous GRASS versions. All vector files for one vector map are
  8. stored in one directory:
  9. \verbatim
  10. $MAPSET/vector/vector_name/
  11. \endverbatim
  12. This directory contains these files:
  13. - <b>coor</b> - binary file, coordinates [former dig/ file] (see \ref vlibCoorFileFormat)
  14. - <b>topo</b> - binary file, topology [former dig_plus/ file] (see \ref vlibTopoFileFormat)
  15. - <b>sidx</b> - binary file, spatial index (see \ref vlibSidxFileFormat)
  16. - <b>cidx</b> - binary file, category index (see \ref vlibCidxFileFormat)
  17. - <b>head</b> - text file, header information [former part of dig/ file] (see \ref vlibHeadFileFormat)
  18. - <b>dbln</b> - text file, link(s) to attribute table(s) (see \ref vlibDblnFileFormat)
  19. - <b>hist</b> - text file, vector map change history
  20. - <b>frmt</b> - text file, format description (external formats only)
  21. - <b>fidx</b> - binary file, feature index (OGR format only)
  22. \section vlibHeadFileFormat Header file format specification
  23. The header contains meta information, a description of the
  24. vector map and many other information. The file is an unordered list
  25. of key/value entries. The <i>key</i> is a string separated from
  26. <i>value</i> by a colon and optional whitespace.
  27. Keywords are:
  28. - ORGANIZATION - organization that digitized the data
  29. - DIGIT DATE - date the data was digitized
  30. - DIGIT NAME - person who digitized the data
  31. - MAP NAME - title of the original source map
  32. - MAP DATE - date of the original source map
  33. - MAP SCALE - scale of the original source map
  34. - OTHER INFO - other comments about the map
  35. - ZONE - zone of the map (e.g., UTM zone)
  36. - MAP THRESH - digitizing threshold
  37. This information holds \ref dig_head data structure.
  38. \section vlibDblnFileFormat DB link file format specification
  39. Each vector maps has its own DBMI settings stored in the
  40. '$MAPSET/vector/vector_name/dbln' text file. For each pair <em>vector map +
  41. layer</em>, all of <em>table, key column, database, driver</em> must be
  42. defined in a new row. This definition must be written to
  43. '$MAPSET/vector/vector_name/dbln' text file. Each row in the 'dbln'
  44. file contains names separated by spaces in following order ([ ] -
  45. optional):
  46. \verbatim
  47. map[@mapset] layer table [key [database [driver]]]
  48. \endverbatim
  49. If key, database or driver are omitted (on second and higher row only)
  50. the last definition is used. When reading a vector map from another
  51. mapset (if mapset is specified along with map name), definitions in
  52. the related "dbln" file may overwrite the DBMI definition in the
  53. current mapset. This means that the map-wise definition is always
  54. "stronger".
  55. Wild cards <b>*</b> and <b>?</b> may be used in map and mapset names.
  56. Variables $GISDBASE, $LOCATION_NAME, $MAPSET, and $MAP may be used in
  57. table, key, database and driver names (function
  58. Vect_subst_var()). Note that $MAPSET is not the current mapset but
  59. mapset of the map the rule is defined for.
  60. Note that vector features in GRASS vector maps may have attributes in
  61. different tables or may be without attributes. Boundaries form areas
  62. but it may happen that some boundaries are not closed (such boundaries
  63. would not appear in polygon layer). Boundaries may have
  64. attributes. All types may be mixed in one vector map.
  65. The link to the table is permanent and it is stored in 'dbln' file in
  66. vector directory. Tables are considered to be a part of the vector and
  67. the command <tt>g.remove</tt>, for example, deletes linked tables of
  68. the vector. Attributes must be joined with geometry.
  69. Information about database links holds \ref dblinks data structure.
  70. <b>Examples:</b>
  71. Examples are written mostly for the DBF driver, where database is full
  72. path to the directory with dbf files and table name is the name of dbf
  73. file without .dbf extension:
  74. \verbatim
  75. * 1 mytable id $GISDBASE/$LOCATION_NAME/$MAPSET/vector/$MAP dbf
  76. \endverbatim
  77. This definition says that entities with category of layer 1 are linked
  78. to dbf tables with names "mytable.dbf" saved in vector directories of
  79. each map. The attribute column containing the category numbers is
  80. called "id".
  81. \verbatim
  82. * 1 $MAP id $GISDBASE/$LOCATION_NAME/$MAPSET/dbf dbf
  83. \endverbatim
  84. Similar as above but all dbf files are in one directory dbf/ in mapset
  85. and names of dbf files are $MAP.dbf
  86. \verbatim
  87. water* 1 rivers id /home/grass/dbf dbf
  88. water* 2 lakes lakeid /home/guser/mydb
  89. trans* 1 roads key basedb odbc
  90. trans* 5 rails
  91. \endverbatim
  92. These definitions define more layers (called "field" in the API) for
  93. one vector map i.e. in one vector map may be more features linked to
  94. more attribute tables. Definitions on first 2 rows are applied for
  95. example on maps water1, water2, ... so that more maps may share one
  96. table.
  97. \verbatim
  98. water@PERMANENT 1 myrivers id /home/guser/mydbf dbf
  99. \endverbatim
  100. This definion overwrites the definition saved in PERMANENT/VAR and
  101. links the water map from PERMANENT mapset to the user's table.
  102. Modules should be written so that connections to databases for each
  103. vector layer are independent. It should be possible to read attributes
  104. of an input vector map from one database and write to some other and
  105. even with some other driver (should not be a problem).
  106. There are open questions, however. For one, how does one distinguish when
  107. new tables should be written and when not? For example, definitions:
  108. \verbatim
  109. river 1 river id water odbc
  110. river.backup* 1 NONE
  111. \endverbatim
  112. could be used to say that tables should not be copied for backups of
  113. map river because table is stored in a reliable RDBMS.
  114. \section vlibs Vector libraries
  115. Besides internal library functions there are two main libraries:
  116. - Vlib (Vector library), see \ref vlibIntro
  117. - DGLib (Directed Graph Library), see \ref dglib
  118. For historical reasons, there are two internal libraries:
  119. - diglib (with dig_*() functions), GRASS 3.x/4.x
  120. - Vlib (with V1_*(), V2_*() and Vect_*() functions), since GRASS 4.x
  121. (except for the 5.7 interim version)
  122. The vector library was introduced in GRASS 4.0 to hide internal vector
  123. files' formats and structures. In GRASS 6/7, everything is accessed via
  124. Vect_*() functions, for example:
  125. Old 4.x code:
  126. \code
  127. xx = Map.Att[Map.Area[area_num].att].x;
  128. \endcode
  129. New 6.x/7.x functions:
  130. \code
  131. centroid = Vect_get_area_centroid(Map, area_num);
  132. Vect_read_line(Map, line_p, NULL, centroid);
  133. Vect_line_get_point(line_p, 0, &xx, NULL, NULL);
  134. \endcode
  135. In GRASS 6/7, all internal, mostly non-topological vector functions are
  136. hidden from the modules' API (mainly dig_*(), V1_*() and V2_*()
  137. functions). All available Vect_*() functions are topological vector
  138. functions.
  139. The following include file contains definitions and structures
  140. required by some of the routines in this library. The programmer
  141. should therefore include this file in any code that uses the vector
  142. library:
  143. \code
  144. #include <grass/vector.h>
  145. \endcode
  146. <i>Note: For details please read Blazek et al. 2002 (see below) as
  147. well as the references in this document.</i>
  148. \subsection vlibHistory Historical notes
  149. The vector library in GRASS 4.0 changed significantly from the
  150. <em>Digit Library</em> (diglib) used in GRASS 3.1. Below is an
  151. overview of why the changes were made.
  152. The Digit Library was a collage of subroutines created for developing
  153. the map development programs. Few of these subroutines were actually
  154. designed as a user access library. They required individuals to assume
  155. too much responsibility and control over what happened to the data
  156. file. Thus when it came time to change vector data file formats for
  157. GRASS 4.0, many modules also required modification. The two different
  158. access levels for 3.0 vector files provided very different ways of
  159. calling the library; they offered little consistency for the user.
  160. The Digit Library was originally designed to only have one file open
  161. for read or write at a time. Although it was possible in some cases to
  162. get around this, one restriction was the global head structure. Since
  163. there was only one instance of this, there could only be one copy of
  164. that information, and thus, only one open vector file.
  165. The solution to these problems was to design a new user library as an
  166. interface to the vector data files. This new library was designed to
  167. provide a simple consistent interface, which hides as much of the
  168. details of the data format as possible. It also could be extended for
  169. future enhancements without the need to change existing programs.
  170. The new vector library in GRASS 4 provided routines for opening,
  171. closing, reading, and writing vector files, as well as several support
  172. functions. The Digit Library has been replaced, so that all existing
  173. modules was converted to use the new library. Those routines that
  174. existed in the Digit Library and were not affected by these changes
  175. continue to exist in unmodified form, and were included in the vector
  176. library. Most of the commonly used routines have been discarded, and
  177. replaced by the new vector routines.
  178. Instead the global head structure was used own local version of
  179. it. The structure that replaced structure head is structure \ref
  180. dig_head. There were still two levels of interface to the vector files
  181. (future releases may include more). Level one provided access only to
  182. arc (i.e. polyline) information and to the type of line (AREA, LINE,
  183. DOT). Level two provided access to polygons (areas), attributes, and
  184. network topology.
  185. \subsection vlibCoorFileFormat Coor file format specification
  186. In the coor file the following is stored: 'line' (element) type,
  187. number of attributes and layer number for each category. Coordinates
  188. in binary file are stored as double (8 bytes). See \ref Coor_info data
  189. structure.
  190. \subsubsection vlibCoorFileHead Header
  191. <table border="1" style="border-collapse: collapse" cellpadding="5">
  192. <tr><td><b>Name</b></td><td><b>Type</b></td><td><b>Number</b></td><td><b>Description</b></td></tr>
  193. <tr><td>Version_Major</td> <td>C</td> <td>1</td> <td>file version (major)</td></tr>
  194. <tr><td>Version_Minor</td> <td>C</td> <td>1</td> <td>file version (minor)</td></tr>
  195. <tr><td>Back_Major</td> <td>C</td> <td>1</td> <td>supported from GRASS version (major)</td></tr>
  196. <tr><td>Back_Minor</td> <td>C</td> <td>1</td> <td>supported from GRASS version (minor)</td></tr>
  197. <tr><td>byte_order</td> <td>C</td> <td>1</td> <td>little or big endian flag</td></tr>
  198. <tr><td>head_size</td> <td>L</td> <td>1</td> <td>header size of coor file</td></tr>
  199. <tr><td>with_z</td> <td>C</td> <td>1</td> <td>2D or 3D flag; zero for 2D</td></tr>
  200. <tr><td>size</td> <td>L</td> <td>1</td> <td>coor file size</td></tr>
  201. </table>
  202. \section vlibCoorFileBody Body
  203. The body consists of line records:
  204. <table border="1" style="border-collapse: collapse" cellpadding="5">
  205. <tr><td><b>Name</b></td><td><b>Type</b></td><td><b>Number</b></td><td><b>Description</b></td></tr>
  206. <tr><td>record header</td><td>C</td><td>1</td><td>
  207. - 0. bit: 1 - alive, 0 - dead line
  208. - 1. bit: 1 - categories, 0 - no categories
  209. - 2.-3. bit: type - one of: GV_POINT, GV_LINE, GV_BOUNDARY, GV_CENTROID, GV_FACE, GV_KERNEL
  210. - 4.-7. bit: reserved, not used
  211. </td></tr>
  212. <tr><td>ncats</td><td>I</td><td>1</td><td>number of categories
  213. (written only if categories exist) </td></tr>
  214. <tr><td>field</td><td>I</td><td>ncats</td><td>field identifier,
  215. distinguishes between more categories append to one feature (written
  216. only if categories exist; field is called "layer" at user
  217. level)</td></tr>
  218. <tr><td>cat</td><td>I</td><td>ncats</td><td>category value (written
  219. only if categories exist)</td></tr>
  220. <tr><td>ncoor</td><td>I</td><td>1</td><td>written for GV_LINES and GV_BOUNDARIES
  221. only</td></tr>
  222. <tr><td>x</td><td>D</td><td>ncoor</td><td>x coordinate</td></tr>
  223. <tr><td>y</td><td>D</td><td>ncoor</td><td>y coordinate</td></tr>
  224. <tr><td>z</td><td>D</td><td>ncoor</td><td>z coordinate; present if
  225. with_z in head is set to 1</td></tr> </table>
  226. Types used in coor file:
  227. <table border="1" style="border-collapse: collapse" cellpadding="5">
  228. <tr><td><b>Type</b></td><td><b>Name</b></td><td><b>Size in Bytes</b></td></tr>
  229. <tr><td>D</td><td>Double</td><td>8</td></tr>
  230. <tr><td>L</td><td>Long </td><td>4</td></tr>
  231. <tr><td>I</td><td>Int </td><td>4</td></tr>
  232. <tr><td>S</td><td>Short </td><td>4</td></tr>
  233. <tr><td>C</td><td>Char </td><td>1</td></tr>
  234. </table>
  235. */