vectorlib_files.dox 12 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320
  1. /*! \page vlibFormat Vector format description
  2. by GRASS Development Team (http://grass.osgeo.org)
  3. \par Table of contents
  4. - \ref vlibDirectoryStructure
  5. - \ref vlibDblnFileFormat
  6. - \ref vlibCoorFileFormat
  7. - \ref vlibCoorFileHead
  8. - \ref vlibCoorFileBody
  9. \section vlibDirectoryStructure Directory structure
  10. Vector map is stored in a number of data files. Vector map directory
  11. structure and file names were changed in GRASS 6 with respect to
  12. previous GRASS versions. All vector files for one vector map are
  13. stored in one directory:
  14. \verbatim
  15. $MAPSET/vector/vector_name/
  16. \endverbatim
  17. This directory contains these files:
  18. - <b>coor</b> - binary file, coordinates [former dig/ file] (see \ref vlibCoorFileFormat)
  19. - <b>topo</b> - binary file, topology [former dig_plus/ file] (see \ref vlibTopoFileFormat)
  20. - <b>sidx</b> - binary file, spatial index (see \ref vlibSidxFileFormat)
  21. - <b>cidx</b> - binary file, category index (see \ref vlibCidxFileFormat)
  22. - <b>head</b> - text file, header information [former part of dig/ file] (see \ref vlibHeadFileFormat)
  23. - <b>dbln</b> - text file, link(s) to attribute table(s) (see \ref vlibDblnFileFormat)
  24. - <b>hist</b> - text file, vector map change history
  25. - <b>frmt</b> - text file, format description (external formats only)
  26. - <b>fidx</b> - binary file, feature index (OGR format only)
  27. \section vlibHeadFileFormat Header file format specification
  28. The header contains meta information, a description of the
  29. vector map and many other information. The file is an unordered list
  30. of key/value entries. The <i>key</i> is a string separated from
  31. <i>value</i> by a colon and optional whitespace.
  32. Keywords are:
  33. - ORGANIZATION - organization that digitized the data
  34. - DIGIT DATE - date the data was digitized
  35. - DIGIT NAME - person who digitized the data
  36. - MAP NAME - title of the original source map
  37. - MAP DATE - date of the original source map
  38. - MAP SCALE - scale of the original source map
  39. - OTHER INFO - other comments about the map
  40. - ZONE - zone of the map (e.g., UTM zone)
  41. - MAP THRESH - digitizing threshold
  42. This information holds \ref dig_head data structure.
  43. \section vlibDblnFileFormat DB link file format specification
  44. Each vector maps has its own DBMI settings stored in the
  45. '$MAPSET/vector/vector_name/dbln' text file. For each pair <em>vector map +
  46. layer</em>, all of <em>table, key column, database, driver</em> must be
  47. defined in a new row. This definition must be written to
  48. '$MAPSET/vector/vector_name/dbln' text file. Each row in the 'dbln'
  49. file contains names separated by spaces in following order ([ ] -
  50. optional):
  51. \verbatim
  52. map[@mapset] layer table [key [database [driver]]]
  53. \endverbatim
  54. If key, database or driver are omitted (on second and higher row only)
  55. the last definition is used. When reading a vector map from another
  56. mapset (if mapset is specified along with map name), definitions in
  57. the related "dbln" file may overwrite the DBMI definition in the
  58. current mapset. This means that the map-wise definition is always
  59. "stronger".
  60. Wild cards <b>*</b> and <b>?</b> may be used in map and mapset names.
  61. Variables $GISDBASE, $LOCATION_NAME, $MAPSET, and $MAP may be used in
  62. table, key, database and driver names (function
  63. Vect_subst_var()). Note that $MAPSET is not the current mapset but
  64. mapset of the map the rule is defined for.
  65. Note that vector features in GRASS vector maps may have attributes in
  66. different tables or may be without attributes. Boundaries form areas
  67. but it may happen that some boundaries are not closed (such boundaries
  68. would not appear in polygon layer). Boundaries may have
  69. attributes. All types may be mixed in one vector map.
  70. The link to the table is permanent and it is stored in 'dbln' file in
  71. vector directory. Tables are considered to be a part of the vector and
  72. the command <tt>g.remove</tt>, for example, deletes linked tables of
  73. the vector. Attributes must be joined with geometry.
  74. Information about database links holds \ref dblinks data structure.
  75. <b>Examples:</b>
  76. Examples are written mostly for the DBF driver, where database is full
  77. path to the directory with dbf files and table name is the name of dbf
  78. file without .dbf extension:
  79. \verbatim
  80. * 1 mytable id $GISDBASE/$LOCATION_NAME/$MAPSET/vector/$MAP dbf
  81. \endverbatim
  82. This definition says that entities with category of layer 1 are linked
  83. to dbf tables with names "mytable.dbf" saved in vector directories of
  84. each map. The attribute column containing the category numbers is
  85. called "id".
  86. \verbatim
  87. * 1 $MAP id $GISDBASE/$LOCATION_NAME/$MAPSET/dbf dbf
  88. \endverbatim
  89. Similar as above but all dbf files are in one directory dbf/ in mapset
  90. and names of dbf files are $MAP.dbf
  91. \verbatim
  92. water* 1 rivers id /home/grass/dbf dbf
  93. water* 2 lakes lakeid /home/guser/mydb
  94. trans* 1 roads key basedb odbc
  95. trans* 5 rails
  96. \endverbatim
  97. These definitions define more layers (called "field" in the API) for
  98. one vector map i.e. in one vector map may be more features linked to
  99. more attribute tables. Definitions on first 2 rows are applied for
  100. example on maps water1, water2, ... so that more maps may share one
  101. table.
  102. \verbatim
  103. water@PERMANENT 1 myrivers id /home/guser/mydbf dbf
  104. \endverbatim
  105. This definion overwrites the definition saved in PERMANENT/VAR and
  106. links the water map from PERMANENT mapset to the user's table.
  107. Modules should be written so that connections to databases for each
  108. vector layer are independent. It should be possible to read attributes
  109. of an input vector map from one database and write to some other and
  110. even with some other driver (should not be a problem).
  111. There are open questions, however. For one, how does one distinguish when
  112. new tables should be written and when not? For example, definitions:
  113. \verbatim
  114. river 1 river id water odbc
  115. river.backup* 1 NONE
  116. \endverbatim
  117. could be used to say that tables should not be copied for backups of
  118. map river because table is stored in a reliable RDBMS.
  119. \section vlibs Vector libraries
  120. Besides internal library functions there are two main libraries:
  121. - Vlib (Vector library), see \ref vlibIntro
  122. - DGLib (Directed Graph Library), see \ref vlibDglib
  123. For historical reasons, there are two internal libraries:
  124. - diglib (with dig_*() functions), GRASS 3.x/4.x
  125. - Vlib (with V1_*(), V2_*() and Vect_*() functions), since GRASS 4.x
  126. (except for the 5.7 interim version)
  127. The vector library was introduced in GRASS 4.0 to hide internal vector
  128. files' formats and structures. In GRASS 6/7, everything is accessed via
  129. Vect_*() functions, for example:
  130. Old 4.x code:
  131. \code
  132. xx = Map.Att[Map.Area[area_num].att].x;
  133. \endcode
  134. New 6.x/7.x functions:
  135. \code
  136. centroid = Vect_get_area_centroid(Map, area_num);
  137. Vect_read_line(Map, line_p, NULL, centroid);
  138. Vect_line_get_point(line_p, 0, &xx, NULL, NULL);
  139. \endcode
  140. In GRASS 6/7, all internal, mostly non-topological vector functions are
  141. hidden from the modules' API (mainly dig_*(), V1_*() and V2_*()
  142. functions). All available Vect_*() functions are topological vector
  143. functions.
  144. The following include file contains definitions and structures
  145. required by some of the routines in this library. The programmer
  146. should therefore include this file in any code that uses the vector
  147. library:
  148. \code
  149. #include <grass/vector.h>
  150. \endcode
  151. <i>Note: For details please read Blazek et al. 2002 (see below) as
  152. well as the references in this document.</i>
  153. \subsection vlibHistory Historical notes
  154. The vector library in GRASS 4.0 changed significantly from the
  155. <em>Digit Library</em> (diglib) used in GRASS 3.1. Below is an
  156. overview of why the changes were made.
  157. The Digit Library was a collage of subroutines created for developing
  158. the map development programs. Few of these subroutines were actually
  159. designed as a user access library. They required individuals to assume
  160. too much responsibility and control over what happened to the data
  161. file. Thus when it came time to change vector data file formats for
  162. GRASS 4.0, many modules also required modification. The two different
  163. access levels for 3.0 vector files provided very different ways of
  164. calling the library; they offered little consistency for the user.
  165. The Digit Library was originally designed to only have one file open
  166. for read or write at a time. Although it was possible in some cases to
  167. get around this, one restriction was the global head structure. Since
  168. there was only one instance of this, there could only be one copy of
  169. that information, and thus, only one open vector file.
  170. The solution to these problems was to design a new user library as an
  171. interface to the vector data files. This new library was designed to
  172. provide a simple consistent interface, which hides as much of the
  173. details of the data format as possible. It also could be extended for
  174. future enhancements without the need to change existing programs.
  175. The new vector library in GRASS 4 provided routines for opening,
  176. closing, reading, and writing vector files, as well as several support
  177. functions. The Digit Library has been replaced, so that all existing
  178. modules was converted to use the new library. Those routines that
  179. existed in the Digit Library and were not affected by these changes
  180. continue to exist in unmodified form, and were included in the vector
  181. library. Most of the commonly used routines have been discarded, and
  182. replaced by the new vector routines.
  183. Instead the global head structure was used own local version of
  184. it. The structure that replaced structure head is structure \ref
  185. dig_head. There were still two levels of interface to the vector files
  186. (future releases may include more). Level one provided access only to
  187. arc (i.e. polyline) information and to the type of line (AREA, LINE,
  188. DOT). Level two provided access to polygons (areas), attributes, and
  189. network topology.
  190. \subsection vlibCoorFileFormat Coor file format specification
  191. In the coor file the following is stored: 'line' (element) type,
  192. number of attributes and layer number for each category. Coordinates
  193. in binary file are stored as double (8 bytes). See \ref Coor_info data
  194. structure.
  195. \subsubsection vlibCoorFileHead Header
  196. <table border="1" style="border-collapse: collapse" cellpadding="5">
  197. <tr><td><b>Name</b></td><td><b>Type</b></td><td><b>Number</b></td><td><b>Description</b></td></tr>
  198. <tr><td>Version_Major</td> <td>C</td> <td>1</td> <td>file version (major)</td></tr>
  199. <tr><td>Version_Minor</td> <td>C</td> <td>1</td> <td>file version (minor)</td></tr>
  200. <tr><td>Back_Major</td> <td>C</td> <td>1</td> <td>supported from GRASS version (major)</td></tr>
  201. <tr><td>Back_Minor</td> <td>C</td> <td>1</td> <td>supported from GRASS version (minor)</td></tr>
  202. <tr><td>byte_order</td> <td>C</td> <td>1</td> <td>little or big endian flag</td></tr>
  203. <tr><td>head_size</td> <td>L</td> <td>1</td> <td>header size of coor file</td></tr>
  204. <tr><td>with_z</td> <td>C</td> <td>1</td> <td>2D or 3D flag; zero for 2D</td></tr>
  205. <tr><td>size</td> <td>L</td> <td>1</td> <td>coor file size</td></tr>
  206. </table>
  207. \section vlibCoorFileBody Body
  208. The body consists of line records:
  209. <table border="1" style="border-collapse: collapse" cellpadding="5">
  210. <tr><td><b>Name</b></td><td><b>Type</b></td><td><b>Number</b></td><td><b>Description</b></td></tr>
  211. <tr><td>record header</td><td>C</td><td>1</td><td>
  212. - 0. bit: 1 - alive, 0 - dead line
  213. - 1. bit: 1 - categories, 0 - no categories
  214. - 2.-3. bit: type - one of: GV_POINT, GV_LINE, GV_BOUNDARY, GV_CENTROID, GV_FACE, GV_KERNEL
  215. - 4.-7. bit: reserved, not used
  216. </td></tr>
  217. <tr><td>ncats</td><td>I</td><td>1</td><td>number of categories
  218. (written only if categories exist) </td></tr>
  219. <tr><td>field</td><td>I</td><td>ncats</td><td>field identifier,
  220. distinguishes between more categories append to one feature (written
  221. only if categories exist; field is called "layer" at user
  222. level)</td></tr>
  223. <tr><td>cat</td><td>I</td><td>ncats</td><td>category value (written
  224. only if categories exist)</td></tr>
  225. <tr><td>ncoor</td><td>I</td><td>1</td><td>written for GV_LINES and GV_BOUNDARIES
  226. only</td></tr>
  227. <tr><td>x</td><td>D</td><td>ncoor</td><td>x coordinate</td></tr>
  228. <tr><td>y</td><td>D</td><td>ncoor</td><td>y coordinate</td></tr>
  229. <tr><td>z</td><td>D</td><td>ncoor</td><td>z coordinate; present if
  230. with_z in head is set to 1</td></tr> </table>
  231. Types used in coor file:
  232. <table border="1" style="border-collapse: collapse" cellpadding="5">
  233. <tr><td><b>Type</b></td><td><b>Name</b></td><td><b>Size in Bytes</b></td></tr>
  234. <tr><td>D</td><td>Double</td><td>8</td></tr>
  235. <tr><td>L</td><td>Long </td><td>4</td></tr>
  236. <tr><td>I</td><td>Int </td><td>4</td></tr>
  237. <tr><td>S</td><td>Short </td><td>4</td></tr>
  238. <tr><td>C</td><td>Char </td><td>1</td></tr>
  239. </table>
  240. */