UNITED NATIONS
ECONOMIC COMMISSION FOR AFRICA
Regional Data Base
First Phase - Closing Seminar 5-8 July 1994, Addis Ababa
Original: ENGLISH
PROJECT RAF/89/046 ECA/IBRD/UNDP TRANSI*ORT SECTOR
Regional Transport Data Base
Addis Abeba - Fax (251)-1 -511441 /510391 - Telephone (251)-1-516551 B.P. 5515
IM KVSTKICTl UK UKQUKKI)
I OR Till: KS'l AHIJSMMI.M OlRi:<;i()NAJ, TRANSPORT 1)\TA IJAS1-: IN IX A
TABLE OF CONTENT
PAGE
1. INTRODUCTION ■ ■ 1
2. HIGHLIGHT OF TRANSPORT INFORMATION
SYSTEM IN ECA 1
3. USER NEEDS » 2
4. TRANSPORT DATA BASE STRUCTURE 6
5. RECOMMENDED MANPOWER AND INFRASTRUCTURE
REQUIREMENT AND IMPLEMENTATION PLAN 13
1 A comprehensive study dealing with the existing structure and future requirements at EGA for the collection, processing and dissemination of transport data is underway.
The study focuses mainly on reviewing the present situation and recommending appropriate infrastructure for the development, administration and maintenance of the transport data base as well all arrangements for its linkage with ECAs multisectoral statistical data base and other interested units in EGA.
2. This paper deals with the infrastructure required for the establishment of the regional transport data base which includes:
. ■ Highlights of transport information in ECA;
■ user needs;
■ the envisaged transport data base structure;
■ resource requirement and;
■ implementation plan.
3. The study is still in progress and the final recommendation shall be incorporated
after full coverage.
2. Highlight of transport information system in ECA
2.1 Present situation4 Presently transport data collection activities in ECA are undertaken by various sections units and modal experts in an ad hoc basis. Major sources and methods of collection include national publications of member States, staff mission to member States though not particularly for collection of data, regular reports from regional and international organizations and publication and regular reports of the various subregional
organizations in Africa.
5 Although the regional statistical data base (ECA-RSDB) incorporates the various transport modes it does not seem to satisfy the needs of transport data users in Transport, Communications and Tourism Division 0"CTD) for the following basic reasons.
■ The content of the statistical year book is mostly dominated by estimated values,
which undoubtedly affect the reliability of the data.■ Due to delays in the process of updating the available data, official publications like the statistical year book are not issued in time. At present latest issue of the statistical yearbook is that of 1990/91,
. The ECA RSDB does not cover fully the data needs of TCTD. At present only 16 transport data items are available in the present statistical data base for roads, road transport, railway and ports sub-sector. On the other hand the.total number of data items identified for the transport sector performance indicators is over 400.
Therefore, the existing data base comprises less than four per cent of the total
requirement.
6. Thus, the trahsport data situation requires complete change so as to make it more sound for use by experts in ECA and elsewhere.
2.2 Envisaged transport data base programme for ECA
7 Establishment of adequate transport statistics depends on the efficiency at the national level. Thus, ECA's basic role shall be, in cooperation with other agencies, to exert maximum effort in improving national data base situation. In the process ECA shall be in a position to collect, process, store and disseminate data and information to users.
8. Detailed coverage of the transport data base to be established in ECA is fully
covered in chapter 4 below.
3. USER NEEDS 3.1 General
9 Four user groups have already been identified during the previous studies1. At
the operational management levels users require data for monitoring and control of operations as well as preparation of budgets and investment plans. At the sectoral and sub-sectoral: level (mainly ministries and commissions) require data to monitor and control operational entities within the sector as well as make sector specific resource allocation and planning decisions. At the national level (mainly national planning commissions, and central government authorities) require data for monitoring the performance and needs of the different sectors and sub-sectors of the economy as well as in inter-sectoral resource allocation and policy development. At the international leveldata is required to
undertake:
1 A transport data base for Sub-Saharan Africa by
K.W.Studnicks - Gizbert and John Heads, June 1992.
. international comparisons which is used as instrument of experience transfer;
. an overall assessment of regional requirements and as input for regional studies
and development of regional programmes;
■ market research and planning at international scale;
. regional planning so as to accommodate the needs for various services specifically
for land-locked countries.
10 The focus of this chapter shall be to provide highlights of these needs at national and international level, and undertake the necessary ground work for establishing regional
transport data base in ECA.
3.2 Needs at the national level
11 The advance in technology is creating a situation where by countries are becoming more inter-dependent on one another for raw materials, markets, technology and technical know-how and, in the case of land-locked countries for access to various services. This interdependence is more emphasized in the transport and communication
sector. ,
12 Roads and road traffic crossing international boundaries, railway and railway traffic operating between two and more countries, ports, shipping lines, international airports and air traffic, are all international in nature and thus create significant impact on the development and social well-being of the nations affected. /
13 The primary objective of the establishment of transport data base in ECA isto provide appropriate service to all member countries of ECA. Member States shall particularly require the following data and information on transport sector.
(a) Roads and road traffic
14 Land-locked countries are vitally interested in the condition and road use policies
of transit countries. Particularly the land-locked countries need information relating to the
condition of the relevant routes as well as overall road and traffic situation, management
and cost (user taxes, weight and axle-load limitations, etc). On the other hand as regional
and subregional road networks become more and more integrated, road condition and
traffic data of other countries becomes more and more vital for national planning.
(b) Railways and Railway traffic
15 Where railway lines cross international borders and serve two and more countries the need for data on railway conditions and railway traffic is similar to that of roads and road traffic. In addition countries would like data for comparison and transfer of
experience.
(c) Air transport
16 This sector has already achieved high level of data and information integration through international agencies like ICAO. However, there remains a lot to be done in the area availing the stored information to members State in a user friendly manner so that it can be used effectively for national planning and economic management. Though air transport accounts for small part of total transport needs it has a considerable impact in the national economy. In particular it plays a key role in:
■ international passenger transport;
. import/export of spare parts, high value goods, perishable products;
■ facilitating religious pilgrims etc.
17 Air transport industry is one of the most competitive sectors in the world and, thus to improve service and productivity, nations require up to date comparative data of other
nations and various economic sectors.
(d) Ports and shipping
18 Ports and shipping play a key role in international cargo transport. The efficiency at ports and the dependability of ships adversely affect the dynamism of national economies. Port and shipping data is crucial in any national planning. Specially land locked countries require data from other nations not only for national plans but for day
to day operation.
3.3 Need at the regional and subreaional level
19. ECA and other subregional organizations operating within or outside the ECA system play a key role in facilitating the economic development of Africa. In particular ECA is mandated to play a crucial role in strengthening the economic relation of countries and territories of Africa, both among themselves and with other countries of the world.
20. In' the field of transport ECA, in cooperation with other agencies, is responsible for:
■ conceptualization and promotion of concerted plans for the orderly development of an integrated Pan-African transport and communications network;
■ drawing up of national plans by integrating subregional ones where they exist and,
where they do not, formulating them in cooperation with governments and relevant agencies;■ coordination of the overall decade activities in order to achieve rational, the most cost effective and synchronized development of the transport network;
■ making proposals for concrete actions for the standardization of systems where
necessary; and■ rendering technical assistance to the inter-governmental organizations when they
need assistance.
21. The activities outlined above are, among other things, the driving force which require accurate and timely transport information and the establishment of regional
transport data base in ECA.3.4 Need at the international level
22. International financing institutions, donor agencies and all economic and social
oriented organizations require transport data for:■ project evaluation and financing;
■ assessment of impacts of transport on other economic sectors and development as a whole;
■ monitoring project performance;
■ evaluation of projects;
■ focation of projects;
■ undertaking research of every kind;
b planning training programmes; and
■ analysis of development gaps and allocation of resource aimed at reducing these
gaps.
3.5 Analytical requirements of UNECA
23. Principally the transport data needs of" ECA are highly aggregated in nature with appropriate international comparisons. In order to satisfy these needs the system to be developed shall basically be highly flexible and provide room for changes and growth.
For the start the indicators identified and agreed upon during the launching seminar shall
be the basis. The system shall maintain both the data and indictors separately and shall provide the necessary tools for moving from one mode to the other and vice versa.24. Data and indications shall also be aggregated to show national, subregional,
regional comparisons with similar data of comparable characteristics.25. Other aggregation and comparison shall include:
■ intermodal within each country (e.g. no of passenger using various modes of
transport, accident, situation, financial"situation, investment, productivity etc.)
■ Intermodal on subregional and regional basis;
■ Level of dependency on various modes at national, subregional, regional level;'
■ comparison against internationally accepted norms;
■ growth/decline rates of various modes over a number of years.
26. The system shall be capable of these, and other similar needs of today and for
some years to come.
3.6 Dissemination of information and its frequency
27. The updating of data shall be on a continuous basis and its dissemination shall be as soon as the updating of a targeted year is complete.
28. The basic tool for data and information dissemination shall remain to be the annual transport statistics year book and similar documents. ' •
29. Users shall also have access to the transport data through electronic means such as direct on-line access, E-mail service, CD-ROM, and other magnetic media. For ease of use proper documentation shall be produced and issued to users.
4.
30. Detailed design of the transport data base including record layout for inputs and files, screen layouts, output, layout etc. shall be worked out during the design phase of the project. In this chapter a conceptual framework of the data base for estimating the necessary resources is included.
4.1 Record structure
4.1.1 Data file
31. There are a lot of options available for structuring data file record. For the purpose of estimating data volume and data preparation, the record structure shown below shall be used. Accordingly, the data base shall be organized by data item, subregion-wise within each data item and country wise within each subregion. Each record is designed to contain time series data for about 20 years. The entry level record layout of the data file shall be as follows.
Data Item
Sub- Region Code
Country Code
Unit of Observation
Starting -
■ Year
Data Yeari
1
Footnote j Data Yearl '..Year '20
ii :
Footnote Year 20
The relative size of each data element in the record shall be indicated in Table 1
Table 1 Relative size of data elements in data file recordDescription Data item code Subregion Code Country code
Unit of observation code Starting year
Data-year Footnote year 1 Data-year 1
Footnote year 20 Total Record Size
Size and characteristics
6 " ' , .
1 3 2 4 10 3
10
■
3 256
Alphanumeric Alphanumeric Alphanumeric Alphanumeric Numeric Numeric Alphanumeric
Numeric
■
■
Alphanumeric
6 bytes32. It should be noted that the above structure is used only for determining the data
volume and other relevant estimates.
33. In order to satisfy the user needs outlined in Chapter 3 above the data shall also be organized by country, by subregion, by region, by mode of transport etc. Issues related to organizing data in relation to user needs is expected to be dealt extensively during the design phase of the transport data base project.
4.1.2 Reference files
34. The following basic reference files shall be maintained.
(a) Data series description file
Code 6 Alphanumeric
Name of series - English 40 Alphanumeric Name of series - French 40 Alphanumeric
Total 86
Subreaion description
\ f — *m ■
Code
Name - English Name - French
Total (c^ Countrv Code
Name Currency
Total
(6) Unit of observation Code
Description Total (e) Footnotes
Code Description
Total
Data Volume
1 40 40 81
3 40 20 63
2 20 22
3 40 53
Alphanumeric Alphanumeric Alphanumeric
Alphanumeric Alphanumeric Alphanumeric
Alphanumeric Alphanumeric
Alphanumeric Alphanumeric
4.2
35. The total estimated data size for the entire transport data base including reference files is about 7.6 MB. Estimated data size for initial establishment for the transport data base is 1,508 MB
4.3 Data capture modes and frequency
4.3.1 Data capture modes
36. This system assumes that the majority of data and information shall be provided by focal points in member States. Therefore, it is difficult to specify the type of data capture mode to be employed without sufficient background on EDP development in each
country. Thus all the following alternatives can be considered applicable for the study at hand.(a) Data shall be received from member States through formats or completion/amendment of computer printout to be sent regularly to the member States.
Data can also be received in a form of regular printed publication from both the member States, subregional and international organizations. In such cases the data entry shall be carried out by EGA staff.
(b) Data can also be received from member States and other International organizations through use of magnetic media (e.g. diskettes). In such cases the data received shall be reformatted (if not in the same format as the data base) before updating.
(c) Data base files residing in International organizations, subregional organizations and more advanced focal points in member States can be accessed on on-line or through
a bulletin board systems (E-Mail). Both system provide very efficient access to data.
However, on-line access of data bases is not recommended due to high telecommunications costs. On the other hand E-mail is efficient and cheap method of data transfer. Similar method outlined in 4.3.1 (b) above shall be used for updating the data base.
4.3.2 Data capture and updating frequencies
37. The regional data base is assumed to contain time series data of annual national transport statistics. Therefore, data to be collected from both national, subregional and international organization shall be on yearly bases. However, the actual receipt of data depend on the efficiency and the willingness of each country. Thus, data capture and updating shall be undertaken continuously as and when data is received. It should be noted at this juncture that the overall implementation of the transport data base system is primarily dependent on the data to be collected from member States. Therefore, properly designed and agreed data collection methods and frequencies should be in place for proper implementation of the regional transport data base.
4.4 Data preparation estimates 4.4.1 Basic assumption
38. The estimates for coding/checking, data editing and edition for the data base are based on the following basic assumptions
(a) Total data in each transport sub-sector
■ Road sub-sector
■ Road transport sub-sector
■ Railway sub-sector
■ Ports sub-sector
■ Air traffic
■ Airports sUb-sector
■ Inland Waters
■ Urban transport
(b) Number of countries (c) Number of ports
(d) Total data size for the entire data base
(e) Reference files size
(f) Estimated data size for initial establishment - first year
■ data file
■ reference files TOTAL
(g) Annual data
(h) Annual Working days
(i) Daily effective working hour
160 150 74 50 20 20 30 50
<
7.6 million .714 million
items items items items items items items items 53 60
bytes bytes
.794 MB .714 MB 1.508 MB .8MB 240
(j) Average production rate per hour
per operator 5000 bytes
(k) Assumed mode of data capture**
75 per cent through direct data entry 25 per cent through electronic means 4.4.2 Checking and coding
39. During the initial establishment phase of the data base, about 1.5 MB of transport and reference files data shall be checked, coded and entered into the system. Checking and coding of the initial data requires relatively qualified personnel in the field of transport statistics with strong computer systems background. The checking of the annual transaction of about .8 MB shall be undertaken on a continuous basis.
4.4.2 Keying-in, Verification and editing faVThe initial set-up
40. The initial establishment requires keying-in, verification and editing of about 1.5 MB data. This task requires about 4-5 man months of a qualified secretary with a data
encoding background.(b) Annual data
41. As indicated above about .8 MB data shall be received in printed form from member States and other sources which requires keying-in, verification and editing. The task is of a continuous nature and therefore requires a permanent staff.
4.5 Data-collection through other means
42. Data from countries with advancing data bases and international institutions is planned to be received through magnetic media i.e. diskettes, CD ROM and Electronic bulletin boards. Data gathering through electronic means and transcription into the ECA's
data bases format requires highly qualified personnel with strong systems and programming background.The estimate not based on any study or past experience. Thus indicate only possibility.
4.6 Data variable description and data base structure
43. The initial data base shall consist of data and reference files described in chapter 4.1 above. Once the data is entered and fully edited a lot of archiving, and data
organization shall be undertaken.44. In order to satisfy the needs of experts of varied background both at ECA, national, regional, subregional and international organizations the transport data shall be organized
in various ways.
4.7 Information distribution
45. Similar to data collection the distribution of transport data to users shall be
undertaken through various means.
46. Some of the major means are listed hereunder.
(a) Experts in TCTD and other selected professionals in ECA shall have direct access
to the data base through the local and wide area networks. These experts shall have the capability to display data on their terminals get printouts or manipulate data according to their needs. The data base can also be downloaded to diskettes for any interested users.
(b) Advanced national focal points of member States could have direct on-line access or use a bulletin board system to download data to their computers.
(c) Compact disk-read only memory (CD-ROM) technology could be employed to
distribute data to users. End users of CD-ROM data should have the necessary equipment so as to enable them use the system(d) Periodic statistical publications in a form of books, brochures and printouts shall constitute the major media of information distribution. The advance in desk-top publishing
has created the necessary environment for developing qualitative printouts and therefore
its use shall be maximized both qualitatively and qualitatively.5, Recommend manpower and infrastructure requirement and impiementation plan 47. The basic resources required to run the system include computer hardware, software and technically oriented manpower. The overall study is still in progress and, therefore; only the highlights of resource requirement are presented in this chapter.
5.1 Hardware
48. The overall hardware shall ;be on a distributed environment where users in TCTD
shall be provided with the necessary micro computers and other user friendly toolsenabling them to have direct access and manipulation power of the data base. The
micro-computers shall be linked to the division's server and the division server shall be
linked to ECA-wide common server. In order to enable remote access both the work stations and the servers shall be equipped with modems.49. The following list indicates the major equipment required for the Transport data
base. ■ ,
Data base server
50. The main data base server should be powerful pentium 66 MHz processor, equipped with 64 MB random access memory (RAM), 2.6GB or above hard disk, built in CD-ROM drive and appropriate back up facilities. The machine shall be attached to high resolution scanners and high speed laser printers.
Work stations
51. About 16 work stations to be linked to the data base server are required. The existing micro-computers must either be up graded so as to provide faster response time and storage space or should be changed completely.
5.2 Software
52. Both the servers and workstations shall operate under UNIX environment. The work station shall be pre-loaded with latest version of windows, graphic software, and the necessary application software to be selected at the later part of the study.
5.3 Manpower and organization
53 Development, maintenance and administration of the envisaged data base requires the assignment of full time professionals of varied background.
fa) Development of harmonized methodologies for data collection and processing
54. The project involves data collection from member States, subregional, regional and international organization. The data to be collected should be in a harmonized manner.
This involves the implementation of data definition, and collection methodologies in member countries for the transport sectors for which such manuals have already been developed. In addition methodologies and manuals ought to be developed for air and urban transport.
55. The functions outlined above requires deployment of Data and Systems
harmonization report with strong background in transport statistics.(b) System development, maintenance and administration
56. The project involves the task of undertaking systems development, and maintenanqe and administration which includes standardization of forms, flow of data, processing outputs, hardware, software, systems analysis, systems design, programming, management of an overall operating environment, administration of LAN's, development of linkages between data bases of varied origins, structures and systems.
57. Most of the functions outlined above can be undertaken primarily by Information
Systems Section. However, due to specific characteristic and special attention required
for the transport data base project TCTD should be able to build within itself a capability to undertake sector specific systems development, systems maintenance, transport data base administration, administration of the divisions LAN and provision of all the necessary technical support for users within the division.58. The person to be deployed to undertake this function must have strong technical and managerial background with experience ranging from programming, systems analysis, systems design and the overall development, administration and maintenance of data bases. The same individual can serve as the Transport data base unit manager.
(c) Transport data base development, maintenance and administration
59. This involves the administration and maintenance of the transport data base including:
■ new entries and deletion of records;
■ data collection, editing, updating, estimation, maintenance of data dictionaries and reference files;
■ data base administration including general administration, control and security of the data;
■ data distribution and dissemination including statistical bulletin, quarterly and annual statistical publications, and the facilitation of data transmission through electronic means and direct access capability.
60. This function requires deployment of a professional with strong statistical information systems background and sufficient training and experience in data base
administration and management.
61. The data base development and maintenance work involves a lot of routine work such as data entry, editing, checking and similar tasks which requires additional assistants. Thus, junior professional with background in statistical publications in data
base environment should be deployed.62. Thus, it is recommended to create the posts of Senior Systems Analyst, Senior Data Harmonization Expert, Data base administrator and Assistant to the data base administrator. Tentative organization chart for Transport data base unit at ECA is
included in Annex 1.
5.4. Tentative Implementation plan
63. Due to complexity of the project its implementation should be undertaken in two
phases. .
Phase One - Pre-implementation testing
64. This involves:
■ undertaking detailed design of the data base system;
■ identifying and creating the necessary link with few member countries who can and are able to supply data for the testing;
■ identifying and creating the necessary link with International organizations and make arrangement for supply of data;
■ Collecting data from selected member States, subregional and international organizations and undertake full testing of the system;
■ Disseminating the output data in various forms and means and collecting feed back so as to make the necessary amendments for the final implementation.
Phase two - fuli implementation
65. Full implementation involves the extension of the test run to other member States.
Apart from this the implementation shall involve activities such as:
■ institutionalization of continuous data collection at national, subregional levels;
■ establishing strong and reliable data and information exchange network with national, subregional and international institutions engaged in similar activities;
■ Updating of manuals and working documents which will facilitate data interchange function at a minimum possible cost.
66. It should be born in mind that the implementation function of the project shall
involve a lot of visits to selected member countries, subregional and international
organizations. It also involves conducting training oriented workshops and seminars
aimed at exchanging experience and transfer the know how to personnel involved in the
project at various levels.ECA'S TRANSPORT DATA BASE
TENTATIVE ORGANIZATIONAL CHARTAnnex I TRANSPORT DATA BASE TENTATIVE ORGANIZATIONAL CHART
Transport Data Base Unit Manager (Sr. Systems Analyst)
Sr. Systems Analyst Administrator
Asst. Data Base
ECA'S TRANSPORT DATA BASE :
ELECTRONIC TRANSMISSION APPROACH
Introduction
1. The Transport Data Base (TDB) will be used fpr several purposes:
Staff at ECA and especially at the division of Transport, Communications and Tourism (TCTD) as reference for the production of reports and documents in relation with their work program; and
External users (ECA member states, researchers and decision makers) would have to be kept informed on new developments, up-to-date and comprehensive information on transport issues in Africa.
2. The infrastructure in data communications at ECA should accommodate electronic access to the Transport Data Base. The existing communication links comprise of
(i) an Alternate Voice/Data (AVD) line, between UNHQ, New-York, and UNECA, Addis-Ababa, operating at 9.6 Kbps in data mode for 3 hours on working days and in voice mode for fax and telephone communications for the rest of the time;
(ii) and public telephone lines for computer to computer asynchronous communications.
While online access to the Transport Data Base will be easily available to the local user at ECA, some distant users equipped with PC and appropriate communications software should be granted online access. Electronic transmission between some African countries is generally inefficient and expensive at present. Transport Data Base can be downloaded on any magnetic media and transferred offline to the end-user.
Some Developments of General Interest
3. A major development in the secretariat has been the installation of Local Area Networks (LANs) for several Divisions. The LAN system will be extended to all of ECA, including the TCTD division, when the new Digital Telephone Systems (PABX) is installed, some time early in 1995. With the PABX new communications cabling will also be installed and will provide the basic infrastructure for both voice and data at high speed.
4. A satellite earth station system is expected to be installed with consequent improvements in telecommunications. This station is one of the nodes of the UN global telecommunications Network being installed organization-wide. With regard to the data communications, a packet switch capable of providing asynchronous and synchronous communications with different speed and X.25 protocol support, will be put in place.
5. The E-mail network for the United Nations, which is based on cc:Mail software, allow cc:Mail users to exchange mail with registered e-mail subscribers in other UN agencies as well as with people who have e-mail addresses in other organizations. Linkage with other e-mail systems is provided through Internet and X.400 gateways. These gateways allow users to exchange e-mail with individuals outside the UN who have Internet and/or X.400 addresses.
The basic unit of an electronic mail transaction is the message, which could vary in length from one-line to several pages. A message can be accompanied by an attachment, that is, any file which can reside in the PC, such as WP document, a spreadsheet; a data base file, or an image format file.
Application Access
6. Transport Data Base can be accessed on line by dialling in through a modem or by means of using a bulletin board system (BBS).
Telecommunications costs in some countries in Africa are still very high and on-line searches of Transport Data Base might be very expensive.
In the case of BBS option requests could be posted in a mail box at TCTD, and TCTD management would process the requests and post in return the results in the user's mail box.
This approach could be used to exchange information such as data files produced at the national level.
Fax/Telex - from - PC
7. Texts and printed materials on Transport information generated from the Data Base could be sent directly from PCs as faxes to user. For hard to reach users, these materials could also be sent as telexes.
Others Types of Communications
8. Stand alone fax machines and Telex might be used to exchange texts materials generated on PC or on paper. The traditional Postal services are required in sending not only diskettes but also any magnetic media containing transport data, printouts, or publications.
DISSEMINATION ISSUES
national statistical offices (NSOs). this information after passing through routine compilation procedures , known as ^statistical systems" is fed into information systems geared towards planning monitoring and evaluation of national social and . economic development. Regional and international agencies are also engaged in compilation of usually macro-level / statistical series in order to provide comparative country profiles containing socio-economic indicators of development required for harmonization of regional economic policies, a necessary condition for promotion of regional groupings.
2. Due to an acute shortage of resources available at the national statistical offices of the region data needs of the public at large are rarely assessed and efficient and timely dissemination through traditional publications in itself is considered a major achievement.
3. Any type of statistical compilation, whatever the complexity in the steps involved... data gathering, editing, aggregation, statistical analysis, estimation and inferences etc., normally must .lead to dissemination of the compiled data in one form or another to the target users. From the outset it may be important to appreciate the difference btween two closely related processes namely data distribution and data dissemination.
2.0.0. Data distribution
4. Data-distribution, refers to the distribution of micro-level data (cleaned, recoded, "standardized" and structured and accompanied with at least the minimum data descriptions etc), to the other users, usually within the data-generating organization or closely associated to that organization. The data structures are normally rectangular in case of semi-processed distribution, but in the case of fully processed distribution these structures may be hierarchical, relational etc., depending upon the data base management system (DBMS) technique used.
5. Some of the data-distribution practices applied by the national statistical offices (NSO) may be data
distribution between NSOs and their regional offices,
line ministries, banks and other bonafide researchers
etc., for further processing and analysis and eventual dissemination.level data are distributed but in certain cases micro- level information has also been distributed. One example which may easily be cited is that of- Dynamic Demographic Data Base (DDB) maintained by the International Statistical Institute (ISI) where fertility and other demographic data are archived in the form of Standard Recode Files (accompanied by the necessary data transformations, descriptions and file , structure details etc), relating to around fifty countries. Through agreement, these files are distributed to the interested researchers world-wide for further analysis and dissemination in the form of technical papers etc.
3-0-0- Data dissemination
7. By definition data dissemination involves provision to the end-user the data aggregates in the form of a statistical presentation accompanied by the relevant meta-information. Dissemination requires further processing of data before the compiled data may be released to the target users in the form of aggregated data matrices, tabulations, graphics, maps and other statistical presentations.
8. As opposed to the information relating to
individuals, for inferential purposes it is the data
relating to groups which is usually released.Dissemination is usually a final step,in the process of data compilation resulting in presentation of macro-level information aimed at a market much wider than statisticians, planners and researchers alone, including among others students, pressure groups, marketers, advertising firms, politicians and the mass media etc.
9. Various methods used for data-dissemination are
discussed below, while other related considerations
which may need to be taken into account are listed inthe section 4.0.0.
3-1.0- Dissemination products i
10. There are various modes statistical data may be presented to the users. Depending upon the user needs and the technical complexity data may be disseminated as data-bases, tabulations, graphics,, maps, pamphlets
3.1.1. Statistical publications
' ! ' ■ '■ '■■
11. Periodic (monthly, quarterly and annual) statistical publications disseminated in the form of books, brochures and pamphlets and printouts are the basic responsibility of any statistical office. Due to the advances in the micro-computing and particularly desk-top publishing techniques, the quality of these products can be vastly enhanced.
However large-scale tabulation printing still may be more efficiently produced through the traditional printing procedures. Many of these printing machines have themselves been upgraded to accommodate the state-of-the-art processes and eliminating a number of intermediate steps involving time-consuming photo- reduction lithography etc. For small scale printing and specially when graphics work is also disseminated, colour printers may be utilized most effectively.
3.1.2. Thematic publications and social atlases
12. Country/district profiles, summary data supported by graphics and text can prove to be very popular.
Pictorial presentation of statistical data relating to cities, rural/urban comparisons (and internationally for_ countries and regions) providing indicators of socio-economic development may be disseminated to a wide market.
3-1-3. Haos and charts
13. Maps showing country/district boundaries and filled with the relevant information in the form of bars and pies are another type of simple but attractive dissemination of information.
3.1*4. Microfiche
14. Mostly, demand for this product is in the form of tables and matrices. This type of dissemination is primarily aimed at reference work and data extraction
; for secondary dissemination can be cumbersome.
15. ^ It is one of the early electronic dissemination media used for distribution and dissemination of Targe-scale data sets in the form of matrices along with the relevant meta-information. Depending upon the data structures and storage techniques used magnetic tapes can provide capacity for storing over 150 MB of information. Magnetic tapes are not only bulky for transmission but usually these may only be used on mainframes and mini-computers and often need purpose-built computer programs to view, extract and further process the information contained on them. l
(b) Floppy disks
16. Since the advent of the micro-computer and particularly since the wide acceptance of "DOS^- standard" operating systems, the dissemination of . . small to medium size data-sets has been effectively carried out through this medium, mostly utilizing spreadsheet or data base micro-computer utilities.
The data so supplied can not only be viewed on screens and printed, but also may be further processed or merged with the host data without much programming effort.
(C) CD-ROM
17. Compact Disk - Read Only Memory (CD-ROM) which like a floppy disk is much less bulkier than the magnetic tapes but provides much larger storage : : capacity (often over 600 MB) and its application like floppy-disk is much wider than magnetic tapes. A special CD-ROM drive (not too expensive) is needed to read such disks, however creation of such disks is much more expensive. IMF over the recent past has conducted an extensive research looking into the prospects of dissemination of their statistical products (which used to be distributed through magnetic tapes) through CD-ROM and an excellent paper on the topic has been released by IFS. Currently, parallel with its traditional practice of tape distribution, IMF is disseminating much larger and more complete data sets regarding IFS, GFS, DOT, and BOP through this medium.
possibility of updating and expanding the data according to their own needs may provide a good tool for dissemination to the institutions, engaged in statistical compilations or research. Dissemination modes of data bases are usually menu-driven necessarily with an extensive amount of meta- information and documentation. World Bank's STARS (Socio-economic Time series Access and Retrieval System), FAOs AGR9SSTAT and on a much larger scale IMF's periodic CD-ROM distribution are some of the examples of such dissemination. Statistics Sweden uses PC-AXIS for dissemination of their census data sub-sets.
3.1.7- Direct Access
19. Dissemination could also be achieved through establishment of direct access linkages via communication networks where user can tap into the information by connecting to the host computers through telecommunications and modems. In Western Europe and North America a whole host of such
information is disseminated to the user's offices and
homes. Here, depending upon the scope of user's subscription the users are provided access to only the agreed types of information.20. Electronic Data Interchange (EDI) is a special application of direct access methodology. Through establishment of reliable communications network EDI may be extensively used for exchange of data between various national offices inter-government, as well as international agencies.
4-0.0. Dissemination considerations 4-1-0. Primary dissemination
21. When the data compiled by the data generating agency is disseminated by itself it is termed as primary dissemination. Most of the dissemination carried out by the NSOs and international agencies falls into this category.
processing, data transformation and presentation changes etc. However, if this data is disseminated in its original primary dissemination form then it will be termed as secondary dissemination and special permission from the primary disseminator must be obtained for such dissemination. , ■,
4-3.0- Timeliness
23. One of the major objectives of data compilation has to be timely dissemination of the compiled data for appropriate planning and decision making.
Population and housing censuses are taken for wide- ranging medium to long-term country planning, but in the African region typically, the detailed census results are disseminated after two to five years delay. Adequate resources must be made available for timely and exhaustive dissemination of any kind of statistics needed for planning.
24. A casual user of statistical information does not appreciate long delays in data compilations and refinements and would rather prefer quick and easy access to rudimentary statistics. More "accurate"
data is essential for completion of time series and model-building but an alternative "second-best"
product (like the preliminary census results etc.), should be made available without a longish time-lag.
4.4-0- Conf identialifcy
25. As discussed earlier, normally only aggregated data are released for dissemination. However, when need arises micro-level data related to individuals and enterprises etc., may only be released when it is certain that individual units, cannot be identified through association of information.
4-5-0- Marketing of dissemination products
26. In the African region, so far very little effort has been directed towards marketing of the statistical information to the potential users. Lack of dissemination products through delay in processing of
information has been a major factor in not exploiting
advertising firms, mass media,, libraries and students
etc. ;
; 27. Conferences, seminars, workshops and advertising are essential in iaunching of new statistical dissemination products. In this connection
monthly/quarterly statistical product bulletins (appetizers!) providing details such as:
New product (highlights);
topic, period and scope of coverage;
- mode of availability;
subscription details, etc., are of vital importance.
4-6.0. Product planning and dissemination target
28. Most of the current dissemination products are being produced due to the legislative demands expected of these agencies. Alternative/additional products have to be carefully designed to meet the user needs of the target group. For example CD-ROM product for the type of user who is interested in manipulation of massive amount of information and of course the user has to be equipped with the necessary tools to fully utilize the disseminated product,
29. There are many potential users who are not aware of existence of and the ease of access to various types of statistics which they can be interested in.
National statistical offices must evaluate their potential market and according plan the dissemination products, for example pamphlets and pocket-sized booklets for the casual user and more detailed products for the researchers and statistical departments of other ministries.
30. Apart from dissemination to the international agencies as part of mutual agreements, international and regional dissemination may successfully be utilized for instance for export promotion, to attract foreign investments and for development of intra- regional trade and consequently regional economic
groups.
associated definitions, footnotes, limitations/constraints, data sources, methodologies (regarding projections and estimates, and missing data etc.), classifications, labels and titles and if distributed in the form of dissemination data-bases, all the necessary documentation, instructions and related computer programs etc. Provision of complete meta information relating to the data becomes doubly important when micro-level data is distributed in its semi-processed form and is expected to be further processed and disseminated. It is important that
internationally accepted classifications are used and if special circumstances dictate the usage of non- standard classifications then it is important that
"conversion-tables" with sufficient explanations are also disseminated so that the information may be interpreted correctly.
4-8.0- Data-exchange interfaces
32. Traditionally, the data dissemination being the 11 end-product", of the chain of compilation procedures it has not been planned as a product suitable for re- compi1ation but lately due to extens ive usage of micro-computers and the ease of data portability across various systems the disseminated products have frequently been used as input to the other host data bases.
33. Now-a-days distributed products are also accompanied with some sort of "linkage kit" which enables transfer of data between the disseminated product and other data bases for completion or comparison.
4-9.0. Computer viruses
34. Finally, due to the ease of wide-ranging dissemination specially through the electronic media, a word of caution regarding of-discussed computer viruses must be made. One must always be prudent in ensuring that the floppy disk received are virus-free before embarking upon copying and using these disks onto the host computers.
DATA / INDICATORS TO BE INCLUDED IN
TRANSPORT STATISTICAL YEARBOOK [ROPOSAL]
I. GENERAL INFORMATION
1.1 Basic Indicators 1. Estimate of mid year population (millions)
2. Total area of the country (thousands of square kilometers) 3. GNP per capita (us$)
4. Average annual growth rate of GNP per capita (%) H. ROAD AND ROAD TRANSPORT
2.1 Inventory Statistics 1. Total length of road network (km)
■ 2. Length of classified road network (km) 3. Length of paved road network (km) 4. Length of unpaved road network (km)
5. Vehicle in use by classes of vehicle (number and ton).
6. Number of road transport controls on main roads 7. Total number of employees
8. Number of professionals 9. Number of technicians lO.Number of workers (labourers) 11 .Number of equipment by category
12.Vehicle loading capacity by classes of vehicles (number and ton).
13.Number of driving schools.
2.2 Traffic Statistics 1. International freight traffic (tons and ton km) 2. National freight traffic (tons and tkm)
2.3 Performance indicators 1. Proportion of good, fair, poor for classified network (%).
2. Spatial road density (kin/km3) 3. Per capita length (km per person)
4. Average annual growth rate of paved roads (%) ,5. Average annual growth rate of unpaved roads (%)
6. Average annual growth rate of classified road network (%) 7. Average annual growlh rate of totaf road network (%) 8. Average annual growth rate of spatial road density (%) 9. Average annual growth rate of per capita length of road
(*) " .
10. Number of worker per length of road maintained (number/km)
11. Professionals per total employees 12. Technicians per total employees (%) . ■
13. Equipment rented by highway authority from highway authority (cost and percentage) ■ ■
14. Average age of equipment items (years or months) 15. Proportion of owner operators of the total fleet (%).
16. Proportion of companies of the total fleet (%) 17. Proportionof own account of the total fleet (%).
18. Proportion of parastatals of the total fleet (%) 19. Vehicle fleet age by type (years).
20. Load factor (%)
21. Transort delay for major international roads (hrs or days) 22. Number of deaths per 10,000 vehicles oh road 23. Number of deaths per 100,000 inhabitants
Transport Statistics year hook
Transport Statistics year book
Transport Statistics year book
Transport Statistics year book
7. Periodic maintenance cost (amount per km) S. Routine maintenance cost (amount per km) 9. Construction cost per total budget (%) 10. Rehabilitation cost per total budget (%) 11. Periodic maintenance cost per total budget (%) 12. Routine maintenance cost per total budget (%) 13. Personnel expenditure per total budget (%)
14. Construction and maintenance expenditure per total budget (*),.-.
15. Actual per required for the four categories and total expenditure (%)
16. Total amount of investment on force account (US$) 17. Total amount of investment on contracting (US$) IS. Average annual growth of contracting (%) 19. Total amount of sub-contracting cost (US$) 20. Ratio contracting per total amount invested (%) 21. Donors contribution per total for construction maintenance and rehabilitation (%)
22. Road general sufficiency (%) 23. Road user sufficiency (_%) 24. Fuel taxes per fuel prices (?6)
25. Road transport user charges (US$) >
26. Budget allocation for road transport sub-sector (percentage of total budget)
27. Percentage of spare parts in operating costs by classes of. .
vehicle (%) .
28. Current road transport prices or tariff for major routes, in particular for international road (US$ per tkm and pkin)
5. Number of available wagons by type 6. Number of available wagons by cargo capacity 7. Number of available wagons by capacity 8. Number of available wagons by by age 9. Number of available coaches by lype 10. Number of available coaches by capacity 11. Number of available coaches by seating capacity*
12. Number of available coaches by age 13. Total Number of staff
3.2 Traffic Statistics 1. Total freight ton kilometer (tkm) 2. Total passenger kilomettcr (pkm) 3. Total seat kilometer (skm)
4. Average gross trailing load of freight trains (tons) 5. Average net load of freight trains (Ions)
3.3 Performance indicators 1. Traffic units per km of railway route (TU/km) 2. Passenger train occupancy ratio (%).
3. Locomotive availability (%)
4. Locomotive reliability-km between failure (KM) 5. Locomotive utilization factor
(km/day)
6. Annual kilometrage per available locomotive (km)
7. Locomotive total output utilization factor (tkm/loc x hrs)
8. Locomotive productivity (TU/loc x hrs)
9. Wagon availability (%)
10. Wagon turnaround-number of days between two successive loadings (days) 11. Annual kilometrage per available wagons (km) 12. Average wagon load (tons)
13.Empty back haul factor (%)
14. Wagon productivity indicator (tkm/total capacity of the fleet) 15. Coach availability (%)
16. Annual kilometrage per available coach (km) . 17. Traffic units per staff number
3.4 Financial Statistics 1. Passenger revenue as total of revenue (%) 2. Freight revenue per ton km (US$) 3. Passenger revenue per seat km (US$) 4. Revenue per passenger km (US$) 5. Working ratio (%)
6. Operating ratio (%) 7. Operating income (%)
8. Net income (US$) .'
9. Return on Asset (%)
Transport Statistics year book Transport Statistics year book
Transport Statistics year book
Transport Statistics year book
i
4.2 Traffic statistics
2. Capacity of aircraft by type
3. Number of aircraft by use (passenger, convertible, all-flight and others not used for commercial air transport)
4. Air carrier personnel by type of job (flight personnel, other personnel).
5. Number of aircraft accidents, fatalities and injuries
a) AIRLINES
1. Passenger, freight and mail traffic by type of service
- International scheduled airlines - Domestic scheduled airlines - International and domestic non- scheduled operators
b) AIRPORT
1. Airport traffic by type of aircraft movements (international take-off or landing, domestic take-off or landing)
2. Airport traffic by type of service
- Passenger traffic embarked, disembarked and direct transit
- Freight traffic (loaded, unloaded) -Mail (loaded, unloaded)
3. Airport traffic by type of journey or movement
- International journey or shipment - Domestic journey or shipment
"
« - » «
„ .. ,, .„ „
■ .
i
4.4,Financial Statistics
1. Passenger load factor 2. Weight load factor
3. Aircraft utilization (hqurs flown per aircraft)
4. Tonne kilometer performed (TKP) per employee 5. Tonne kilometer performed (TKP) per flight crew member
6. Block hours per crew member
7. Number of passengers carried per crew member Average stage length ft.) AIRPORT
1. Passenger per employee
a) AIRLINE
1 Yields per tonne kilometer performed (US$)
2. Operating cost per tonne kilometer available (US$) 3. Expenditureper passenger (US$).
4. Income per passenger (US$)
Transport Statistics year book - " - ., n
M „ .
.. . ,. n
-
i. It i .. ti
1. Income per passenger (US$) 2. Expenditure per passenger 3. Trading profit (operating revenue less operating expenditure) per passenger
4. Aeronautical income per
passenger
5. Non-aeronautical income per
passenger
6. Income per employee 7. Value added per employee
8. Capital expenditure per
passenger
9. Net assets per employee
Transport Statistics year book
■5.1 Inventory statistics
5.2 Traffic statistics
5.3 Performance indicators
5.4 financial Statistics
1. Number of ships operated
1. Total quantity of goods yearly carried by the fleet operated
1. General ship turn-round in day/ship
2. Specific ship turn-round in day/ship by type of fleet yearly operated by shipping company 3. General physical ship productivity in tons 4. Specific physical productivity of category of merchant fleet
1. Financial productivity of merchant fleet
2. General ship operating cost per quantity of good operated
- /
3. Specific ship operating cost per quantity of good operated
4. Shipping company yearly financial performance indicator (yearly cast flow of the company)
Transport Statistics year book
■ - -
...
5.5 Inventory statistics
5.6 Traffic statistics
1. Number of ports 2. Number of berths by port
3. Number of vessels by type (general cargo, container carriers, dry-bulk and liquid- bulk)
4. Number of ship calls by type of vessel (general cargo container carriers, dry-bulk and liquid-bulk)
5. Rate of frequency (calls per ship)
6. Number of calls by type of vessel (General cargo, container carriers, dry-bulk and liquid- bulk)
1. Total outbound traffic (tons)
2. Transit traffic out of the tolal outbound traffic (tons) 3. Total transhipment (tons) 4. Total transit (tons) 5. Total container traffic - domestic + transit + transhipped (TEUs)
Transport Statistics year book
■■
" .
'•
i'
5.8 Financial Statistics
,\
2. Average waiting time in berth before berthing (hours)
3. Average waiting time in berth after berthing (hours)
4. Average time at berth (hrs or days)
5. Average tons loaded per ship in berth per day (tons/ship/day) 6. Average tons unloaded per ship in berth per day (tons/ship/day)
7. Handling performance per working hour (tons/gang hour)
8. Tonnage handled per linear meter of quay (tons/meter) 9. Average daily handling performance by berth for containers (TEUs per ship in berth)
10. Degree of eontainerization
<*)
11. Average dwell time for container (days)
1. Working ratio (%) 2. Operating ratio {%) 3. Operating income (US$) 4. Net income.(US$) 5. Return on asset (%)
„ " ,. „
„ . n „
-
„ „
it n .1 ii
■■ '