|
The software for analysing the export data for the purposes of ECDB has been developed by the Centre for Development of Advanced Computing (CDAC, formerly known as NCST), which is a Government of India organisation under the Ministry of Information and Technology.
The software has been designed for data analysis by clustering the raw data under predetermined fields. The clustering mechanism makes use of the description of goods (key words), HS/ITC Code, Unit Quantity Code (UQC), country of destination, DBK Serial Number, DEPB Schedule number, DFRC Schedule number, etc. The software selects the key words for the purpose of data analysis based on their repeated occurrence in identified HS/ITC Codes.
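The key word selection described above can be illustrated with a minimal sketch. It is not the CDAC implementation: the function name `select_keywords`, the stop-word list, the `min_count` threshold and the sample records are all assumptions made for illustration. It simply counts how often each word of the goods description recurs under a given HS/ITC Code and keeps the words that occur repeatedly.

```python
import re
from collections import Counter, defaultdict

# Hypothetical stop-word list; the real software's rules are not described in the text.
STOPWORDS = {"of", "and", "the", "for", "with", "in"}

def select_keywords(records, min_count=2):
    """Return {hs_code: [key words]} for words that recur in goods descriptions.

    `records` is an iterable of (hs_code, description) pairs.
    `min_count` is an assumed threshold for "repeated occurrence".
    """
    counts = defaultdict(Counter)
    for hs_code, description in records:
        # Count each word once per record so one long description does not dominate.
        words = set(re.findall(r"[a-z][a-z\-]+", description.lower())) - STOPWORDS
        counts[hs_code].update(words)
    return {
        hs_code: sorted(word for word, n in counter.items() if n >= min_count)
        for hs_code, counter in counts.items()
    }

# Illustrative sample records only, not actual ECDB data.
records = [
    ("61091000", "Cotton knitted T-shirts"),
    ("61091000", "Mens cotton T-shirts"),
    ("61091000", "Cotton vests"),
    ("42031010", "Leather jackets"),
]
keywords = select_keywords(records)
print(keywords)
```

In this sketch a word qualifies as a key word once it appears in at least two shipping-bill descriptions under the same code; the other clustering fields (UQC, country of destination, schedule numbers) could be added to the grouping key in the same way.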
The software provides adequate flexibility to choose the commodities that should be subjected to analysis, based on one or more of the parameters described above. On the basis of the identified key words, the software runs an analysis to calculate the weighted average value per unit and the standard deviation from that weighted average across the consignments in a cluster. The software is also capable of marking as outliers of that cluster those cases which fall outside the sum of the weighted average and the standard deviation. The software also allows one to choose the frequency of the data analysis based on the date of the shipping bills or the date of the Let Export Order.
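The calculation described above can be sketched as follows. This is an illustration under stated assumptions, not the actual software: the function name `flag_outliers` and the sample figures are hypothetical, the weights are assumed to be the consignment quantities, and "falling outside the sum of weighted average and standard deviation" is read as a unit value greater than the weighted average plus one standard deviation.

```python
import math

def flag_outliers(consignments):
    """Flag consignments whose unit value exceeds weighted average + standard deviation.

    `consignments` is a list of (quantity, unit_value) pairs for one cluster.
    Quantity is used as the weight (an assumption; the source does not name the weight).
    Returns (weighted_average, standard_deviation, flags).
    """
    total_qty = sum(q for q, _ in consignments)
    # Weighted average value per unit: total value divided by total quantity.
    mean = sum(q * u for q, u in consignments) / total_qty
    # Quantity-weighted standard deviation around the weighted average.
    variance = sum(q * (u - mean) ** 2 for q, u in consignments) / total_qty
    sd = math.sqrt(variance)
    threshold = mean + sd
    flags = [u > threshold for _, u in consignments]
    return mean, sd, flags

# Illustrative cluster: (quantity, declared unit value). Not actual export data.
cluster = [(100, 10.0), (200, 11.0), (150, 10.5), (50, 30.0)]
mean, sd, outliers = flag_outliers(cluster)
print(f"weighted average = {mean:.2f}, standard deviation = {sd:.2f}")
print(outliers)
```

With these sample figures the weighted average is 12.55 per unit, and only the last consignment, declared at 30.0 per unit, lies above the threshold and would be marked as an outlier of the cluster.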
The raw data are presently received from 21 EDI Stations and consist of approximately 1.5 lakh records per week (about 60 MB). While it would be possible for the software to analyse the entire data, it is proposed to limit the data analysis to those commodities which are sensitive to overvaluation, so that the results are more meaningful and specifically targeted at overvaluation of sensitive goods.
|