# Get the files from the PO website use Geo::Postcodes::JP::Update /update_files/; # Parse the files use Geo::Postcodes::JP::Process /read_ken_all_csv read_jigyosyo_csv/; # Searching things use Geo::Postcodes::JP qw/postcode_to_address address_to_postcode/;
Check the postcode download site for updates to the files.
Download the updated postcodes.
Parse the KEN_ALL.CSV and jigyosyo.csv files; eliminate the problems from the files. Change the "sonota" entries and "ika ni nai baai" entries somehow. Turn the multiline entries into single entries. Handle the duplicate cases. Handle the cases of business postcodes which don't correspond to entries in the main file (have a different address which is not in the address file). If possible convert the inconsistent "(kabu)" and "(kabushikigaisha)" entries. Convert the half-width katakana entries into standard kana (use MeCab module?).
Put the postcode data into a SQLite database or other handy format.
Offer searches of the postcode data.
As is well-known, the Japanese postal code data files as supplied by the post office have some problems. A major goal of the module is to take the "bad" file of the post office and turn it into a "clean" file of some kind.
The output file might be .CSV or SQlite, or maybe both kinds of file.