WO2016051810A1 - Device, method, and program for generating geographical space information - Google Patents

Device, method, and program for generating geographical space information

Info

Publication number
WO2016051810A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
geospatial
facility
geospatial information
original
Prior art date
Application number
PCT/JP2015/005044
Other languages
French (fr)
Japanese (ja)
Inventor
石川 豊
Original Assignee
株式会社ナイトレイ
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社ナイトレイ
Priority to JP2016551561A (JPWO2016051810A1)
Publication of WO2016051810A1

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00: Commerce
    • G06Q30/02: Marketing; Price estimation or determination; Fundraising
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present invention relates to an apparatus, method, and program for generating geospatial information, and more particularly to an apparatus, method, and program for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site.
  • Twitter posts have exceeded 2 billion a month in Japan alone.
  • the contents include conversations with friends, tweets, information sharing such as news, meals, and event participation.
  • posts to the SNS contain information closely related to the place the user is visiting and the like, and can be regarded as geospatial information.
  • geospatial information means information associated with a specific point or area in space.
  • the "information distribution site” widely includes sites for distributing information on the Internet, such as search sites, as well as SNS.
  • the present invention has been made in view of such problems, and an object thereof is to provide an apparatus, method, and program for efficiently generating geospatial information by associating a specific point or area in space with original information available on an information distribution site.
  • a first aspect of the present invention is a geospatial information generation apparatus for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site.
  • the geospatial information generation apparatus comprises a collection unit that collects the original information, and an assigning unit that assigns facility information related to the original information to the original information to generate geospatial information.
  • the assigning unit assigns the facility information based on a URL included in the original information.
  • the collecting unit collects only the original information including the URL.
  • the collecting unit collects the original information by making a request that passes the character string “http:” or “https:” to an API of the information distribution site.
  • the assigning unit scrapes the source code of the website displayed at the URL to assign the facility information.
  • the assigning unit determines the web service that provides the original information to the information distribution site before the scraping.
  • the facility information is a facility name.
  • the latitude and longitude are acquired, based on the facility name, by referring to a facility database that holds the correspondence between facility names and latitudes and longitudes.
  • An eighth aspect of the present invention is characterized in that, in the seventh aspect, the geospatial information includes an address obtained by reverse geocoding the latitude and longitude.
  • the facility database is referred to after its reference range has been limited, based on a part of the original information, to a range estimated to be the area where the facility having the facility name is located.
  • the facility information further includes a facility type.
  • the geospatial information includes a facility name and a facility type or a facility name and a latitude and longitude.
  • the twelfth aspect of the present invention is characterized in that, in any one of the first to eleventh aspects, the information distribution site is an SNS.
  • the collection and the application are performed in real time.
  • the apparatus further includes a transmission unit that transmits the geospatial information to a mobile terminal, the transmission unit transmitting the geospatial information about facilities within a predetermined distance from the current position of the mobile terminal.
  • the geospatial information, past or present or both, associated with a specific point or area is aggregated to calculate a score.
  • a sixteenth aspect of the present invention is characterized in that, in the fifteenth aspect, the score is calculated according to a time zone or a time.
  • the score is calculated by aggregating the amount of the geospatial information associated with the specific point or area, weighted according to whether or not each piece of geospatial information includes a photograph.
  • the score is calculated by aggregating the amount of the geospatial information associated with the specific point or area, weighted according to the language of the character string included in each piece of geospatial information.
  • a nineteenth aspect of the present invention is a geospatial information generation method for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site, the method comprising the steps of collecting the original information, and assigning facility information related to the original information to the original information to generate geospatial information.
  • a twentieth aspect of the present invention is a geospatial information generation program for causing a computer to execute a method for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site.
  • the method comprises the steps of collecting the original information, and assigning facility information related to the original information to the original information to generate geospatial information.
  • according to one aspect of the present invention, facility information related to the original information is assigned to the original information available on an information distribution site, based on the URL included in the original information.
  • as a result, highly reliable facility information such as a facility name is added to the original information, and geospatial information associated with a specific point or area in space can be generated efficiently.
  • FIG. 1 is a schematic view of a geospatial information generation apparatus according to the present invention. FIG. 2 is a diagram showing an example of the original information collected from an information distribution site. FIG. 3 is a diagram showing an example of the response data from the Twitter API.
  • the geospatial information generation device is configured by executing a program for realizing each process performed by the device on a computer, and the computer is not limited to a single computer.
  • FIG. 1 shows a geospatial information generation apparatus according to the present invention.
  • the geospatial information generation apparatus 100 includes a collection unit 110 that collects original information available on an information distribution site, and an assigning unit 120 that generates geospatial information by assigning facility information related to the original information to the original information.
  • the original information collected by the collection unit 110 from an information distribution site such as Twitter is displayed as shown in FIG. 2, for example.
  • the source information 200 includes a user ID 201, a posting time 202, a place name 203, a URL 204, an area 205, a word of mouth 206, and the like.
  • the collection can be performed, when the information distribution site provides an API for keyword search, by sending the API a request with a character string that characterizes URLs. In this way, only the original information including a URL is collected. By giving "http:", "https:", etc. to the API as the character string, the original information including a URL can be collected efficiently.
  • according to the inventors' investigation, only about 5% of the original information available on Twitter, that is, of the posts, includes a URL. In one embodiment of the present invention, most of the post data is therefore actively excluded from the analysis, and only the posts from which geospatial information can be generated are selected.
  • the response from the API may include original information with which facility information cannot be associated, and the collected original information (response) data can be analyzed in order to exclude it.
  • the source tag can be analyzed to limit the data to posts that include a URL issued by a predetermined web service (foursquare, instagram, etc.), or the text tag can be analyzed to filter on strings such as “(one half-width space)htt(p)” or “(one full-width space)htt(p)” (see FIG. 3A).
  • the assignment unit 120 can assign facility information related to the original information based on the URL 204 of the original information 200.
  • the giving unit 120 can access the URL 204 and scrape (web scraping) the source code of the web site displayed by the URL 204 to extract the facility information.
  • the original information 200 displays the place name 203, and the facility name can be determined from it, but the place name may be abbreviated, so its reliability as the correct facility name is not necessarily high. In addition, the place name 203 may not be displayed at all. In one embodiment of the present invention, more reliable facility information such as a facility name can be obtained by following the URL 204.
  • the collection in the collection unit 110 and the assignment in the assignment unit 120 can be performed in real time.
  • Some information distribution sites generate the original information in seconds, and the need for such information also occurs in a short period of time and disappears, so the freshness or immediacy of the information is extremely important. Therefore, real-time processing that repeats collection and assignment in seconds is required. For example, there is a need for real-time processing of information on disasters, disaster prevention, etc.
  • the processing speed can be increased by multiplexing request sources to the API.
  • the geospatial information generation device 100 may further include a transmitting unit 130 that transmits the generated geospatial information to the mobile terminal.
  • the transmission unit 130 can transmit geospatial information on facilities within a predetermined distance from the current position of the mobile terminal.
  • the mobile terminal can obtain information closely related to its current position by browsing the received geospatial information or displaying it on a map. For example, if the original information includes word-of-mouth information, the geospatial information can be applied to car navigation systems, navigation services for pedestrians and bicycles, regional information services, etc., which can then readily inform the user not merely of maps and routes but of facilities that are attracting attention in the surrounding area.
  • in the source code to be scraped, the facility information associated with the original information, specifically the facility name, the latitude and longitude, the facility type, etc., is described in an analyzable form that depends on the web service providing the original information to the information distribution site. Since the form of the description may differ from one web service to another, the web service can be determined first and the scraping logic corresponding to that web service then selected. Furthermore, even when the original information comes from the same web service, the way the source code is written may differ depending on the specific content of the original information, for example whether or not an SNS post contains an image. Scraping logic corresponding to such specific content can also be prepared and selected.
  • the web service can be determined by referring to a correspondence table that holds the correspondence between character strings and web services, based on the character string included in the source tag of the response data collected by the collection unit 110 (FIG. 3(A)).
  • the latitude and longitude may or may not be determinable by scraping the source code. If they cannot be determined, they can also be determined, based on the determined facility name, by referring to a facility database that holds the correspondence between facility names and latitudes and longitudes. In this case, the determination accuracy can be enhanced by first limiting the reference range of the facility database to a range estimated to be the area where the facility is located, based on a part of the original information (for example, the area 205 such as an administrative division, or the latitude and longitude included in the response data), and then referring to the database with the facility name. This prevents, for example, a match with a facility that has the same name but is located far away.
  • the area 205 is described in, for example, the response data of the original information as shown in FIG.
  • if the facility type cannot be determined, it can also be determined, based on the determined facility name, by referring to a facility type database that holds the correspondence between facility names and facility types.
  • the facility type can also be determined by sending the determined facility name in a request to an API provided by a web service, such as an information site about restaurants, tourist facilities, etc., and analyzing the response data. In this case, the determination accuracy can be enhanced by supplementing the facility name based on a part of the original information before making the request.
  • the address can be obtained by converting the latitude and longitude (also called "reverse geocoding").
  • all or part of the geospatial information from the past to the present can be aggregated to calculate various scores such as the degree of congestion, popularity, topicality, attractiveness, and risk.
  • when a station name is included in the facility name, such original information can be excluded from the evaluation of attractiveness. This avoids evaluating a station used by an average of several hundred thousand passengers per day as a highly attractive facility when it is merely crowded.
  • the degree of risk can be evaluated by weighting based on whether or not the character string included in the post includes a predetermined keyword related to crisis management. Such various scores are expected to be used in various situations, such as marketing of private enterprises, crisis management in public organizations, and provision of popular information and trendy information to general consumers, tourists, etc.
  • a score for a specific cultural area can also be calculated by weighting according to the language of the character strings included in the geospatial information and then aggregating.
  • post data to the SNS associated with an area can be filtered to only the post data that includes photos and contains Chinese text, which makes it possible to provide rankings of tourist spots popular with Chinese tourists.
  • the fact that such behavioral tendencies can be analyzed without advance preparation such as questionnaires greatly expands the applications of geospatial information.
  • each record can include facility name, facility type, latitude and longitude, address, etc. in addition to the original information. Furthermore, you may add plant
  • the database can further include a user ID, a post ID, a post text, a post image, a date, and the like. Based on these pieces of information, various special purpose databases can be constructed.
  • as a “facility database”, a database composed of facility ID, facility name, facility type, latitude and longitude, and address can be constructed. For each facility, scores such as popularity may be calculated.
  • as a “user database”, a database composed of the attributes of each user can be constructed by determining the residence area, work area, gender, occupation, age, etc. of the user based on the user ID. If a user database is constructed, the degree of attraction for a specific cultural area or place of origin can be assessed, for example, by filtering on the residence area of the originator of the original information instead of by linguistic analysis of the character strings contained in the original information.
  • as an “area feature database”, a database composed of the following items can be constructed: area; time or time zone; weekday or holiday; entertainment index; eating and drinking index; night index; travel index; accommodation index; outdoor index; female index; male index; traveler index; resident index; foreigner index; etc.
  • For each area on the earth, the composite features of the place can be calculated. For example, by calculating the entertainment index at around 3 p.m. on holidays for the area within a radius of 1 km around Shibuya Station, the tendency of consumers' entertainment behavior in that area at that time can be expressed.
  • the time or time zone and whether the day is a weekday or a holiday can be obtained from the posting time 202 and the like included in the original information 200.
  • the entertainment index can be obtained by multiplying consumers' daily activity related to entertainment facilities by user characteristics and facility characteristics, using the geospatial information, the “user database”, and the “facility database”. The same applies to the other indices.
  • the features of a person (silhous information) are calculated automatically by combining the indices of the area feature database.
  • based on the latitude and longitude and the time or time zone included in the movement history, the indices that take a high value for the specific area, time or time zone, and weekday or holiday are read from the area feature database, and the person's features are thereby identified.
  • 100 geospatial information generation device; 110 collection unit; 120 assigning unit; 130 transmission unit; 200 original information; 201 user ID; 202 posting time; 203 place name; 204 URL; 205 area

Abstract

 The present invention provides a device for correlating a specific place or district in a space with original information obtainable at an information distribution site, and efficiently generating geographical space information. A geographical space information generation device (100) is provided with a collection unit (110) for collecting original information obtainable at an information distribution site and an imparting unit (120) for imparting facility information associated with the original information to the original information and generating geographical space information. The imparting unit (120) is able to impart the facility information associated with the original information on the basis of the URL (204) of the original information. The imparting unit is able to access the URL (204), scrape the source code of the website indicated by the URL (204), and extract the facility information. Although a place name (203) is indicated in the original information (200), the place name is sometimes abbreviated and the reliability of whether this is a correct facility name is not high. In some cases, the place name (203) is not indicated at all.

Description

Apparatus, method, and program for generating geospatial information
 The present invention relates to an apparatus, method, and program for generating geospatial information, and more particularly to an apparatus, method, and program for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site.
 With the spread of mobile terminals such as smartphones and tablets running iOS (registered trademark) or Android (registered trademark), social networking services (SNS) typified by Facebook (registered trademark) and Twitter (registered trademark) have spread rapidly in recent years worldwide, including in Japan.
 The spread of both has made it possible to share, on the spot, various experiences away from home, such as what one felt or what one discovered. By using an SNS on a smartphone while out, a user can casually post experiences that are only possible at that place, in the form of text, photos, and the like, and share them with friends and others.
 Such sharing of experiences has increased significantly in recent years; for example, the number of Twitter posts exceeds 2 billion per month in Japan alone. Common content includes conversations with friends, tweets, sharing of information such as news, meals, and event participation.
 Posts to an SNS thus contain information closely related to the place the user is visiting and the like, and can be regarded as geospatial information. Here, "geospatial information" means information associated with a specific point or area in space. A typical use of SNS posts is to analyze which places are attracting attention by searching the posted text data by place and the like.
 In practice, however, it is difficult to obtain valuable information through text search, and SNS posts are not being fully utilized. This problem is not limited to SNS posts; the same can be said of any information available on an information distribution site (also referred to as "original information") that may be associated with a specific point or area. Here, an "information distribution site" broadly includes, in addition to SNS, sites for distributing information on the Internet, such as search sites.
 The present invention has been made in view of these problems, and its object is to provide an apparatus, method, and program for efficiently generating geospatial information by associating a specific point or area in space with original information available on an information distribution site.
 In order to achieve this object, a first aspect of the present invention is a geospatial information generation apparatus for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site, the apparatus comprising: a collection unit that collects the original information; and an assigning unit that assigns facility information related to the original information to the original information to generate geospatial information, wherein the assigning unit assigns the facility information based on a URL included in the original information.
 In a second aspect of the present invention according to the first aspect, the collection unit collects only the original information including a URL.
 In a third aspect of the present invention according to the second aspect, the collection unit collects the original information by making a request that passes the character string "http:" or "https:" to an API of the information distribution site.
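As an illustration of the second and third aspects, the following minimal Python sketch keeps only posts that contain a URL. The API of the information distribution site is not specified in this text, so the sketch applies the "http:" / "https:" criterion locally to posts that have already been collected. The post structure (a dict with a `text` field), the sample data, and the name `keep_posts_with_url` are assumptions made for this example.

```python
import re

# Assumption: each collected post is a dict with a "text" field. This is not
# the response format of any particular API.
URL_MARKER = re.compile(r"https?:")


def keep_posts_with_url(posts):
    """Return only the posts whose text contains "http:" or "https:"."""
    return [p for p in posts if URL_MARKER.search(p.get("text", ""))]


# Hypothetical sample data for illustration only.
sample_posts = [
    {"id": 1, "text": "Lunch at the cafe https://example.com/c/123"},
    {"id": 2, "text": "Good morning"},
    {"id": 3, "text": "I'm at the museum http://example.com/v/9"},
]
posts_with_url = keep_posts_with_url(sample_posts)
```

In a real deployment the same string would instead be passed as the search keyword in the API request, so that posts without URLs are never downloaded at all.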
 In a fourth aspect of the present invention according to any one of the first to third aspects, the assigning unit scrapes the source code of the website displayed at the URL to assign the facility information.
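The scraping step of the fourth aspect might look roughly like the following sketch, which uses Python's standard `html.parser` module to pull facility information out of page source. The meta tag names (`og:title`, `example:latitude`, `example:longitude`) and the sample markup are hypothetical; the actual form of the description depends on the web service and is not given in this text. Fetching the page from the URL is omitted.

```python
from html.parser import HTMLParser


class MetaTagScraper(HTMLParser):
    """Collects <meta property="..." content="..."> pairs from page source."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if "property" in attr and "content" in attr:
            self.meta[attr["property"]] = attr["content"]


def scrape_facility_info(page_source):
    """Extract facility information from HTML source (hypothetical tags)."""
    scraper = MetaTagScraper()
    scraper.feed(page_source)
    info = {"facility_name": scraper.meta.get("og:title")}
    lat = scraper.meta.get("example:latitude")
    lon = scraper.meta.get("example:longitude")
    if lat is not None and lon is not None:
        info["latitude"] = float(lat)
        info["longitude"] = float(lon)
    return info


# Hypothetical page source; a real page would be fetched from the URL.
SAMPLE_SOURCE = """
<html><head>
<meta property="og:title" content="Example Cafe">
<meta property="example:latitude" content="35.6595">
<meta property="example:longitude" content="139.7005">
</head><body></body></html>
"""
facility_info = scrape_facility_info(SAMPLE_SOURCE)
```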
 In a fifth aspect of the present invention according to the fourth aspect, the assigning unit determines, before the scraping, the web service that provides the original information to the information distribution site.
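One way to picture the fifth aspect is a correspondence table from character strings to web services, followed by a lookup of the scraping logic for the service found. The sketch below assumes that the service name appears as a substring of the source tag. The two scraping functions and the markup they look for are purely illustrative and do not reflect the actual pages of foursquare or instagram.

```python
import re

# Hypothetical correspondence table: substring of the source tag -> service.
SOURCE_STRING_TO_SERVICE = {
    "foursquare": "foursquare",
    "instagram": "instagram",
}


def determine_web_service(source_tag):
    """Return the web service named in the source tag, or None if unknown."""
    lowered = source_tag.lower()
    for needle, service in SOURCE_STRING_TO_SERVICE.items():
        if needle in lowered:
            return service
    return None


def scrape_title_element(page_source):
    """Illustrative logic: take the facility name from <title>."""
    m = re.search(r"<title>(.*?)</title>", page_source, re.S)
    return m.group(1).strip() if m else None


def scrape_h1_element(page_source):
    """Illustrative logic: take the facility name from the first <h1>."""
    m = re.search(r"<h1[^>]*>(.*?)</h1>", page_source, re.S)
    return m.group(1).strip() if m else None


# Which logic belongs to which service is an assumption for this sketch.
SCRAPERS = {"foursquare": scrape_title_element, "instagram": scrape_h1_element}


def extract_facility_name(source_tag, page_source):
    """Determine the web service first, then apply its scraping logic."""
    scraper = SCRAPERS.get(determine_web_service(source_tag))
    return scraper(page_source) if scraper else None
```

Posts whose source tag matches no entry in the table yield `None`, which corresponds to excluding original information with which no facility information can be associated.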
 In a sixth aspect of the present invention according to any one of the first to fifth aspects, the facility information is a facility name.
 In a seventh aspect of the present invention according to the sixth aspect, the latitude and longitude are acquired, based on the facility name, by referring to a facility database that holds the correspondence between facility names and latitudes and longitudes.
 In an eighth aspect of the present invention according to the seventh aspect, the geospatial information includes an address obtained by reverse geocoding the latitude and longitude.
 In a ninth aspect of the present invention according to either the sixth or the seventh aspect, the facility database is referred to after its reference range has been limited, based on a part of the original information, to a range estimated to be the area where the facility having the facility name is located.
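The database lookup of the seventh aspect and the range restriction of the ninth aspect can be sketched as follows. The in-memory list standing in for the facility database, its field names, the sample records, and the choice to return `None` for an ambiguous match are all assumptions made for illustration.

```python
# Hypothetical facility database: two facilities share the same name.
FACILITY_DB = [
    {"name": "Central Park Cafe", "area": "Tokyo", "lat": 35.68, "lon": 139.76},
    {"name": "Central Park Cafe", "area": "Osaka", "lat": 34.69, "lon": 135.50},
    {"name": "Harbor Museum", "area": "Osaka", "lat": 34.65, "lon": 135.43},
]


def lookup_lat_lon(facility_name, area=None):
    """Return (lat, lon) for a facility name, optionally limited to an area.

    Returns None when the name is unknown or still ambiguous.
    """
    candidates = [r for r in FACILITY_DB if r["name"] == facility_name]
    if area is not None:
        # Limit the reference range to the estimated location area.
        candidates = [r for r in candidates if r["area"] == area]
    if len(candidates) != 1:
        return None
    return (candidates[0]["lat"], candidates[0]["lon"])
```

Without the area, "Central Park Cafe" matches two records and cannot be resolved; with the area taken from the original information, the lookup returns a single location and avoids picking the same-named facility far away.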
 In a tenth aspect of the present invention according to any one of the sixth to ninth aspects, the facility information further includes a facility type.
 In an eleventh aspect of the present invention according to any one of the sixth to tenth aspects, the geospatial information includes a facility name and a facility type, or a facility name and a latitude and longitude.
 In a twelfth aspect of the present invention according to any one of the first to eleventh aspects, the information distribution site is an SNS.
 In a thirteenth aspect of the present invention according to any one of the first to twelfth aspects, the collection and the assignment are performed in real time.
 In a fourteenth aspect of the present invention according to any one of the first to thirteenth aspects, the apparatus further comprises a transmission unit that transmits the geospatial information to a mobile terminal, the transmission unit transmitting the geospatial information about facilities within a predetermined distance from the current position of the mobile terminal.
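The distance filter of the fourteenth aspect can be sketched with the haversine great-circle formula, which is one common way to compute the distance between two latitude/longitude pairs. The text does not say which distance measure is used, so the formula, the record fields, and the sample coordinates below are assumptions.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def records_near(records, lat, lon, max_km):
    """Keep the geospatial records whose facility lies within max_km."""
    return [r for r in records
            if haversine_km(lat, lon, r["lat"], r["lon"]) <= max_km]


# Hypothetical geospatial records and terminal position.
records = [
    {"facility": "A", "lat": 35.6595, "lon": 139.7005},
    {"facility": "B", "lat": 35.6812, "lon": 139.7671},
]
nearby = records_near(records, lat=35.6580, lon=139.7016, max_km=1.0)
```

Here facility A is a few hundred metres from the terminal and is kept, while facility B is several kilometres away and is dropped.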
 In a fifteenth aspect of the present invention according to any one of the first to thirteenth aspects, the geospatial information, past or present or both, associated with a specific point or area is aggregated to calculate a score.
 In a sixteenth aspect of the present invention according to the fifteenth aspect, the score is calculated according to a time zone or a time period.
 In a seventeenth aspect of the present invention according to the fifteenth or sixteenth aspect, the score is calculated by aggregating the amount of the geospatial information associated with the specific point or area, weighted according to whether or not each piece of geospatial information includes a photograph.
 In an eighteenth aspect of the present invention according to the fifteenth or sixteenth aspect, the score is calculated by aggregating the amount of the geospatial information associated with the specific point or area, weighted according to the language of the character string included in each piece of geospatial information.
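The weighted aggregation of the seventeenth and eighteenth aspects can be sketched as a weighted count of records. The weight values, the record fields `has_photo` and `language`, and the decision to multiply the two weights in a single function are assumptions for illustration; the text presents the photo weighting and the language weighting as separate aspects and gives no concrete weights.

```python
def score(records, photo_weight=2.0, language_weights=None):
    """Weighted count of the geospatial records for one point or area.

    Each record counts 1.0, multiplied by photo_weight when it includes a
    photograph and by the weight configured for its language (default 1.0).
    """
    language_weights = language_weights or {}
    total = 0.0
    for r in records:
        weight = photo_weight if r.get("has_photo") else 1.0
        weight *= language_weights.get(r.get("language"), 1.0)
        total += weight
    return total


# Hypothetical records associated with one area.
area_records = [
    {"has_photo": True, "language": "ja"},
    {"has_photo": False, "language": "zh"},
    {"has_photo": True, "language": "zh"},
]
photo_score = score(area_records)
zh_score = score(area_records, photo_weight=1.0,
                 language_weights={"zh": 3.0, "ja": 0.0})
```

Setting a language weight to zero acts as a filter, so the same function can produce a score restricted to posts in one language.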
 また、本発明の第19の態様は、情報流通サイト上で入手可能な元情報に、空間上の特定の地点又は区域を関連づけて、地理空間情報を生成するための地理空間情報生成方法であって、前記元情報を収集するステップと、前記元情報に対し、前記元情報に関連する施設情報を付与して地理空間情報を生成するステップとを含むことを特徴とする。 A nineteenth aspect of the present invention is a geospatial information generation method for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site. And collecting the original information, and adding facility information related to the original information to the original information to generate geospatial information.
 また、本発明の第20の態様は、コンピュータに、情報流通サイト上で入手可能な元情報に、空間上の特定の地点又は区域を関連づけて、地理空間情報を生成するための方法を実行させるための地理空間情報生成プログラムであって、前記方法は、前記元情報を収集するステップと、前記元情報に対し、前記元情報に関連する施設情報を付与して地理空間情報を生成するステップとを含むことを特徴とする。 Also, a twentieth aspect of the present invention causes a computer to execute a method for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site. Program for generating geospatial information, the method comprising: collecting the original information; and adding facility information related to the original information to the original information to generate geospatial information. It is characterized by including.
According to one aspect of the present invention, facility information related to original information available on an information distribution site is added to that original information on the basis of a URL included in it. Highly reliable facility information such as a facility name is thereby added to the original information, and geospatial information associated with a specific point or area in space can be generated efficiently.
FIG. 1 is a schematic diagram of the geospatial information generation device according to the present invention.
FIG. 2 is a diagram showing an example of original information collected from an information distribution site.
FIG. 3 is a diagram showing an example of response data from the Twitter API.
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. The geospatial information generation device is implemented by executing, on a computer, a program that realizes each process performed by the device; the computer is not limited to a single computer.
(Overview of the Invention)
FIG. 1 shows the geospatial information generation device according to the present invention. The geospatial information generation device 100 includes a collection unit 110 that collects original information available on an information distribution site, and an assignment unit 120 that generates geospatial information by adding, to the original information, facility information related to it.
The original information that the collection unit 110 collects from an information distribution site such as Twitter is displayed, for example, as shown in FIG. 2. The original information 200 includes a user ID 201, a posting time 202, a place name 203, a URL 204, an area 205, a word-of-mouth comment 206, and the like. When the information distribution site provides a keyword search API, collection can be performed by sending the API a request containing a character string that characterizes URLs, so that only original information including a URL is collected. Giving the API character strings such as "http:" or "https:" allows original information including a URL to be collected efficiently. For example, according to the inventors' investigation, only about 5% of the original information available on Twitter, that is, of posts, includes a URL. In one embodiment of the present invention, most of the post data is therefore deliberately discarded and excluded from analysis, so that the posts from which geospatial information can be generated are properly selected. The response from the API may still contain original information with which facility information cannot be associated, and the collected original information (response data) can be analyzed to exclude such items. In the case of Twitter, the source tag can be analyzed to restrict the data to posts containing a URL issued by a predetermined web service (foursquare, instagram, etc.), and the text tag can be analyzed to filter for posts containing "(one half-width space)htt(p)", "(one full-width space)htt(p)", or the like (see FIG. 3(a)).
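The selection of response data described above can be sketched as follows. This is only a minimal sketch, assuming that each collected item is a dictionary with `source` and `text` fields corresponding to the tags of FIG. 3(a). The function names and the allow-list are hypothetical, and the sketch applies both checks together although either could be used on its own.

```python
# Hypothetical allow-list of web services whose URLs tend to lead to facility pages.
ALLOWED_SERVICES = ("foursquare", "instagram")

# A URL in the post body preceded by a half-width or a full-width space.
URL_MARKERS = (" htt", "\u3000htt")


def is_candidate(post: dict) -> bool:
    """Return True if the post may yield geospatial information."""
    source = post.get("source", "").lower()
    text = post.get("text", "")
    from_known_service = any(name in source for name in ALLOWED_SERVICES)
    has_url = any(marker in text for marker in URL_MARKERS)
    return from_known_service and has_url


def select_candidates(posts: list[dict]) -> list[dict]:
    """Discard posts that cannot be linked to a facility."""
    return [post for post in posts if is_candidate(post)]
```

Posts that fail either check are dropped before any URL is accessed, which keeps the later scraping step small.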
The assignment unit 120 can add facility information related to the original information on the basis of the URL 204 of the original information 200. The assignment unit 120 can access the URL 204 and scrape (web scraping) the source code of the website displayed at the URL 204 to extract the facility information. The original information 200 may display a place name 203, from which a facility name could be determined, but the place name may be abbreviated, so its reliability as an accurate facility name is not necessarily high. In some cases no place name 203 is displayed at all. In one embodiment of the present invention, following the URL 204 makes it possible to obtain more reliable facility information such as a facility name.
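As an illustration of the extraction step, the following sketch parses page source that has already been downloaded. It assumes, purely for illustration, that the page exposes the facility name and coordinates in `<meta property="..." content="...">` tags named `og:title`, `place:location:latitude`, and `place:location:longitude`. The actual markup differs from one web service to another, and the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collect <meta property="..." content="..."> pairs from a page."""

    def __init__(self) -> None:
        super().__init__()
        self.properties: dict[str, str] = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = attr_map.get("property")
        content = attr_map.get("content")
        if name and content:
            self.properties[name] = content


def extract_facility_info(page_source: str) -> dict:
    """Extract a facility name and, if present, coordinates from page source."""
    parser = MetaTagParser()
    parser.feed(page_source)
    props = parser.properties
    # The property names below are assumptions, not taken from the patent.
    info = {"name": props.get("og:title")}
    lat = props.get("place:location:latitude")
    lon = props.get("place:location:longitude")
    if lat is not None and lon is not None:
        info["lat"] = float(lat)
        info["lon"] = float(lon)
    return info
```

When the coordinates are absent, only the name is returned, which corresponds to the case in which the latitude and longitude must be looked up separately.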
The collection by the collection unit 110 and the assignment by the assignment unit 120 can be performed in real time. On some information distribution sites, original information is generated every second, and the need for such information also arises and fades within a short period, so the freshness or immediacy of the information is extremely important. Real-time processing that repeats collection and assignment at intervals of seconds is therefore required. One example is the need for real-time processing of information relating to disasters and disaster prevention. As for collection, processing can be sped up by, for example, multiplexing the sources of requests to the API.
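One possible reading of multiplexed collection is sketched below, assuming that each request source is wrapped in a callable that returns a list of posts carrying an `id` field. The callables, the field name, and the function name are hypothetical, and the patent does not specify how the multiplexing is implemented.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable


def collect_once(fetchers: Iterable[Callable[[], list[dict]]]) -> list[dict]:
    """Run all fetchers concurrently and merge the results by post ID."""
    fetchers = list(fetchers)
    if not fetchers:
        return []
    with ThreadPoolExecutor(max_workers=len(fetchers)) as pool:
        # pool.map keeps the order of the fetchers, so merging is deterministic.
        batches = list(pool.map(lambda fetch: fetch(), fetchers))
    merged: dict[str, dict] = {}
    for batch in batches:
        for post in batch:
            # Keep the first copy of a post returned by more than one source.
            merged.setdefault(post["id"], post)
    return list(merged.values())
```

Calling `collect_once` repeatedly at short intervals would give the repeated collection described above; scheduling and rate limiting are outside the sketch.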
The geospatial information generation device 100 can further include a transmission unit 130 that transmits the generated geospatial information to a mobile terminal. The transmission unit 130 can transmit geospatial information about facilities within a predetermined distance from the current position of the mobile terminal. On the mobile terminal, the received geospatial information can be browsed or displayed on a map, giving the user information closely tied to the current position. For example, when the original information includes word-of-mouth information, applying it to car navigation, navigation services for pedestrians and cyclists, local information services, and the like makes it possible to go beyond simple maps and route information and readily inform the user of facilities that are attracting attention around the destination or the current location.
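The selection of facilities within a predetermined distance can be sketched with the haversine great-circle formula. The patent does not name a distance measure, so the formula, the record layout with `lat` and `lon` fields, and the function names are assumptions made for illustration.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearby(records: list[dict], lat: float, lon: float, max_km: float) -> list[dict]:
    """Return the records whose facility lies within max_km of the terminal."""
    return [r for r in records if haversine_km(lat, lon, r["lat"], r["lon"]) <= max_km]
```

For large record sets a spatial index would normally replace the linear scan; the scan is kept here for brevity.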
(Assignment of Facility Information)
Depending on the web service that supplies the original information to the information distribution site, the source code to be scraped contains the facility information associated with the original information, specifically the facility name, latitude and longitude, facility type, and so on, in a form that can be parsed. Because the form of this description may differ from one web service to another, the web service can be determined first and the scraping logic corresponding to that web service then selected. Furthermore, even for original information supplied by the same web service, the way the source code is written may differ according to the specific content of the original information, for example whether a post to an SNS contains an image. Scraping logic corresponding to each kind of content can therefore be prepared in advance and selected.
When the information distribution site is Twitter, the web service can be determined from the character string contained in the source tag (see FIG. 3(a)) of the response data collected by the collection unit 110, by referring to a correspondence table that holds associations between character strings and web services.
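The determination of the web service and the choice of scraping logic can be sketched as two table lookups. The table contents, the placeholder scraping functions, and the use of an image flag as the second key are hypothetical and serve only to show the shape of the selection.

```python
from typing import Callable, Optional

# Hypothetical correspondence table: substring of the source tag -> web service.
SERVICE_TABLE = {"foursquare": "foursquare", "instagram": "instagram"}


def detect_service(source_tag: str) -> Optional[str]:
    """Determine the web service from the source tag of the response data."""
    lowered = source_tag.lower()
    for needle, service in SERVICE_TABLE.items():
        if needle in lowered:
            return service
    return None


def scrape_checkin(page_source: str) -> dict:
    """Placeholder for logic tailored to check-in pages without an image."""
    return {"logic": "checkin"}


def scrape_photo_post(page_source: str) -> dict:
    """Placeholder for logic tailored to pages that carry an image."""
    return {"logic": "photo_post"}


# (web service, post has an image) -> scraping logic
SCRAPERS: dict[tuple[str, bool], Callable[[str], dict]] = {
    ("foursquare", False): scrape_checkin,
    ("foursquare", True): scrape_photo_post,
    ("instagram", True): scrape_photo_post,
}


def select_scraper(source_tag: str, has_image: bool) -> Optional[Callable[[str], dict]]:
    """Pick the scraping logic for a post, or None if no logic is registered."""
    service = detect_service(source_tag)
    if service is None:
        return None
    return SCRAPERS.get((service, has_image))
```

Keeping the logic in a table means that support for a new web service or content variant is added by registering one more entry.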
The latitude and longitude can sometimes be determined by scraping the source code and sometimes cannot. When they cannot, they can be determined from the determined facility name by referring to a facility database that holds associations between facility names and latitude and longitude. In doing so, determination accuracy can be improved by first limiting the reference range of the facility database to the range estimated to be the area where the facility is located, on the basis of part of the original information, for example the area 205 such as an administrative district or the latitude and longitude included in the response data, and then looking up the facility name. This prevents, for example, a post from being matched to a facility that has the same name but is located far away. Limiting the search to the estimated area beforehand also reduces the number of facility-name matching operations, which must cope with spelling variations. In the response data of the original information, the area 205 is described, for example, as shown in FIG. 3(b).
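The area-limited lookup can be sketched as follows. The sample rows, the `area` field, and the normalization rule (Unicode NFKC folding, case folding, and removal of spaces) are assumptions chosen to illustrate one simple way of absorbing spelling variations. They are not taken from the patent.

```python
import unicodedata
from typing import Optional

# Hypothetical facility database rows with approximate coordinates.
FACILITY_DB = [
    {"name": "Cafe Example", "area": "Shibuya-ku, Tokyo", "lat": 35.6580, "lon": 139.7016},
    {"name": "Cafe Example", "area": "Kita-ku, Osaka", "lat": 34.7025, "lon": 135.4959},
]


def normalize(name: str) -> str:
    """Fold full-width and half-width variants, case, and spacing."""
    folded = unicodedata.normalize("NFKC", name).casefold()
    return "".join(folded.split())


def lookup_coordinates(name: str, area: str, db=FACILITY_DB) -> Optional[tuple[float, float]]:
    """Look up a facility name after narrowing the database to the estimated area."""
    candidates = [row for row in db if row["area"] == area]  # narrow first
    key = normalize(name)
    for row in candidates:  # name matching runs only on the narrowed set
        if normalize(row["name"]) == key:
            return row["lat"], row["lon"]
    return None
```

Because the area filter runs first, the two same-named facilities in the sample data are never confused with each other.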
When the facility type cannot be determined by scraping, it can be determined from the determined facility name by referring to a facility type database that holds associations between facility names and facility types. The facility type can also be determined by sending the determined facility name in a request to an API provided by a web service, such as an information site about restaurants or tourist facilities, and analyzing the response data. In doing so, determination accuracy can be improved by supplementing the facility name on the basis of part of the original information before making the lookup.
The address can be obtained by converting the latitude and longitude (also called "reverse geocoding").
(Analysis of Geospatial Information)
Because the geospatial information generated as described above identifies the point or area to which the original information is closely related, it can be put to a variety of uses.
For example, all or part of the geospatial information, from the past to the present, associated with a specific point or area can be aggregated to calculate various scores such as degree of congestion, popularity, topicality, attractiveness, and risk. Aggregating the amount of geospatial information associated with a specific facility makes it possible to evaluate how crowded the facility is at the time point, time slot, season, or period of interest. Because posts that include photographs tend to be perceived as more attractive, the attractiveness of a facility can be evaluated by aggregating not just the amount of geospatial information but a count weighted according to whether each piece of geospatial information includes a photograph. When a facility name contains "駅" (station), excluding such original information from the attractiveness evaluation avoids rating as highly attractive a station that is merely crowded because it serves an average of several hundred thousand passengers a day. The degree of risk can be evaluated by weighting according to whether the character strings in a post contain predetermined keywords relating to crisis management. Such scores are expected to be used in diverse settings, such as marketing by private companies, crisis management by public bodies, and the provision of popularity and trend information to general consumers and travelers.
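The photograph-weighted aggregation with station exclusion can be sketched as follows. The weight values, the record fields `facility_name` and `has_photo`, and the additional check for the English word "station" are assumptions; the patent states only that posts with photographs are weighted and that facility names containing "駅" are excluded.

```python
from collections import defaultdict

PHOTO_WEIGHT = 2.0  # hypothetical weight for a post that includes a photograph
PLAIN_WEIGHT = 1.0  # hypothetical weight for a post without one


def attractiveness_scores(records: list[dict]) -> dict[str, float]:
    """Aggregate weighted post counts per facility, skipping stations."""
    scores: dict[str, float] = defaultdict(float)
    for rec in records:
        name = rec["facility_name"]
        # Stations are crowded regardless of their appeal, so they are excluded.
        if "駅" in name or "station" in name.lower():
            continue
        scores[name] += PHOTO_WEIGHT if rec.get("has_photo") else PLAIN_WEIGHT
    return dict(scores)
```

A congestion score would use the same loop with all weights set to 1 and without the station check.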
As one example, scores targeted at a specific cultural sphere can be calculated by aggregating with weighting according to the language of the character strings included in the geospatial information. To inform Chinese tourists of facilities that are popular in a given region, for example, the SNS post data associated with that region can be filtered down to posts that include a photograph and contain Chinese character strings, which makes it possible to provide a ranking of popular facilities for Chinese tourists. That such analysis of behavioral tendencies becomes possible without advance preparation such as questionnaires greatly broadens the range of uses of geospatial information.
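The language-specific ranking can be sketched as a filter followed by a count. The sketch assumes that the language of each post has already been detected and stored in a `lang` field; language detection itself is outside the sketch, and the field and function names are hypothetical.

```python
from collections import Counter


def ranking_for_language(records: list[dict], lang: str, top_n: int = 3) -> list[tuple[str, int]]:
    """Rank facilities by the number of photo posts written in the given language."""
    counts = Counter(
        rec["facility_name"]
        for rec in records
        if rec.get("lang") == lang and rec.get("has_photo")
    )
    return counts.most_common(top_n)
```

Replacing the hard filter with per-language weights would give the weighted aggregation described in the first sentence of the paragraph above.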
(Various Databases)
A database can be built by accumulating geospatial information. In addition to the original information, each record can include a facility name, facility type, latitude and longitude, address, and so on. A facility ID that uniquely identifies each facility may also be added. When the original information is post data from an SNS, the database can further include a user ID, post ID, post text, post image, date, and the like. Various special-purpose databases can be built on the basis of this information.
As a "facility database", a database consisting of facility ID, facility name, facility type, latitude and longitude, and address can be built. A score such as popularity may be calculated in advance for each facility.
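One way to lay out the accumulated records and the facility database is sketched below with SQLite. The table names, column names, and column types are hypothetical; the patent lists only the kinds of items each database can contain.

```python
import sqlite3

# Hypothetical schema: one table for facilities, one for accumulated posts.
SCHEMA = """
CREATE TABLE facilities (
    facility_id   INTEGER PRIMARY KEY,
    name          TEXT NOT NULL,
    facility_type TEXT,
    lat           REAL,
    lon           REAL,
    address       TEXT
);
CREATE TABLE geospatial_records (
    post_id     TEXT PRIMARY KEY,
    user_id     TEXT,
    posted_at   TEXT,
    post_text   TEXT,
    has_image   INTEGER,
    facility_id INTEGER REFERENCES facilities(facility_id)
);
"""


def create_database(path: str = ":memory:") -> sqlite3.Connection:
    """Create the two tables in a new SQLite database."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def post_counts(conn: sqlite3.Connection) -> list[tuple[str, int]]:
    """Return the number of accumulated posts per facility."""
    rows = conn.execute(
        "SELECT f.name, COUNT(r.post_id) FROM facilities f "
        "LEFT JOIN geospatial_records r ON r.facility_id = f.facility_id "
        "GROUP BY f.facility_id ORDER BY f.facility_id"
    )
    return rows.fetchall()
```

The per-facility count returned by `post_counts` is the simplest form of a score that could be stored alongside each facility.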
As a "user database", a database consisting of the attributes of each user can be built by determining, on the basis of the user ID, the user's area of residence, work area, gender, occupation, age, and so on. Once a user database has been built, attractiveness for a specific cultural sphere or place of origin can be evaluated, for example, by filtering on the area of residence of the sender of the original information instead of analyzing the language of the character strings in the original information.
As an "area feature database", a database can be built consisting of area (zone), time or time slot, weekday or holiday, entertainment index, dining index, night index, travel index, accommodation index, outdoor index, female index, male index, traveler index, resident index, foreigner index, and so on. For each area on the earth, the composite features of that place can be calculated. For example, calculating the entertainment index for the area within a 1 km radius of Shibuya Station during the 3 p.m. hour on holidays expresses the tendency of consumers' entertainment behavior in that area at that time.
From the generated behavior information, the items associated with a given area are extracted; the time or time slot and the weekday or holiday distinction can be obtained from the posting time 202 and the like included in the original information 200. The entertainment index can be obtained by using the "geospatial information", the "user database", and the "facility database" to aggregate consumers' day-to-day behavior involving entertainment facilities, multiplied by user features and facility features. The other indexes are obtained in the same way.
For example, when a person's movement history contains a person ID, latitude and longitude, time or time slot, and so on, the features of the person (silhouette information) can be calculated automatically by combining the indexes of the area feature database. By referring to the area feature database on the basis of the latitude and longitude and the time or time slot in the movement history, the indexes that take high values for the particular area, time or time slot, and weekday or holiday are read out, revealing the features of the person.
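The derivation of silhouette information can be sketched as an average of the area indexes over the visits in a movement history. The sketch assumes that each visit has already been mapped from latitude and longitude to an area key, and the index values, the keys, and the use of a plain average are hypothetical; the patent says only that the indexes are combined.

```python
from collections import defaultdict

# Hypothetical area feature database: (area, hour, is_holiday) -> indexes.
AREA_FEATURES = {
    ("shibuya_1km", 15, True): {"entertainment": 0.9, "dining": 0.6, "outdoor": 0.1},
    ("odaiba_1km", 11, True): {"entertainment": 0.7, "dining": 0.4, "outdoor": 0.8},
}


def silhouette(history: list[dict], features=AREA_FEATURES) -> dict[str, float]:
    """Average the area indexes over the visits found in the feature database."""
    totals: dict[str, float] = defaultdict(float)
    matched = 0
    for visit in history:
        indexes = features.get((visit["area"], visit["hour"], visit["is_holiday"]))
        if indexes is None:
            continue  # no features recorded for this area and time
        matched += 1
        for name, value in indexes.items():
            totals[name] += value
    if matched == 0:
        return {}
    return {name: total / matched for name, total in totals.items()}
```

The index with the highest average can then be read as the dominant trait of the person, for example with `max(result, key=result.get)`.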
Once the indexes that characterize each area are known, such as the entertainment index for the area within a 1 km radius of Shibuya Station during the 3 p.m. hour on holidays, an effective indicator for identifying delivery targets in Internet advertising can be provided. Specifically, when advertisements are to be delivered to consumers using smartphones within a 1 km radius of Shibuya Station during the 3 p.m. hour on a holiday, the Internet advertising provider can refer on the spot to the various indexes for that area and time and deliver advertisements better suited to the consumers' situation. It is also possible to automatically calculate the areas and time slots best suited to shopping center opening plans, outdoor advertising, and advertisement delivery.
100 geospatial information generation device
110 collection unit
120 assignment unit
130 transmission unit
200 original information
201 user ID
202 posting time
203 place name
204 URL
205 area

Claims (20)

  1.  A geospatial information generation device for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site, the device comprising:
    a collection unit that collects the original information; and
    an assignment unit that generates geospatial information by adding, to the original information, facility information related to the original information,
    wherein the assignment unit adds the facility information on the basis of a URL included in the original information.
  2.  The geospatial information generation device according to claim 1, wherein the collection unit collects only original information that includes a URL.
  3.  The geospatial information generation device according to claim 2, wherein the collection unit collects the original information by sending a request to an API of the information distribution site with the character string "http:" or "https:".
  4.  The geospatial information generation device according to any one of claims 1 to 3, wherein the assignment unit adds the facility information by scraping the source code of the website displayed at the URL.
  5.  The geospatial information generation device according to claim 4, wherein the assignment unit determines, before the scraping, the web service that supplies the original information to the information distribution site.
  6.  The geospatial information generation device according to any one of claims 1 to 5, wherein the facility information is a facility name.
  7.  The geospatial information generation device according to claim 6, wherein latitude and longitude are acquired on the basis of the facility name by referring to a facility database that holds associations between facility names and latitude and longitude.
  8.  The geospatial information generation device according to claim 7, wherein the geospatial information includes an address obtained by reverse geocoding the latitude and longitude.
  9.  The geospatial information generation device according to claim 7 or 8, wherein the facility database is referred to after the reference range of the facility database has been limited, on the basis of part of the original information, to a range estimated to be the area where the facility having the facility name is located.
  10.  The geospatial information generation device according to any one of claims 6 to 9, wherein the facility information further includes a facility type.
  11.  The geospatial information generation device according to any one of claims 6 to 10, wherein the geospatial information includes a facility name and a facility type, or a facility name and latitude and longitude.
  12.  The geospatial information generation device according to any one of claims 1 to 11, wherein the information distribution site is an SNS.
  13.  The geospatial information generation device according to any one of claims 1 to 12, wherein the collection and the assignment are performed in real time.
  14.  The geospatial information generation device according to any one of claims 1 to 13, further comprising a transmission unit that transmits the geospatial information to a mobile terminal,
    wherein the transmission unit transmits the geospatial information about facilities within a predetermined distance from the current position of the mobile terminal.
  15.  The geospatial information generation device according to any one of claims 1 to 13, wherein a score is calculated by aggregating the geospatial information, past and/or present, that is associated with a specific point or area.
  16.  The geospatial information generation device according to claim 15, wherein the score is calculated according to a time slot or a period.
  17.  The geospatial information generation device according to claim 15 or 16, wherein the score is calculated by aggregating the amount of the geospatial information associated with the specific point or area, weighted according to whether each piece of geospatial information includes a photograph.
  18.  The geospatial information generation device according to claim 15 or 16, wherein the score is calculated by aggregating the amount of the geospatial information associated with the specific point or area, weighted according to the language of the character strings included in each piece of geospatial information.
  19.  A geospatial information generation method for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site, the method comprising:
    a step of collecting the original information; and
    a step of generating geospatial information by adding, to the original information, facility information related to the original information.
  20.  A geospatial information generation program for causing a computer to execute a method for generating geospatial information by associating a specific point or area in space with original information available on an information distribution site,
    the method comprising:
    a step of collecting the original information; and
    a step of generating geospatial information by adding, to the original information, facility information related to the original information.
PCT/JP2015/005044 2014-10-03 2015-10-02 Device, method, and program for generating geographical space information WO2016051810A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2016551561A JPWO2016051810A1 (en) 2014-10-03 2015-10-02 Apparatus, method, and program for generating geospatial information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2014205202 2014-10-03
JP2014-205202 2014-10-03

Publications (1)

Publication Number Publication Date
WO2016051810A1 (en) 2016-04-07

Family

ID=55629872

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/005044 WO2016051810A1 (en) 2014-10-03 2015-10-02 Device, method, and program for generating geographical space information

Country Status (2)

Country Link
JP (2) JPWO2016051810A1 (en)
WO (1) WO2016051810A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09259142A (en) * 1996-03-26 1997-10-03 Mitsubishi Electric Corp Device for place name retrieval with roman letter/ japanese syllabary notation
US20040194099A1 (en) * 2003-03-31 2004-09-30 John Lamping System and method for providing preferred language ordering of search results
JP2012247273A (en) * 2011-05-26 2012-12-13 Clarion Co Ltd Navigation system and navigation device
JP2013239064A (en) * 2012-05-16 2013-11-28 Nippon Telegr & Teleph Corp <Ntt> Document discrimination device, method and program
JP2014035691A (en) * 2012-08-09 2014-02-24 Toshiba Corp Information processing apparatus, information processing terminal, information processing program, and information processing method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
AKIRA HATTORI ET AL.: "A Study of Effectively Finding Tweets with Location Information", SYMPOSIUM ON MULTIMEDIA, DISTRIBUTED, COOPERATIVE AND MOBILE SYSTEMS (DICOM02011) RONBUNSHU, vol. 2011, no. 1, 8 July 2011 (2011-07-08), pages 1526 - 1530, ISSN: 1882-0840 *
KAZUFUMI WATANABE ET AL.: "Real-World Micro- Event Detection on Twitter", 4TH RAKUTEN R & D SYMPOSIUM YOKOSHU, 19 November 2011 (2011-11-19), Retrieved from the Internet <URL:http://rit.rakuten.co.jp/conf/rrds4/papers/RRDS4-030.pdf> [retrieved on 20160105] *
KEISUKE SAEKI ET AL.: "Gaikokujin Muke Kanko Promotion Shien no Tameno Social Media Bunseki Shuho no Kenkyu", DAI 6 KAI FORUM ON DATA ENGINEERING AND INFORMATION MANAGEMENT, vol. B4-2, 3 May 2014 (2014-05-03), Retrieved from the Internet <URL:http://db-event.jpn.org/deim2014/final/proceedings/B4-2.pdf> [retrieved on 20160107] *
WATANABE, KAZUFUMI ET AL.: "Jasmine: A Real-time Local-event Detection System Based on Geolocation Information Propagated to Microblogs", PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2011, pages 2541 - 2544, Retrieved from the Internet <URL:http://doi.acm.org/10.1145/2063576.2064014> *

Also Published As

Publication number Publication date
JP6176889B1 (en) 2017-08-09
JP2017182818A (en) 2017-10-05
JPWO2016051810A1 (en) 2017-08-10

Similar Documents

Publication Publication Date Title
Marchiori et al. The role of prior experience in the perception of a tourism destination in user-generated content
Jani et al. User-generated destination image through weblogs: A comparison of pre- and post-visit images
US20140156749A1 (en) Selection of region-specific brand pages based on location information of social networking system users
Túñez López et al. Collaborative tourism communication 2.0: promotion, advertising and interactivity in government tourism websites in Latin America.
Rzeszewski Geosocial capta in geographical research–a critical analysis
Singh et al. Investigation of social behaviour patterns using location-based data–a Melbourne case study
US10823578B1 (en) Geographical catchment data for electronic displays
Gao et al. User-generated content: A promising data source for urban informatics
US20170193073A1 (en) Characterizing data using descriptive tokens
US20150199441A1 (en) Method and apparatus for providing tour plan service
KR20140031496A (en) Travel service server and system for serving travel contents using the same
CN111199491A (en) Social circle recommendation method and device
Halim et al. Discovering new tourist attractions through social media data: a case study in Sabah Malaysia
Ružić et al. Development of mobile marketing in Croatian tourism using location-based services
Yamamoto Navigation system for foreign tourists in Japan
JP6176889B1 (en) Apparatus, method, and program for generating geospatial information
Dragićević Facebook travel related usage patterns of tourism students
JP6920996B2 (en) Devices, methods, and programs for generating behavioral information for foreign tourists
Yamamoto et al. Social recommendation GIS for urban tourist spots
JP5758943B2 (en) Information providing apparatus, information providing method, and program
Arismayanti et al. Comparison Study of Foreign Tourist Preferences in Choose The Attributes of Tourism Destination Pre and Post Natural Disaster
Chen et al. Increasing customer loyalty in internet marketing
Ráthonyi et al. Web 2.0 and Tourism.
Djukic et al. Tweeting in open public space: Case study Belgrade
Nagano et al. On-Foot Content Tourism Support System in Urban Tourist Destinations of Japan

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application (Ref document number: 15847563; Country of ref document: EP; Kind code of ref document: A1)
ENP Entry into the national phase (Ref document number: 2016551561; Country of ref document: JP; Kind code of ref document: A)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 15847563; Country of ref document: EP; Kind code of ref document: A1)