CN103118043B - Method and device for identifying user accounts - Google Patents

Method and device for identifying user accounts

Info

Publication number
CN103118043B
Authority
CN
China
Prior art keywords
content
user
identifying information
information
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201110363026.9A
Other languages
Chinese (zh)
Other versions
CN103118043A (en
Inventor
张立柱
张正威
杨廷辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201110363026.9A priority Critical patent/CN103118043B/en
Publication of CN103118043A publication Critical patent/CN103118043A/en
Priority to HK13108718.9A priority patent/HK1181575A1/en
Application granted granted Critical
Publication of CN103118043B publication Critical patent/CN103118043B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

This application discloses a method and device for identifying user accounts. Identifying information is collected for each user account, and the identifying information comprises preset critical fields and model fields. If the identifying information of any two user accounts contains at least one critical field with identical content, the two user accounts are determined to be registered by the same user. Otherwise, the content similarity of the model fields in the identifying information of the two user accounts is evaluated, and whether the two user accounts were registered by the same user is determined from that similarity. This solves the problem that user accounts registered by the same user cannot be effectively identified among a large number of user accounts.

Description

Method and device for identifying user accounts
Technical field
The application relates to the technical field of computer networks, and in particular to a method and device for identifying user accounts.
Background
With the development of Internet technology, Internet services have also grown rapidly. A wide variety of websites provide different users with all kinds of services: for example, users can buy and sell goods, publish opinions on current affairs, and make friends through websites. Business transactions and exchanges that in real life must be carried out face to face are thereby extended into the virtual network, enriching the ways in which users can transact.
To carry out business over the Internet in this way, a user must register a user account on a website and log in to the website with it, so as to use the services the website provides and complete the required business.
Because identity authentication systems such as network identity cards and real-name registration have not yet been implemented, most websites only require a user to provide a self-chosen user name and other identity information (such as a valid e-mail address or mobile phone number) to register a new user account. The same user can therefore repeatedly register multiple user accounts with different user names and identity information.
When managing the locally registered user accounts, a website server cannot tell whether registered user accounts were registered by the same user. The website server can only manage each registered user account separately and without distinction, for example when pushing information to a user account or managing the operations performed after the user account logs in to the website server.
In fact, multiple user accounts registered by the same user are likely to be correlated, and so are the operations the user performs after logging in to the website server with each account. If a scheme can be found for identifying, among a large number of user accounts, those registered by the same user, the accounts registered by the same user can be managed centrally, which effectively improves the efficiency of user account management. Furthermore, user behavior can be effectively analyzed and tracked, and useful information can be pushed accurately to the user accounts registered by the user.
Summary of the invention
The object of the application is to provide a method and device for identifying user accounts, so as to solve the problem that existing websites cannot effectively identify, among a large number of registered user accounts, those registered by the same user.
A method for identifying user accounts comprises the following steps:
A server collects identifying information of user accounts, the identifying information comprising preset critical fields and model fields;
For the identifying information of any two user accounts, the server judges whether at least one critical field has identical content in the identifying information of the two user accounts;
If so, the two user accounts are determined to be registered by the same user;
Otherwise, the server determines the content similarity of the model fields in the identifying information and judges whether the content similarity of the model fields in the identifying information of the two user accounts meets a set condition; if so, the two user accounts are determined to be registered by the same user; otherwise, the two user accounts are determined to be registered by different users;
Determining the content similarity of the model fields in the identifying information comprises:
Determining the weight value of each model field;
Performing a similarity computation on the contents of each pair of identically named model fields in the identifying information of the two user accounts, and summing the results weighted by the weight values of the corresponding model fields, to obtain the content similarity of the model fields.
A server comprises:
A data acquisition module, configured to collect identifying information of user accounts, the identifying information comprising preset critical fields and model fields;
An identification judgment module, configured to receive the identifying information of the user accounts sent by the data acquisition module and, for the identifying information of any two user accounts, judge whether at least one critical field has identical content in the identifying information of the two user accounts; if so, determine that the two user accounts are registered by the same user; otherwise, judge, according to the result computed by a first similarity computing module, whether the content similarity of the model fields in the identifying information of the two user accounts meets a set condition; if so, determine that the two user accounts are registered by the same user; otherwise, determine that the two user accounts are registered by different users;
A first similarity computing module, configured to determine the weight value of each model field, perform a similarity computation on the contents of each pair of identically named model fields in the identifying information of the two user accounts, sum the results weighted by the weight values of the corresponding model fields to obtain the content similarity of the model fields, and send it to the identification judgment module.
The beneficial effects of the application are as follows:
The embodiments of the application provide a method and device for identifying user accounts. Identifying information is collected for each user account, and the identifying information comprises preset critical fields and model fields. If the identifying information of any two user accounts contains at least one critical field with identical content, the two user accounts are determined to be registered by the same user. Otherwise, the content similarity of the model fields in the identifying information of the two user accounts is evaluated, and whether the two user accounts were registered by the same user is determined from that similarity. This solves the problem that user accounts registered by the same user cannot be effectively identified among a large number of user accounts.
Brief description of the drawings
Figure 1 is a schematic flowchart of the method for identifying user accounts in Embodiment 1 of the application;
Figure 2 is a schematic structural diagram of the server in Embodiment 2 of the application.
Detailed description
The embodiments of the application provide a method and device for identifying user accounts. Identifying information is collected for each user account, and the identifying information comprises preset critical fields and model fields. If the identifying information of any two user accounts contains at least one critical field with identical content, the two user accounts are determined to be registered by the same user. Otherwise, the content similarity of the model fields in the identifying information of the two user accounts is evaluated, and whether the two user accounts were registered by the same user is determined from that similarity. This solves the problem that user accounts registered by the same user cannot be effectively identified among a large number of user accounts.
With the technical scheme of the application, the user accounts registered by the same user can be managed centrally, which effectively improves the efficiency of user account management. Furthermore, user behavior can be effectively analyzed and tracked, and useful information can be pushed accurately to the user accounts registered by the user.
Embodiment 1:
Figure 1 is a schematic flowchart of the method for identifying user accounts in Embodiment 1 of the application. The method comprises the following steps:
Step 101: A server collects identifying information of user accounts, the identifying information comprising preset critical fields and model fields.
The user account is a virtual account that a user registers through a website in order to carry out business activities on that website. The user may be an individual, a company, or an organization that registers the user account.
The identifying information of the user account comprises at least one of the following kinds of information:
(1) Registration information provided when the user account is registered.
The registration information may be the information the user fills in when registering the user account. For a personal user, for example, the registration information may comprise name, ID card number, sex, e-mail address, level of education, and similar information; for an enterprise user, it may comprise enterprise name, organization code, type of business, contact address, and similar information.
(2) Information obtained from a third party.
The third party may be a website other than the registration website, or a governmental or non-governmental institution such as an industry and commerce authority, a civil affairs authority, or a judicial authority. The information obtained from the third party may be other information related to the user that is obtained from the third party on the basis of the content of the registration information. For example, after the enterprise name of an enterprise user is determined from the registration information, information such as the registered capital and industry of the enterprise user can be obtained from the industry and commerce authority.
(3) Information generated while the user account is in use, such as the IP address or MAC information used when the user account logs in.
The content of the collected identifying information can be divided by its nature into two classes: content in critical fields and content in model fields.
A critical field is identifying information that can uniquely, or nearly uniquely, identify the registrant of a user account. For a personal user, for example, the name of a critical field may be ID card number, passport number, and so on; for the ID card number critical field, the content may be the ID card number the user filled in when registering the user account.
A model field is a non-critical field in the identifying information. For a personal user, for example, the name of a model field may be sex, date of birth, nationality, e-mail address, or level of education; for the date of birth model field, the content may be the date of birth the user filled in when registering the user account.
Note that the collected identifying information does not necessarily contain every preset critical field or model field. That is, if the preset critical fields and model fields are arranged as a data table, then when the content of each collected piece of identifying information is filled into a data table such as Table 1, some critical fields or model fields may have content while others are empty.
For example, Table 1 shows the identifying information of user account A in the set data-table form, comprising the preset critical fields and model fields. If the identifying information collected for user account A contains the user's name, ID card number, sex, and nationality, then in Table 1 the critical field named ID card number has content and the critical field named passport number is empty; the model fields named name, sex, and nationality have content, and the model fields named date of birth and e-mail address are empty.
Table 1
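A record with the layout described for Table 1 could be represented as in the following minimal sketch. The class name IdentifyingInfo, the field keys, and all field values are hypothetical placeholders introduced for illustration; the source only states which fields of user account A are filled and which are empty.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class IdentifyingInfo:
    """Identifying information of one user account; None marks an empty field."""
    critical: Dict[str, Optional[str]] = field(default_factory=dict)
    model: Dict[str, Optional[str]] = field(default_factory=dict)

    @staticmethod
    def filled(group: Dict[str, Optional[str]]) -> Dict[str, str]:
        """Return only the fields of a group that actually have content."""
        return {name: value for name, value in group.items() if value is not None}


# Placeholder values; only the filled/empty pattern follows the description of Table 1.
account_a = IdentifyingInfo(
    critical={"id_card_number": "ID-0001", "passport_number": None},
    model={
        "name": "Example Name",
        "sex": "M",
        "nationality": "China",
        "date_of_birth": None,
        "email": None,
    },
)
```

Keeping empty fields as explicit None entries, rather than omitting them, preserves the fixed set of preset fields for every account.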
Note also that the identifying information may comprise the three kinds of information above (registration information provided when the user account is registered, information obtained from a third party, and information generated while the user account is in use), and these three kinds of information may be collected through different channels. A field of the same name may therefore have different contents for the same user account. Continuing with the case shown in Table 1, if the registration information of user account A indicates that the content of the nationality model field is China, while the IP address used when user account A logs in indicates that it is Japan, then a field of the same name has different contents for the same user account. This embodiment includes, but is not limited to, the following two approaches:
The first approach: assign weights to identifying information according to the source from which it was collected. That is, weights can be assigned to the registration information, the information obtained from a third party, and the information generated while the user account is in use. When a field of the same name has different contents for the same user account, the weights of the sources of the different contents are determined, and the content provided by the source with the higher weight is retained. For example, suppose the registration information has the highest weight, the information obtained from a third party the next highest, and the information generated while the user account is in use the lowest. If the registration information of user account A indicates that the content of the nationality model field is China, while the IP address used when user account A logs in indicates that it is Japan, the nationality determined from the registration information is retained and the nationality determined from the login IP address is discarded.
The second approach: retain all the differing contents of the same field from different sources as the content of that field. For example, if the registration information of user account A indicates that the content of the nationality model field is China, while the IP address used when user account A logs in indicates that it is Japan, both China and Japan can be filled in as the content of the nationality model field in Table 1.
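The two approaches above can be sketched as follows. This is an illustration only: the source names, the numeric source weights, and the function names are assumptions, since the text fixes only the ordering of the weights in its example.

```python
from typing import Dict, List, Tuple

# Hypothetical source weights; only the ordering (registration > third party > runtime)
# comes from the example in the text.
SOURCE_WEIGHT: Dict[str, float] = {
    "registration": 3.0,
    "third_party": 2.0,
    "runtime": 1.0,
}

Observation = Tuple[str, str]  # (source, content)


def keep_highest_weight(observations: List[Observation]) -> str:
    """First approach: keep the content supplied by the highest-weighted source."""
    _, content = max(observations, key=lambda obs: SOURCE_WEIGHT[obs[0]])
    return content


def keep_all(observations: List[Observation]) -> List[str]:
    """Second approach: keep every distinct content, in first-seen order."""
    kept: List[str] = []
    for _, content in observations:
        if content not in kept:
            kept.append(content)
    return kept


# The nationality conflict for user account A described in the text.
nationality = [("registration", "China"), ("runtime", "Japan")]
```

With these inputs, the first approach keeps the value from the registration information, and the second keeps both values.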
After the identifying information of each user account is collected, it can be stored in the server's database in the data-table form shown in Table 1. More preferably, an identifier can be assigned to each stored piece of identifying information. The identifier not only allows the identifying information of each user account to be queried from the database, but also indicates whether the user account was registered by a personal user or an enterprise user. For example, the identifier assigned to the identifying information of the personal user account A shown in Table 1 may be A_001, where A indicates that the identifying information belongs to a personal user account and 001 is the serial number of the identifying information among personal user accounts. Similarly, the identifier assigned to the identifying information of an enterprise user account may be B_001, where B indicates that the identifying information belongs to an enterprise user account and 001 is the serial number of the identifying information among enterprise user accounts.
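The identifier scheme in the example (A_001, B_001) could be generated and parsed as sketched below. The function names and the three-digit zero padding are assumptions inferred from the two sample identifiers.

```python
from typing import Dict, Tuple

# Prefixes from the example: A for personal accounts, B for enterprise accounts.
PREFIX: Dict[str, str] = {"personal": "A", "enterprise": "B"}
KIND: Dict[str, str] = {prefix: kind for kind, prefix in PREFIX.items()}


def make_identifier(kind: str, serial: int) -> str:
    """Build an identifier such as A_001 from an account kind and a serial number."""
    return f"{PREFIX[kind]}_{serial:03d}"


def parse_identifier(identifier: str) -> Tuple[str, int]:
    """Recover the account kind and serial number from an identifier."""
    prefix, serial = identifier.split("_", 1)
    return KIND[prefix], int(serial)
```

Parsing the prefix is what lets a later step compare personal accounts only with personal accounts, and enterprise accounts only with enterprise accounts.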
In addition, so that the collected identifying information can be conveniently stored in a database and analyzed, the critical fields and model fields in the collected information must have consistent meanings, regardless of whether the identifying information is registration information, information obtained from a third party, or information generated while the user account is in use. For example, for the model field named nationality in Table 1, wherever the identifying information is collected from, the content of this field should always represent the user's nationality.
Step 102: For the identifying information of any two user accounts that have not yet been identified, the server judges whether at least one critical field has identical content in the identifying information of the two user accounts. If so, step 105 is performed; otherwise, step 103 is performed.
More preferably, if in step 101 the identifying information of user accounts is stored in the server's database in data-table form and the stored identifying information is assigned identifiers that distinguish personal user accounts from enterprise user accounts, then in step 102 the identifiers can be used so that step 102 is performed among identifying information that all belongs to personal user accounts, and separately among identifying information that all belongs to enterprise user accounts.
Because the content of a critical field can uniquely, or nearly uniquely, identify the registrant of a user account, as long as at least one critical field has identical content in two pieces of identifying information, the user accounts corresponding to the two pieces of identifying information can be considered to be registered by the same user.
Suppose the content of the ID card number critical field in the identifying information of user account A is identical to that in the identifying information of user account B. Then, regardless of whether the contents of the other critical fields or the model fields of the two pieces of identifying information are related, user account A and user account B can be considered to be registered by the same user.
Step 102 is also not limited to pairwise comparison of the identifying information already stored in the database.
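The critical-field check of step 102 can be sketched as below. The function name is hypothetical, and treating an empty field (None) as never matching is an assumption: the text does not say how two empty critical fields should compare, but counting them as identical would link unrelated accounts.

```python
from typing import Dict, Optional

Fields = Dict[str, Optional[str]]


def shares_critical_field(a: Fields, b: Fields) -> bool:
    """Return True if some critical field is filled in both accounts with identical content."""
    for name, value in a.items():
        # Assumption: an empty field (None) never counts as a match.
        if value is not None and b.get(name) == value:
            return True
    return False
```

For example, two accounts with the same ID card number match even if their passport numbers differ or are missing.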
Step 103: The server judges whether the content similarity of the model fields in the identifying information of the two user accounts meets a set condition. If so, step 105 is performed; otherwise, step 104 is performed.
Specifically, the server can determine the content similarity of the model fields in the identifying information of the two user accounts through the following steps:
Step one: First determine the weight value of each model field in the identifying information of the two user accounts.
The weight value of a model field can be determined by how important the field is in characterizing the user's identity: the more important a model field is for characterizing user identity, the higher the weight assigned to it.
More preferably, this embodiment can determine the weight value of each model field in the identifying information by means of a model algorithm. Specifically, an analytic hierarchy process (AHP) model can be used: the weight value of each model field is obtained by building a hierarchical structure model, constructing a pairwise comparison matrix, computing the weight vector, and performing a consistency check. Determining the weight values with an analytic hierarchy process model largely avoids the errors introduced by setting the weight values of model fields subjectively, and makes the resulting weight values more accurate.
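The weight-vector and consistency-check steps of the analytic hierarchy process can be sketched as follows, assuming a pairwise comparison matrix has already been constructed. The function name is hypothetical; the power-iteration approach, the random index values (Saaty's published table), and the customary acceptance threshold of a consistency ratio below 0.1 are standard AHP practice rather than details given in the source.

```python
from typing import List, Tuple

# Saaty's random consistency index for matrices of size 1 to 9.
RANDOM_INDEX = [0.0, 0.0, 0.58, 0.90, 1.12, 1.24, 1.32, 1.41, 1.45]


def ahp_weights(matrix: List[List[float]], iterations: int = 100) -> Tuple[List[float], float]:
    """Return (weights, consistency_ratio) for a pairwise comparison matrix (size <= 9).

    The weights are the principal eigenvector, normalised to sum to 1,
    approximated by power iteration.
    """
    n = len(matrix)
    weights = [1.0 / n] * n
    lambda_max = float(n)
    for _ in range(iterations):
        product = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
        # Because the weights sum to 1, the sum of (M @ w) estimates lambda_max.
        lambda_max = sum(product)
        weights = [x / lambda_max for x in product]
    consistency_index = (lambda_max - n) / (n - 1) if n > 1 else 0.0
    random_index = RANDOM_INDEX[n - 1]
    consistency_ratio = consistency_index / random_index if random_index > 0 else 0.0
    return weights, consistency_ratio
```

Entry matrix[i][j] states how much more important field i is than field j, with matrix[j][i] being its reciprocal. A consistency ratio at or above 0.1 usually means the pairwise judgments should be revised before the weights are used.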
Step two: Perform a similarity computation on the contents of each pair of identically named model fields in the identifying information of the two user accounts, and sum the results weighted by the weight values of the corresponding model fields, to obtain the content similarity of the model fields.
The similarity algorithm used in this embodiment is not limited to any particular currently available algorithm. For example, if the contents of two model fields are identical, their similarity is determined to be 1, and if they differ, their similarity is determined to be 0; as another example, a Hamming distance algorithm may be used.
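The two similarity measures mentioned above can be sketched as follows. The function names are hypothetical. Hamming distance is defined only for strings of equal length, so counting the length difference as mismatches and normalising the distance into a similarity between 0 and 1 are assumptions made for this illustration.

```python
def exact_similarity(a: str, b: str) -> float:
    """Similarity is 1 if the two field contents are identical, otherwise 0."""
    return 1.0 if a == b else 0.0


def hamming_similarity(a: str, b: str) -> float:
    """One minus the normalised Hamming distance between two field contents.

    Assumption: positions beyond the end of the shorter string count as mismatches.
    """
    length = max(len(a), len(b))
    if length == 0:
        return 1.0
    mismatches = sum(x != y for x, y in zip(a, b)) + abs(len(a) - len(b))
    return 1.0 - mismatches / length
```

The Hamming variant gives partial credit to contents that differ in only a few positions, which may suit fields such as dates written in a fixed format.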
After the similarity between the contents of each pair of identically named model fields is determined, the results can be summed, weighted by the weight values of the corresponding model fields. For example, suppose content similarity is judged on the four model fields nationality, e-mail address, sex, and date of birth, with weights a, b, c, and d respectively. If the similarity computations on the contents of these four model fields in the identifying information of two user accounts give the results X1, X2, X3, and X4, the weighted sum is aX1+bX2+cX3+dX4, and this weighted sum is the overall content similarity of the model fields in the identifying information of the two user accounts.
The set condition can be chosen according to the precision required of the similarity judgment: if higher precision is required, the set condition can be a larger threshold; if lower precision is required, it can be a smaller threshold.
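The weighted sum aX1+bX2+cX3+dX4 and the threshold comparison can be sketched as below. The numeric weights and similarity results are hypothetical placeholders, and comparing with "greater than or equal to" is an assumption about how the set condition is tested.

```python
from typing import Dict


def weighted_similarity(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Compute a*X1 + b*X2 + ... over the model fields present in scores."""
    return sum(weights[name] * score for name, score in scores.items())


def meets_condition(similarity: float, threshold: float) -> bool:
    """Assumed form of the set condition: similarity reaches the threshold."""
    return similarity >= threshold


# Hypothetical weights a, b, c, d for the four model fields in the example.
FIELD_WEIGHTS = {"nationality": 0.1, "email": 0.5, "sex": 0.1, "date_of_birth": 0.3}

# Hypothetical per-field similarity results X1..X4 for one pair of accounts.
example_scores = {"nationality": 1.0, "email": 0.0, "sex": 1.0, "date_of_birth": 1.0}
```

Raising the threshold makes the judgment stricter, so fewer pairs are declared to belong to the same user.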
More preferably, when the overall content similarity of the model fields in the identifying information of two user accounts is computed in step 103, the reliability of the source of the identifying information can also be taken into account: information from a more reliable source can be given a higher weight. The overall content similarity of the model fields is computed with the weight values of the information sources as follows:
Perform a similarity computation on the contents of each pair of identically named model fields in the identifying information of the two user accounts, determine the weight value of the kind of information to which the compared contents belong, and then sum the results weighted by those weight values, to obtain the content similarity of the model fields.
For example, suppose content similarity is judged on the four model fields nationality, e-mail address, sex, and date of birth, where nationality, sex, and date of birth are obtained from the registration information and the e-mail address is obtained from a third party. Suppose the weight value of the registration information is A and the weight value of information obtained from a third party is B. If the similarity computations on the contents of these four model fields in the identifying information of two user accounts give the results X1, X2, X3, and X4, the weighted sum is AX1+BX2+AX3+AX4, and this weighted sum is the overall content similarity of the model fields in the identifying information of the two user accounts.
If both the weight assigned to a model field according to its importance in characterizing user identity and the weight of the information source are considered, then after the content similarity computation is performed on identically named model fields, each result can be weighted by both the weight of the corresponding model field and the weight of the information source and summed, to obtain the content similarity of the model fields.
Continuing with the example of judging content similarity on the four model fields nationality, e-mail address, sex, and date of birth, suppose the weights of the model fields are a, b, c, and d respectively; nationality, sex, and date of birth are obtained from the registration information, and the e-mail address is obtained from a third party; the weight value of the registration information is A and the weight value of information obtained from a third party is B. If the similarity computations on the contents of these four model fields in the identifying information of two user accounts give the results X1, X2, X3, and X4, the weighted sum is AaX1+BbX2+AcX3+AdX4, and this weighted sum is the overall content similarity of the model fields in the identifying information of the two user accounts.
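The combined weighting AaX1+BbX2+AcX3+AdX4 can be sketched as follows. All numeric values, the type name FieldScore, and the function name are hypothetical. The sketch also assumes, as the example in the text does, that each compared field has a single source for both accounts.

```python
from typing import Dict, NamedTuple


class FieldScore(NamedTuple):
    similarity: float  # X_i, the similarity result for one model field
    source: str        # the kind of information the field content came from


def combined_similarity(
    scores: Dict[str, FieldScore],
    field_weights: Dict[str, float],
    source_weights: Dict[str, float],
) -> float:
    """Sum of source weight * field weight * similarity over all compared fields."""
    return sum(
        source_weights[score.source] * field_weights[name] * score.similarity
        for name, score in scores.items()
    )


# Hypothetical values: A = 1.0 for registration information, B = 0.8 for third party.
SOURCE_WEIGHTS = {"registration": 1.0, "third_party": 0.8}
FIELD_WEIGHTS = {"nationality": 0.1, "email": 0.5, "sex": 0.1, "date_of_birth": 0.3}

pair_scores = {
    "nationality": FieldScore(1.0, "registration"),
    "email": FieldScore(1.0, "third_party"),
    "sex": FieldScore(0.0, "registration"),
    "date_of_birth": FieldScore(1.0, "registration"),
}
```

Passing field weights that are all 1 reduces this to the source-only sum AX1+BX2+AX3+AX4.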
Step 104: Determine that the two user accounts are registered by different users, and jump to step 106.
More preferably, to ensure the reliability of the judgment, the result of step 104 may be further re-judged by manual intervention. The scheme of this embodiment includes, but is not limited to, the following: if the result of the manual re-judgment differs from the determination of step 104, the manual result prevails.
Step 105: Determine that the two user accounts are registered by the same user, and jump to step 106.
As with step 104, the determination of step 105 may also be re-judged by manual intervention.
In the scheme of this embodiment, if manual intervention is carried out after step 104 or step 105, the process and result of the manual intervention can be recorded as log information.
Step 106: After the judgment of whether the two user accounts were registered by the same user is obtained, judge whether there are still two user accounts that have not been identified. If so, jump to step 102; otherwise, end the identification process.
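Steps 102 to 106 can be put together as in the following sketch. The account layout, the function name, and the caller-supplied model_similarity function are assumptions. Iterating over every unordered pair is one straightforward reading of "any two user accounts" and costs a number of comparisons quadratic in the number of accounts.

```python
from itertools import combinations
from typing import Callable, Dict, List, Optional, Tuple

Fields = Dict[str, Optional[str]]
Account = Dict[str, Fields]  # {"critical": {...}, "model": {...}}


def identify_pairs(
    accounts: Dict[str, Account],
    model_similarity: Callable[[Fields, Fields], float],
    threshold: float,
) -> List[Tuple[str, str, bool]]:
    """Return (id_a, id_b, same_user) for every pair of accounts."""
    results = []
    # Step 106: keep going until no unidentified pair remains.
    for (id_a, a), (id_b, b) in combinations(accounts.items(), 2):
        # Step 102: any filled critical field with identical content?
        same_key = any(
            value is not None and b["critical"].get(name) == value
            for name, value in a["critical"].items()
        )
        # Step 103: otherwise fall back to the model-field similarity.
        same_user = same_key or model_similarity(a["model"], b["model"]) >= threshold
        # Steps 104 / 105: record the determination.
        results.append((id_a, id_b, same_user))
    return results
```

The similarity function is evaluated only when no critical field matches, mirroring the order of steps 102 and 103.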
With the scheme of Embodiment 1, it can be identified whether multiple user accounts on the same website or on different websites were registered by the same user. Multiple user accounts registered by the same user can be bound together, and a notification mechanism and a result query mechanism can be provided to the website servers hosting those user accounts, so that a website server can centrally manage the user accounts registered by the same user, effectively improving the efficiency of user account management.
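Binding the accounts of one user could be done by grouping pairwise results with a union-find structure, as in the sketch below. This is one possible approach and not a mechanism described in the source. In particular, it treats "registered by the same user" as transitive (if A matches B and B matches C, all three are bound), which is an assumption. The function name is hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def bind_accounts(
    account_ids: Iterable[str],
    same_user_pairs: Iterable[Tuple[str, str]],
) -> List[List[str]]:
    """Group account ids into bound sets, given the pairs judged to share a user."""
    parent: Dict[str, str] = {account: account for account in account_ids}

    def find(x: str) -> str:
        # Follow parent links to the group representative, shortening the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in same_user_pairs:
        parent[find(a)] = find(b)

    groups: Dict[str, List[str]] = {}
    for account in parent:
        groups.setdefault(find(account), []).append(account)
    return sorted(sorted(group) for group in groups.values())
```

Each returned group is a set of accounts that a website server could manage together.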
Preferably, by analyzing the multiple user accounts registered by the same user, the business behavior of the user can be effectively tracked and the user's business development trend comprehensively analyzed, so that useful information can be pushed accurately to any one of the user accounts registered by the user. This not only reduces the network resources and management cost that the website server would incur by pushing information to every one of the user's accounts, but also makes information pushing more purposeful and targeted, which helps the website operator market precisely.
Preferably, when multiple user accounts are identified as registered by the same user, the user accounts can also be divided into primary and secondary user accounts (for example, judged by the number of times the user logs in to each account: an account logged in to more frequently is a primary user account, and one logged in to less often is a secondary user account). The offline time of each secondary user account of the user is timed, and when the timer reaches a set value, the secondary user account is deregistered. This frees some storage space on the website server and thus reduces the operating cost of the website to some extent.
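The primary/secondary split and the offline timer can be sketched as below. Choosing exactly one primary account (the one with the most logins), measuring offline time in days, and the function names are all assumptions; the text only says that more frequently used accounts are primary.

```python
from typing import Dict, List, Tuple


def split_primary_secondary(login_counts: Dict[str, int]) -> Tuple[str, List[str]]:
    """Pick the most-used account as primary; the rest are secondary."""
    primary = max(login_counts, key=login_counts.get)
    secondary = [account for account in login_counts if account != primary]
    return primary, secondary


def accounts_to_deregister(
    secondary: List[str],
    offline_days: Dict[str, int],
    limit_days: int,
) -> List[str]:
    """Return the secondary accounts whose offline time has reached the set value."""
    return [account for account in secondary if offline_days[account] >= limit_days]
```

The primary account is never returned by accounts_to_deregister, so the user always keeps at least one account.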
Preferably, when multiple user accounts are identified as registered by the same user, if one of them is determined to carry a certain risk, all the other bound user accounts registered by the same user can also be marked as risky. In addition, if multiple user accounts are identified as registered by the same user on the same website within a short period, and the number of such accounts is large, the registrations can be regarded as malicious registration by the user on the website. The scheme of this embodiment can therefore also help avoid security risks from user accounts, reduce the network security vulnerabilities they bring, and improve the security of websites.
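Both checks can be sketched as follows. The function names, the representation of registration times as numeric timestamps, and the concrete meaning of "short period" and "large number" (a sliding window and a count limit supplied by the caller) are assumptions.

```python
from typing import List, Set


def propagate_risk(groups: List[List[str]], risky: Set[str]) -> Set[str]:
    """Mark every account bound to a risky account as risky too."""
    flagged = set(risky)
    for group in groups:
        if any(account in risky for account in group):
            flagged.update(group)
    return flagged


def looks_like_bulk_registration(
    register_times: List[float], window: float, max_accounts: int
) -> bool:
    """True if more than max_accounts registrations fall within any window of the given length."""
    times = sorted(register_times)
    start = 0
    for end in range(len(times)):
        # Shrink the window from the left until it spans at most `window` time units.
        while times[end] - times[start] > window:
            start += 1
        if end - start + 1 > max_accounts:
            return True
    return False
```

The first function works on groups of bound accounts; the second takes the registration times of one user's accounts on a single website.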
Embodiment two:
Figure 2 is a structural schematic of a server that belongs to the same inventive concept as Embodiment One of the present application. The server comprises a data acquisition module 11, an identification judgment module 12, and a first similarity computing module 13.
The data acquisition module 11 is configured to gather the identifying information of user accounts; the identifying information comprises preset critical fields and model fields.
Specifically, the identifying information of a user account collected by the data acquisition module 11 comprises at least one of the following: registration information supplied when the user account was registered, information obtained from a third party, and information produced while the user account is in use. A critical field in the identifying information gathered by the data acquisition module 11 is a field that can uniquely, or nearly uniquely, identify the person who registered the user account; a model field is any non-critical field in the identifying information.
It should be noted that, so that the identifying information gathered by the data acquisition module 11 can be conveniently stored in a database and analyzed, the meaning of the critical fields and model fields needs to be consistent across the collected information, regardless of whether that information is registration information, information obtained from a third party, or information produced while the user account is in use.
The identification judgment module 12 is configured to receive the identifying information of user accounts sent by the data acquisition module 11 and, for the identifying information of any two user accounts, to judge whether the content of at least one critical field is identical in the identifying information of the two user accounts. If so, it determines that the two user accounts were registered by the same user. Otherwise, it judges whether the content similarity of the model fields in the identifying information of the two user accounts meets a set condition: if so, it determines that the two user accounts were registered by the same user; otherwise, it determines that the two user accounts were registered by different users.
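The two-stage decision above can be sketched as a single function. Several details are assumptions made for illustration: the "set condition" is modelled as a numeric threshold, an empty critical field is treated as not matching anything, and the field names and the stand-in similarity function `exact_match_ratio` are hypothetical.

```python
def registered_by_same_user(info_a, info_b, critical_fields, model_similarity, threshold):
    """Stage 1: any identical, non-empty critical field decides immediately.
    Stage 2: otherwise compare the model-field similarity with the threshold."""
    for name in critical_fields:
        content_a = info_a.get(name)
        content_b = info_b.get(name)
        if content_a and content_a == content_b:
            return True
    return model_similarity(info_a, info_b) >= threshold


def exact_match_ratio(info_a, info_b, model_fields=("city", "nickname")):
    """Stand-in similarity: fraction of model fields with identical content."""
    hits = sum(1 for f in model_fields if info_a.get(f) and info_a.get(f) == info_b.get(f))
    return hits / len(model_fields)


CRITICAL = ("id_number", "phone")  # hypothetical critical fields
a = {"id_number": "X1", "phone": "", "city": "Hangzhou", "nickname": "lz"}
b = {"id_number": "X1", "phone": "", "city": "Beijing", "nickname": "zw"}
c = {"id_number": "", "phone": "", "city": "Hangzhou", "nickname": "lz"}
d = {"id_number": "Y9", "phone": "555", "city": "Shanghai", "nickname": "th"}

same_ab = registered_by_same_user(a, b, CRITICAL, exact_match_ratio, 0.8)
same_ac = registered_by_same_user(a, c, CRITICAL, exact_match_ratio, 0.8)
same_ad = registered_by_same_user(a, d, CRITICAL, exact_match_ratio, 0.8)
```

Accounts `a` and `b` are linked by the shared critical field alone, `a` and `c` only through model-field similarity, and `a` and `d` not at all.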
The first similarity computing module 13 is specifically configured to determine the weight of each model field, to compute the similarity between the contents of same-named model fields in the identifying information of two user accounts, to sum the results weighted by the weights of the corresponding model fields, and to send the resulting content similarity of the model fields to the identification judgment module 12.
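A minimal sketch of this weighted sum follows. The per-field similarity operation is not specified in this passage, so the example substitutes the ratio from Python's standard `difflib.SequenceMatcher` as a stand-in; the field names and weights are likewise hypothetical.

```python
from difflib import SequenceMatcher


def field_similarity(content_a, content_b):
    """Stand-in per-field similarity in [0, 1]; empty content scores 0."""
    if not content_a or not content_b:
        return 0.0
    return SequenceMatcher(None, content_a, content_b).ratio()


def content_similarity(info_a, info_b, field_weights):
    """Weighted sum of similarities over same-named model fields."""
    return sum(
        weight * field_similarity(info_a.get(name, ""), info_b.get(name, ""))
        for name, weight in field_weights.items()
    )


weights = {"city": 0.4, "nickname": 0.6}  # hypothetical model-field weights
acc_a = {"city": "Hangzhou", "nickname": "lizhu"}
acc_b = {"city": "Hangzhou", "nickname": "lizhu"}
acc_c = {"city": "Hangzhou"}

identical_score = content_similarity(acc_a, acc_b, weights)
partial_score = content_similarity(acc_a, acc_c, weights)
```

With weights that sum to 1, the score stays in the range 0 to 1, which makes it straightforward to compare against a fixed threshold.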
Specifically, the server may also comprise a second similarity computing module 14.
The second similarity computing module 14 is configured to determine the weights of the registration information, the information obtained from a third party, and the information produced while the user account is in use within the identifying information; to compute the similarity between the contents of same-named model fields in the identifying information of two user accounts; to determine the weight of the kind of information to which each compared content belongs; to sum the results weighted by those information weights; and to send the resulting content similarity of the model fields to the identification judgment module 12.
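The source-weighted variant can be sketched as below. The text does not say how contents from different kinds of information are paired, so this example assumes that a field is compared only against the same-named field from the same kind of information, and it divides by the total weight applied so that the score stays between 0 and 1. Both choices, along with the source labels, the weights, and the `difflib`-based similarity, are assumptions.

```python
from difflib import SequenceMatcher


def text_similarity(content_a, content_b):
    """Stand-in per-field similarity in [0, 1]; empty content scores 0."""
    if not content_a or not content_b:
        return 0.0
    return SequenceMatcher(None, content_a, content_b).ratio()


def source_weighted_similarity(info_a, info_b, source_weights):
    """info_x maps a source label to a {field name: content} dictionary."""
    total = 0.0
    weight_used = 0.0
    for source, weight in source_weights.items():
        fields_a = info_a.get(source, {})
        fields_b = info_b.get(source, {})
        # Only fields present under the same name in both accounts are compared.
        for name in fields_a.keys() & fields_b.keys():
            total += weight * text_similarity(fields_a[name], fields_b[name])
            weight_used += weight
    return total / weight_used if weight_used else 0.0


SOURCE_WEIGHTS = {"registration": 0.5, "third_party": 0.3, "runtime": 0.2}
u1 = {"registration": {"city": "Hangzhou"}, "runtime": {"device": "phone-01"}}
u2 = {"registration": {"city": "Hangzhou"}, "runtime": {"device": "phone-01"}}
u3 = {"third_party": {"employer": "Acme"}}

score_same = source_weighted_similarity(u1, u2, SOURCE_WEIGHTS)
score_disjoint = source_weighted_similarity(u1, u3, SOURCE_WEIGHTS)
```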
The identification judgment module 12 is further configured to judge, according to the content similarity obtained by the first similarity computing module 13 or the second similarity computing module 14, whether the content similarity of the model fields in the identifying information of the two user accounts meets the set condition.
More preferably, the server may also comprise a data memory module 15 and a manual identification module 16.
The data memory module 15 is configured to store the identifying information of each user account collected by the data acquisition module 11. The manual identification module 16 is configured to provide an operation interface for receiving manual intervention information and to use the received manual intervention information to re-judge the recognition result obtained by the identification judgment module 12. If the result of the manual re-judgment is inconsistent with the result determined by the identification judgment module 12, the result of the manual intervention prevails.
It should be noted that the identifying information gathered by the data acquisition module 11 does not necessarily contain every preset critical field or model field. That is, if the preset critical fields and model fields are stored in the data memory module 15 as a data table or in another form, then for each piece of gathered identifying information some critical fields or model fields may have content while others are empty. It should also be noted that the number of preset critical fields and model fields stored in the data memory module 15, as a data table or in another form, should change dynamically according to the identifying information of the user accounts collected each time; more preferably, the number of preset critical fields or model fields can be increased according to the identifying information collected.
It should be noted that the identifying information gathered by the data acquisition module 11 comprises three kinds of information (registration information supplied when the user account was registered, information obtained from a third party, and information produced while the user account is in use), and these three kinds of information can be collected through different channels, so a same-named field of the same user account may end up with different contents. When that happens, the ways in which the data memory module 15 stores the identifying information of the user account include, but are not limited to, the following two:
The first way: the data memory module 15 assigns weights to identifying information according to the source from which it was gathered. When a same-named field of the same user account has different contents, the module determines the weight of the source of each content, retains the content provided by the source with the higher weight, and discards the content provided by the sources with lower weights.
The second way: the data memory module 15 retains all of the differing contents that the various sources provide for the same field as the content of that field.
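The two storage strategies can be sketched side by side. The representation of conflicting values as `(source, content)` pairs, the source labels, and the weights are assumptions for the example.

```python
def keep_highest_weight(candidates, source_weights):
    """First way: keep only the content from the highest-weighted source."""
    source, content = max(candidates, key=lambda pair: source_weights[pair[0]])
    return content


def keep_all(candidates):
    """Second way: keep every distinct content, in the order first seen."""
    kept = []
    for _, content in candidates:
        if content not in kept:
            kept.append(content)
    return kept


SOURCE_WEIGHTS = {"registration": 0.5, "third_party": 0.3, "runtime": 0.2}
# Conflicting contents collected for one field of one user account.
city_candidates = [("runtime", "Shanghai"), ("registration", "Hangzhou"), ("third_party", "Shanghai")]

single_value = keep_highest_weight(city_candidates, SOURCE_WEIGHTS)
all_values = keep_all(city_candidates)
```

The first way keeps storage and later comparison simple at the cost of discarding data; the second preserves everything but means a field can hold several contents that a similarity operation must then handle.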
More preferably, the data memory module 15 can also assign a unique identifier to each piece of stored identifying information. This identifier can be used not only to query the identifying information of each user account from the database but also to indicate whether the user account was registered by an individual user or by an enterprise user.
More preferably, the data memory module 15 is also configured to bind together the multiple user accounts registered by the same user and to store the binding information. It may also record, in the form of log information, the operation results and the operation process of the manual identification module 16 on the locally stored identification judgment results.
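One way to build the binding information from pairwise results is a union-find structure, sketched below. Treating "registered by the same user" as transitive (if A matches B and B matches C, all three are bound) is an assumption of this example, as are the function and variable names.

```python
def bind_accounts(account_ids, same_user_pairs):
    """Group accounts into bound sets from pairwise same-user decisions."""
    parent = {account: account for account in account_ids}

    def find(x):
        # Follow parent links to the group root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in same_user_pairs:
        parent[find(a)] = find(b)

    groups = {}
    for account in account_ids:
        groups.setdefault(find(account), set()).add(account)
    return sorted(sorted(group) for group in groups.values())


bindings = bind_accounts(["a", "b", "c", "d"], [("a", "b"), ("b", "c")])
```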
The embodiment of the present application provides a method and a device for identifying user accounts. The identifying information of multiple user accounts is gathered, and the identifying information comprises preset critical fields and model fields. When the identifying information of any two user accounts contains at least one critical field with identical content, the two user accounts are determined to have been registered by the same user; otherwise, the content similarity of the model fields of the two user accounts is computed, and whether the two user accounts were registered by the same user is judged from the resulting content similarity. This solves the problem that it cannot be effectively identified whether multiple user accounts were registered by the same user. Furthermore, the technical scheme of the present application makes it possible to manage centrally the user accounts registered by the same user, which effectively improves the efficiency of user account management; it also makes it possible to analyze and track user behavior effectively and to push useful information accurately to the user accounts that a user has registered.
Obviously, those skilled in the art can make various changes and modifications to the present application without departing from its spirit and scope. Thus, if these changes and modifications fall within the scope of the claims of the present application and their technical equivalents, the present application is also intended to include them.

Claims (6)

1. A method for identifying user accounts, characterized in that it comprises the following steps:
A server gathers the identifying information of user accounts, the identifying information comprising preset critical fields and model fields;
For the identifying information of any two user accounts, the server judges whether the content of at least one critical field is identical in the identifying information of the two user accounts;
If so, it is determined that the two user accounts were registered by the same user;
Otherwise, the server determines the content similarity of the model fields in the identifying information and judges whether the content similarity of the model fields in the identifying information of the two user accounts meets a set condition; if so, it is determined that the two user accounts were registered by the same user; otherwise, it is determined that the two user accounts were registered by different users;
Determining the content similarity of the model fields in the identifying information comprises:
Determining the weight of each model field; specifically, using an analytic hierarchy process model, the weight of each model field is obtained through the steps of building a hierarchical structure model, constructing a pairwise comparison matrix, computing the weight vector, and performing a consistency check;
Computing the similarity between the contents of same-named model fields in the identifying information of the two user accounts, and summing the results weighted by the weights of the corresponding model fields to obtain the content similarity of the model fields;
Wherein the identifying information comprises at least one of the following:
Registration information supplied when the user account was registered, information obtained from a third party, and information produced while the user account is in use;
And, where a same-named field of the same user account has different contents, determining the weight of the source of each content and retaining the content provided by the source with the higher weight, or retaining all of the differing contents that the various sources provide for the same field as the content of that field.
2. The method for identifying user accounts as claimed in claim 1, characterized in that the content similarity of the model fields in the identifying information of two user accounts is determined in the following manner:
Determining the weights of the registration information, the information obtained from a third party, and the information produced while the user account is in use within the identifying information;
Computing the similarity between the contents of same-named model fields in the identifying information of the two user accounts, and determining the weight of the kind of information to which each compared content belongs;
Summing the results weighted by the weights of the kinds of information to which the compared contents belong, to obtain the content similarity of the model fields.
3. A server, characterized in that it comprises:
A data acquisition module, configured to gather the identifying information of user accounts, the identifying information comprising preset critical fields and model fields;
An identification judgment module, configured to receive the identifying information of user accounts sent by the data acquisition module and, for the identifying information of any two user accounts, to judge whether the content of at least one critical field is identical in the identifying information of the two user accounts; if so, to determine that the two user accounts were registered by the same user; otherwise, to judge whether the content similarity of the model fields in the identifying information of the two user accounts meets a set condition; if so, to determine that the two user accounts were registered by the same user; otherwise, to determine that the two user accounts were registered by different users;
A first similarity computing module, configured to determine the weight of each model field, to compute the similarity between the contents of same-named model fields in the identifying information of two user accounts, to sum the results weighted by the weights of the corresponding model fields, and to send the resulting content similarity of the model fields to the identification judgment module; specifically, using an analytic hierarchy process model, the weight of each model field is obtained through the steps of building a hierarchical structure model, constructing a pairwise comparison matrix, computing the weight vector, and performing a consistency check;
Wherein the identifying information of user accounts collected by the data acquisition module specifically comprises at least one of the following:
Registration information supplied when the user account was registered, information obtained from a third party, and information produced while the user account is in use;
And, where a same-named field of the same user account has different contents, determining the weight of the source of each content and retaining the content provided by the source with the higher weight, or retaining all of the differing contents that the various sources provide for the same field as the content of that field.
4. The server as claimed in claim 3, characterized in that it further comprises:
A second similarity computing module, configured to determine the weights of the registration information, the information obtained from a third party, and the information produced while the user account is in use within the identifying information; to compute the similarity between the contents of same-named model fields in the identifying information of two user accounts; to determine the weight of the kind of information to which each compared content belongs; and to sum the results weighted by those information weights to obtain the content similarity of the model fields;
The identification judgment module is specifically configured to judge, according to the content similarity obtained by the second similarity computing module, whether the content similarity of the model fields in the identifying information of the two user accounts meets the set condition.
5. The server as claimed in any one of claims 3 to 4, characterized in that it further comprises:
A data memory module, configured to store the identifying information of each user account collected by the data acquisition module;
A manual identification module, configured to provide an operation interface for receiving manual intervention information and to use the received manual intervention information to re-judge the recognition result obtained by the identification judgment module.
6. The server as claimed in claim 5, characterized in that
The data memory module is also configured to bind together the multiple user accounts registered by the same user and to store the binding information.
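The weight determination named in claims 1 and 3 (pairwise comparison matrix, weight vector, consistency check) can be illustrated with a small analytic hierarchy process sketch. The power-iteration approximation of the principal eigenvector, the commonly tabulated random-index values, the conventional 0.1 acceptance limit for the consistency ratio, and the example matrix are all assumptions of this illustration; the claims do not fix them.

```python
# Commonly tabulated random consistency index values for matrix sizes 1 to 9.
RANDOM_INDEX = {1: 0.0, 2: 0.0, 3: 0.58, 4: 0.90, 5: 1.12, 6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}


def ahp_weights(matrix, iterations=100):
    """Return (weight vector, consistency ratio) for a pairwise comparison matrix."""
    n = len(matrix)
    weights = [1.0 / n] * n
    # Power iteration: repeatedly multiply by the matrix and renormalise to sum 1.
    for _ in range(iterations):
        product = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
        total = sum(product)
        weights = [value / total for value in product]
    # Estimate the largest eigenvalue from the ratios (A w)_i / w_i.
    product = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(product[i] / weights[i] for i in range(n)) / n
    consistency_index = (lambda_max - n) / (n - 1) if n > 1 else 0.0
    random_index = RANDOM_INDEX[n]
    ratio = consistency_index / random_index if random_index else 0.0
    return weights, ratio


# Hypothetical judgments for three model fields: field 1 is twice as important
# as field 2 and six times as important as field 3.
comparison = [
    [1.0, 2.0, 6.0],
    [1 / 2, 1.0, 3.0],
    [1 / 6, 1 / 3, 1.0],
]
field_weights, consistency_ratio = ahp_weights(comparison)
is_consistent = consistency_ratio < 0.1
```

If the consistency ratio exceeds the limit, the usual remedy is to revise the pairwise judgments and recompute rather than to use the weights as they stand.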
CN201110363026.9A 2011-11-16 2011-11-16 A kind of recognition methods of user account and equipment Active CN103118043B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201110363026.9A CN103118043B (en) 2011-11-16 2011-11-16 A kind of recognition methods of user account and equipment
HK13108718.9A HK1181575A1 (en) 2011-11-16 2013-07-25 Method and device for identifying user account

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110363026.9A CN103118043B (en) 2011-11-16 2011-11-16 A kind of recognition methods of user account and equipment

Publications (2)

Publication Number Publication Date
CN103118043A CN103118043A (en) 2013-05-22
CN103118043B true CN103118043B (en) 2015-12-02

Family

ID=48416317

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110363026.9A Active CN103118043B (en) 2011-11-16 2011-11-16 A kind of recognition methods of user account and equipment

Country Status (2)

Country Link
CN (1) CN103118043B (en)
HK (1) HK1181575A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7725421B1 (en) * 2006-07-26 2010-05-25 Google Inc. Duplicate account identification and scoring
CN102200987A (en) * 2011-01-27 2011-09-28 北京开心人信息技术有限公司 Method and system for searching sock puppet identification number based on behavioural analysis of user identification numbers

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8306986B2 (en) * 2005-09-30 2012-11-06 American Express Travel Related Services Company, Inc. Method, system, and computer program product for linking customer information

Also Published As

Publication number Publication date
CN103118043A (en) 2013-05-22
HK1181575A1 (en) 2013-11-08

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1181575

Country of ref document: HK

C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1181575

Country of ref document: HK

TR01 Transfer of patent right

Effective date of registration: 20191212

Address after: P.O. Box 31119, grand exhibition hall, hibiscus street, 802 West Bay Road, Grand Cayman, Cayman Islands

Patentee after: Innovative advanced technology Co., Ltd

Address before: Cayman Islands Grand Cayman capital building, a four storey No. 847 mailbox

Patentee before: Alibaba Group Holding Co., Ltd.