WO2013104938A2 - Neural network and a method for teaching thereof - Google Patents

Neural network and a method for teaching thereof

Info

Publication number
WO2013104938A2
Authority
WO
WIPO (PCT)
Prior art keywords
training
image
sub
neural network
layer
Application number
PCT/HU2013/000006
Other languages
French (fr)
Other versions
WO2013104938A3 (en)
Inventor
Gábor BAYER
Original Assignee
77 Elektronika Muszeripari Kft.
Application filed by 77 Elektronika Muszeripari Kft. filed Critical 77 Elektronika Muszeripari Kft.
Publication of WO2013104938A2 publication Critical patent/WO2013104938A2/en
Publication of WO2013104938A3 publication Critical patent/WO2013104938A3/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/69 Microscopic objects, e.g. biological cells or cellular parts
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454 Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]

Abstract

The invention is, on the one hand, a method for training a neural network - the neural network being adapted for generating probability maps - each associated with a respective category of elements in an image - on the basis of visual information detectable in the image of a body fluid sample, the probability map comprising presence probability values of an element of the given category, and - the neural network being a convolution neural network and comprising an input layer (20) detecting inputs from the image, at least one intermediate layer (21) and an output layer (23) providing the presence probability values and comprising sub-layers in a number corresponding to the number of the categories. The method is characterised by - carrying out the training by means of training images characterised by an element associated with only up to one category and being smaller than the image, for a sub-network of the neural network, the sub-network corresponding to the size of the training image and comprising neurons (24) of the input layer detecting inputs from the training image, and neurons (24) - linked to these directly or indirectly by weights - of the at least one intermediate layer and the output layer, - and generating the complete neural network with weights (25) resulting from the training of the sub-network. On the other hand, the invention is a neural network trained by the above method.

Description

NEURAL NETWORK AND A METHOD FOR TRAINING THEREOF
TECHNICAL FIELD
The invention relates to a method for training a neural network and to a neural network trained with the method.
BACKGROUND ART
There are numerous prior art methods for training neural networks. It is known that the training methods have to enable the neural network to identify the underlying rules in the course of a large number of training steps and to provide results according to these rules.
Neural networks are especially widely used for example in image recognition methods, for the recognition and categorisation of image elements representing objects, and furthermore preferably for the automatic specification of the number of elements belonging to each category. These methods and apparatuses can be applied especially preferably in medical and diagnostic devices, e.g. in the automatic analysis of body fluids such as urine or blood. In these methods a large number of training images are fed to a neural network dimensioned in accordance with the size of the images to be recognised and the information therein, and the weights linking the various neurons of the neural network are varied as a function of the correctness of the results provided by the neural network. One of the best known methods for tuning weights between neurons is so-called back propagation. This training is carried out in a known manner, and in this respect a detailed description can be found in the literature. The process of training can be summarised in that the neural network is 'awarded' if it provides a correct result, and it is 'punished' if it outputs an incorrect one. Methods to train neural networks are disclosed, e.g. in the patent documents US 5,052,043, US 5,796,410, US 5,903,884, US 6,876,966 B1, US 7,130,776 B2, US 7,286,687 B2, US 7,418,128 B2, US 7,480,409 B2 and US 2009/0196493 A1. In the known solutions, training of the neural network is performed with training images of the same size as the images to be analysed. However, in the case of large images, the training procedure is extremely time consuming, and an extremely large number of weight values have to be determined during the procedure. The known methods are less suitable for image recognition purposes in which it is necessary to categorise elements appearing in the images on the basis of visual information detectable in the images. This is because, in the case of training with full size images, the neural network learns primarily not the particular categories, but those images in which various category elements may be present simultaneously. As a result of all these factors, with the known solutions, the training process for categorisation-based image recognition is less efficient and less tangible or controllable.
A further disadvantage of the known solutions is that they do not comprise steps by which wrong recognitions resulting from uncategorised elements could be managed.
During experiments it has been recognised that if a so-called convolution neural network is applied for categorisation based image recognition, it is sufficient to perform the training process only for an appropriate sub-network (i.e. network part) of the complete neural network, using training images which are smaller than the images to be analysed, and which represent only up to one category of the elements. According to the invention, 'convolution neural network' means a special type of neural network in which, except for the input layer, each neuron in each layer is in contact with a corresponding neuron block of the previous layer, which neuron block extends over the complete thickness of the layer (in the direction z). In addition, in the sub-layers of each layer (in levels consisting of neurons), each neuron is linked with the same weights to the corresponding neurons in the neuron block of the previous layer. The neuron blocks linked to adjacent neurons may in the given case overlap.
DESCRIPTION OF THE INVENTION
The object of the invention is to provide a training method for neural networks, especially in applications where it is required to categorise the elements appearing in the images. The object of the invention is to provide a training method, which is free of the disadvantages of the prior art solutions to the greatest possible extent. A further object of the invention is to provide a controllable and manageable training method for neural networks adapted for categorisation tasks. An object of the invention furthermore is to provide a training method which results in faster converging and faster adjustment of the weights between the neurons than prior art techniques. An object of the invention is furthermore to provide neural networks generated with the training method mentioned above.
The method according to the invention is defined by the features of claim 1, and the neural network according to the invention is described by the features of claim 9. Preferred embodiments of the invention are defined in the dependent claims.
BRIEF DESCRIPTION OF THE DRAWINGS
Preferred embodiments of the invention are described below by way of example with reference to the following drawings, where
Fig. 1 is a schematic view of generating a probability map with a neural network according to the invention,
Fig. 2 is a schematic view of the structure of an exemplary convolution neural network according to the invention,
Fig. 3 is a schematic view of the output layer of an exemplary sub-network according to the invention, and
Fig. 4 is a schematic view of the structure of an exemplary sub-network according to the invention.
MODES FOR CARRYING OUT THE INVENTION
The neural network and training method according to the invention will hereinafter be described primarily for the purpose of body fluid analysis, e.g. urine analysis. Such an analysis is an extremely useful practical example of an application where automatic categorisation of elements appearing in photos of samples is required.
According to the invention, an element shown in an image means a visual appearance of any object which can be recognised and categorised. In the photos of a urine sample, objects/elements to be categorised can be by way of example the following:
- bacterium (BAC),
- squamous epithelial cell (EPI),
- non-squamous epithelial cell (NEC),
- red blood cell (RBC),
- white blood cell (WBC), and
- background (BKGND).
Of course, in addition to those listed above, further elements/objects to be categorised may be discovered in the photos of urine samples. Including the background, typically 10 to 30 element categories can be set up altogether.
As shown in Fig. 1, probability maps 11 1-n belonging to predetermined element categories are generated on the basis of the visual information detectable in the image 10. Each probability map 11 shows the presence probability distribution of an element of the given category.
The probability maps 11 1-n may also be generated with the same resolution as that of the image 10. However, in the course of elaborating the invention, it has been recognised that, for example in the case of the high resolution images available in medical diagnostics, it would be extremely time consuming to analyse full resolution probability maps 11 1-n. It has been found that it is sufficient to generate probability maps 11 1-n of lower resolution than that of the original image 10, in a way that several pixels of the image 10 are associated with each of the probability values of the probability maps 11 1-n. In a preferred realisation, a raster point of the probability map is assigned to 4x4 pixels of the image 10. This probability value represents the presence probability of an element of the given category regarding the given 4x4 pixels of the image 10. In the course of elaborating the invention it has been proven that a reduction of the resolution does not deteriorate the accuracy of categorisation, because such combined probability values also appropriately represent the presence probability in the given detail of the image, in view of the fact that the typical particle size or object size is larger than 4x4 pixels. The probability maps 11 1-n can be presented as probability images, each pixel of which carries visual information according to the magnitude of the probability value, but they can also be considered as matrices, each value of which corresponds to the probability value present in the given position.
In an exemplary preferred embodiment, images 10 of 1280x960 resolution and probability maps 11 1-n of 320x240 resolution are applied. For generating the probability maps 11 1-n, a convolution neural network trained according to the invention is used. The neural network analyses the visual information appearing in the image and, on the basis of examining this visual information, determines the probability values in each position for the various categories.
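A minimal Python sketch of this fixed 4x4 correspondence between image pixels and probability-map raster points, assuming the 1280x960 and 320x240 figures above; the helper names are illustrative, not taken from the patent:

```python
# Mapping between image pixels and probability-map raster points,
# following the 4x4 reduction described above.

IMAGE_W, IMAGE_H = 1280, 960   # resolution of the image 10
BLOCK = 4                      # image pixels per raster point in x and y

MAP_W, MAP_H = IMAGE_W // BLOCK, IMAGE_H // BLOCK   # 320 x 240

def raster_point_of_pixel(px: int, py: int) -> tuple[int, int]:
    """Probability-map position whose value covers image pixel (px, py)."""
    return px // BLOCK, py // BLOCK

def pixel_block_of_raster_point(rx: int, ry: int) -> tuple[slice, slice]:
    """The 4x4 pixel area of the image represented by raster point (rx, ry)."""
    return (slice(rx * BLOCK, (rx + 1) * BLOCK),
            slice(ry * BLOCK, (ry + 1) * BLOCK))

assert (MAP_W, MAP_H) == (320, 240)
assert raster_point_of_pixel(1279, 959) == (319, 239)
```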
The parameters of an exemplary convolution neural network, applicable for generating probability maps according to Fig. 1 and presented in Fig. 2, are given in the following Table 1.
[Table 1 - parameters of the exemplary convolution neural network; reproduced only as an image in the original document]
On the basis of the visual information detectable in the image 10, the neural network is adapted for generating probability maps 11 1-n associated with each of the categories of elements in the image 10, each probability map 11 comprising presence probability values of an element of the given category. The neural network is a convolution neural network, which comprises an input layer 20 detecting inputs from the image 10, at least one intermediate layer 21, and an output layer 23 comprising sub-layers of a number corresponding to the number of the categories and providing the presence probability values.
It can be seen that the area of the input layer 20 is slightly larger than the resolution of the image 10. The reason for this is that, in order to reduce the edge effect, a mirroring is preferably applied at the edge of the image 10, through which the detrimental edge effects can be minimised over the examination range. The exemplary convolution neural network comprises a single intermediate layer 21. It can be seen that in the direction from the input layer 20 towards the output layer 23 the thickness of the layers increases but their area gradually decreases, and hence the output layer 23 reaches the size of the probability maps 11 in the directions x - y and the number of the probability maps 11 1-n in the direction z.
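A minimal sketch of such edge mirroring, assuming a NumPy image array; the pad width of 3 (half of the 6-wide neuron field of vision) is an assumption, since Table 1 with the exact layer sizes is reproduced only as an image:

```python
import numpy as np

# Mirror the image at its edges so the input layer can be slightly larger
# than the image itself, reducing the edge effect described above.
image = np.random.rand(960, 1280)            # placeholder grey-scale image 10
pad = 3                                      # assumed: half of the 6-wide field of vision
padded = np.pad(image, pad_width=pad, mode='reflect')
print(padded.shape)                          # (966, 1286): slightly larger than the image
```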
The resolution of the input layer 20 of the neural network is matched with the resolution of the image 10 to be analysed. The input can be the original grey-scale or colour image 10, but preferably, in addition to the original image 10, transformed images generated therefrom may also be inputted to the neural network. By way of example, the input layer 20 consists of three sub-layers; they receive on the one hand the pixels of the image 10 to be analysed, and on the other hand the pixels of the two transformed images generated therefrom by means of transformation. The image to be analysed or the image variants of the same field of vision may also be recorded by various technological methods. Such methods can be bright field, dark field or phase contrast microscopy, polarised and non-polarised photography techniques, images taken in several focal planes, colour or black-and-white images, RGB colour images, images taken by illumination from several angles, images generated by detection, or holographic microscopy. Neurons 24 of the intermediate layer 21 following the input layer 20 'see' a 6x6x3 (x, y, z) size neuron block of the input layer 20, i.e. they are linked to the neurons 24 of this neuron block via weights 25. The intermediate layer 21 following the input layer 20 consists of eight sub-layers. In each sub-layer, the adjacent neurons 24 are linked with identical sets of weights to the neuron blocks in the input layer 20 seen by each neuron 24. In the various sub-layers of the intermediate layer 21, the identically positioned neurons 24 see the same neuron block in the input layer 20, but their sets of weights are different. In the embodiment, regarding the intermediate layer 21, the lateral overlap of the neuron blocks in the input layer 20 seen by the neurons 24 is four neurons, i.e. the neuron blocks seen by the neurons 24 located side by side overlap by four neurons in the directions x and y. The output layer 23 already has 15 sub-layers, and its lateral (x - y) extension is smaller than that of the intermediate layer 21. The neurons 24 in the output layer 23 see neuron blocks of 6x6x8 size in the intermediate layer 21, with a lateral overlap of four neurons. In Table 1, the number of weights describing the complete network is presented for each layer in the last column.
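Before turning to the layer-size formula below, a minimal NumPy sketch of the structure just described: each layer scans a 6x6 window over the full depth of the previous layer with a step of 6 - 4 = 2 (the four-neuron overlap), and each sub-layer reuses one set of weights at every x-y position. The 3/8/15 sub-layer counts follow the text; the tanh activation and the 24x24 input size are assumptions, since the tables are reproduced only as images:

```python
import numpy as np

def conv_layer(prev, weights, stride=2):
    """prev: (z, y, x); weights: (sublayers, z, 6, 6), shared over all positions."""
    n_sub, z, fy, fx = weights.shape
    out_y = (prev.shape[1] - fy) // stride + 1
    out_x = (prev.shape[2] - fx) // stride + 1
    out = np.empty((n_sub, out_y, out_x))
    for s in range(n_sub):                     # one weight set per sub-layer
        for i in range(out_y):
            for j in range(out_x):             # overlapping 6x6 neuron blocks
                block = prev[:, i*stride:i*stride+fy, j*stride:j*stride+fx]
                out[s, i, j] = np.tanh(np.sum(block * weights[s]))  # assumed activation
    return out

rng = np.random.default_rng(0)
inp = rng.random((3, 24, 24))                  # 3 input sub-layers (image + 2 transforms)
w1 = rng.standard_normal((8, 3, 6, 6)) * 0.1   # intermediate layer 21: 8 sub-layers
w2 = rng.standard_normal((15, 8, 6, 6)) * 0.1  # output layer 23: 15 sub-layers
out = conv_layer(conv_layer(inp, w1), w2)
print(out.shape)                               # (15, 3, 3)
```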
As shown in Table 1 , the dimensions of the consecutive layers in the directions x and y can be calculated from the width of the field of vision of the neurons 24 and from the overlap. If, for example, the width of the field of vision of the neuron is 6 and the overlap is 4, then width x is the following in the next layer:
x(i+1) = (x(i) - 6) / (6 - 4) + 1.
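The same recursion as a small helper; the denominator 6 - 4 is the step (stride) between adjacent neuron blocks. The 24-wide example is hypothetical but consistent with the 3x3 output layer 23' of the sub-network described later:

```python
def next_width(x: int, fov: int = 6, overlap: int = 4) -> int:
    """Width of the following layer for a given field of vision and overlap."""
    stride = fov - overlap                 # step between adjacent neuron blocks
    return (x - fov) // stride + 1

# Hypothetical sizes consistent with the formula: a 24-wide input layer
# shrinks to 10 and then to 3, matching a 3x3 output layer 23'.
w = 24
for _ in range(2):
    w = next_width(w)
    print(w)                               # 10, then 3
```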
The training method according to the invention is partly based on the characteristic that convolution neural networks have the same sets of weights in the directions x and y in each sub-layer. Therefore, a more efficient training can be achieved if these sets of weights are determined by means of training with images smaller than the image to be analysed, for a sub-network adjusted to the size of the training image, which training images are characterised by an element associated with only up to one type of category. According to the invention, this means that in the training image essentially only one element associated with one type of category, or - according to the description below - a so-called uncategorised element, appears. In this way, it is not the images to be analysed but the categories that are taught to the convolution neural network.
The sub-network comprises the neurons sensing the inputs from the training image, and the neurons of the at least one intermediate layer and the output layer directly and indirectly linked thereto by weights. Once the training is completed, the trained sub-network can simply be extended in the directions x and y by multiplying the neurons and the sets of weights. Of course, the resizing is to be carried out in a way that the number of sub-layers and the sets of weights remain unchanged.
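Continuing the structure sketch above (and reusing its conv_layer, w1, w2 and rng), this extension needs no new weights: because the weight sets are shared over all x-y positions, applying the same trained weights to a larger input directly yields a larger output stack.

```python
# Hypothetical demonstration: weights trained on a 24x24 sub-network input
# are applied unchanged to a larger 64x64 input; only the x-y extent grows.
full = conv_layer(conv_layer(rng.random((3, 64, 64)), w1), w2)
print(full.shape)   # (15, 13, 13) instead of (15, 3, 3)
```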
The parameters of the sub-network used in the training are to be selected in a way that they are adjusted to the inputs (number of sub-layers of the input layer), the outputs (number of sub-layers of the output layer) and the network structure intended to be accomplished (number of intermediate layers and number of sub-layers thereof). In addition, it is advisable to adjust the parameters of the sub-network input layer 20', the intermediate layer 21' and the output layer 23' to the size of the training images. It is also advisable if the area (dimensions in the directions x and y) of the output layer 23' of the sub-network is relatively small; preferably, an output layer 23' of an area of 1x1, 3x3, 5x5, etc. neurons (dimensions x - y) is applied. An output layer 23' of an area larger than one neuron is advisable, because the recognition process then becomes indifferent to lateral displacements in the training images, e.g. in the current case to four pixel displacements. This is because, in the case of such an output layer 23', the expectation is that the maximum of the output sub-layer (i.e. the highest one of the respective presence probability values) associated with the given category should be larger than the maxima of the other sub-layers. Therefore, such an output layer 23' enables a more flexible training which is indifferent to lateral image displacements. A schematic view of an output layer is shown in Fig. 3. In the figure, the first sub-layer of the output layer 23' belongs e.g. to the category of red blood cells, and the second sub-layer to the category of white blood cells. In an especially preferred embodiment of the invention, training images characterised by an uncategorised element are also used to perform the training, where the weights are adjusted in a way that each sub-layer of the output layer 23' of the sub-network provides as low a maximum of presence probability as possible. In such a way, the mistuning of the training process by the uncategorised elements can be avoided efficiently, and false network responses given to uncategorised and unknown elements are also eliminated.
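This maximum-based expectation can be written down as a simple training criterion. The sketch below is an assumption about how the stated expectation might be scored (the patent names the expectation, not a concrete loss function); the 15-category output shape follows the earlier example:

```python
import numpy as np

def target_maxima(n_categories: int, category: int | None) -> np.ndarray:
    """High target for the true category's sub-layer, low for all others;
    all-low targets for an uncategorised training image (category=None)."""
    t = np.zeros(n_categories)
    if category is not None:
        t[category] = 1.0
    return t

def maxima_loss(output: np.ndarray, category: int | None) -> float:
    """Squared error between per-sub-layer maxima and their targets."""
    maxima = output.reshape(output.shape[0], -1).max(axis=1)  # one max per sub-layer
    return float(((maxima - target_maxima(output.shape[0], category)) ** 2).sum())

out = np.random.rand(15, 3, 3)          # output layer 23' of the sub-network
print(maxima_loss(out, category=2))     # categorised training image
print(maxima_loss(out, category=None))  # uncategorised training image
```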
Fig. 4 shows a schematic design of the sub-network to be trained, which sub-network consists of neurons 24 linked directly or indirectly to the neurons 24 of the output layer 23' according to Fig. 3. The parameters of the sub-network are given in the following Table 2.
[Table 2 - parameters of the exemplary sub-network; reproduced only as an image in the original document]
As shown in Table 2, the number of layers in the sub-network and in the neural network to be generated for the analysis is identical, the field of view of the neurons in each sub-layer and the size of the overlaps are identical and, of course, the number of weights associated with the various layers (z × x' × y' × z') is also the same. On the basis of the field of view of the neurons and the overlaps, the extensions of the sub-network layers in the directions x and y are obtained.
For training the sub-network according to Fig. 4, an image database comprising categorised images is compiled. In the course of this, images are produced with a size matching the sub-network input layer in the directions x and y. Preferably, the lateral size of the categorised image is slightly more than √2 times the x - y extension of the sub-network input layer 20'. Hence it can be avoided that rotational transformations of the categorised image result in empty sections in the detected area, and therefore trouble-free application of a rotational transformation at any angle is made possible.
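As arithmetic, with the hypothetical 24-neuron input width used in the sketches above (the real value sits in the unreproduced Table 2): a rotated square still fully covers the input window only if the crop side exceeds √2 times the window side.

```python
import math

# Diagonal-safe crop size for arbitrary-angle rotation of the categorised image.
input_side = 24                                     # assumed sub-network input width
crop_side = math.ceil(math.sqrt(2) * input_side)    # 34 > sqrt(2) * 24 ≈ 33.94
print(crop_side)
```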
These categorised images are preferably cropped from sample images to be analysed in a way that in the centre thereof only an element of up to one category is found, either by retouching or by keeping a specified distance from the other elements (an uncategorised element can therefore also be in the centre of the image in the given case). Accordingly, the training images characterised by an element associated with a single category are either retouched training images which comprise only the element of the given category in the centre, or natural training images which comprise the element of the given category in the centre and do not have an element associated with a different category in their predetermined environment. These categorised images are preferably produced in a way that they are examined individually by an expert, who tags the area covered by the relevant element with a label corresponding to the given category. The twenty typical categories applied for the urine analysis mentioned as an example are represented by the output layers 23, 23' comprising twenty sub-layers as shown in the figures.
Producing images categorised as described above is very time consuming, and in practical applications it is not possible to create more than a few tens of thousands of categorised images. However, for training the sub-network, far more, even as many as 10 to 100 million, training images are required according to experience. This problem is handled according to the invention by generating the training images by random transformation from the categorised images. This transformation is carried out in real time between the selection and the feeding in of the categorised image. The transformation can comprise reflection, rotation, expansion, shrinking and/or noise addition in one or more spatial frequencies, or any combination thereof. It is advisable to set the transformation parameters empirically. It has been proven by the experiments that it is advisable to determine the transformation parameters also in a category-specific way.
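A minimal sketch of such an on-the-fly transformation, assuming NumPy arrays and SciPy's ndimage for rotation and scaling; the parameter ranges are illustrative (the patent sets them empirically and per category), and plain white noise stands in for the multi-frequency noise mentioned above:

```python
import numpy as np
from scipy.ndimage import rotate, zoom

def random_training_image(crop: np.ndarray, out_side: int, rng) -> np.ndarray:
    """Generate one training image from a categorised crop, in real time."""
    img = crop
    if rng.random() < 0.5:
        img = img[:, ::-1]                                   # reflection
    img = rotate(img, rng.uniform(0, 360), reshape=False)    # rotation, any angle
    img = zoom(img, rng.uniform(0.9, 1.1))                   # expansion / shrinking
    img = img + rng.normal(0, 0.02, img.shape)               # simplified noise addition
    y0 = (img.shape[0] - out_side) // 2                      # centre crop to input size
    x0 = (img.shape[1] - out_side) // 2
    return img[y0:y0 + out_side, x0:x0 + out_side]

rng = np.random.default_rng(0)
crop = rng.random((34, 34))            # categorised image, sqrt(2)-oversized
patch = random_training_image(crop, out_side=24, rng=rng)
print(patch.shape)                     # (24, 24)
```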
When preparing the image database according to the invention, special attention must be paid to its consistency, i.e. it should not comprise contradictions. Contradictions may make it difficult to train the neural network and may render the process unstable.
The training method according to the invention is preferably also characterised by applying roughly the same number of training images for each category. It is a characteristic of categorisation based image recognition that particular categories appear extremely frequently and other categories, in the given case, extremely rarely in the images to be analysed. However, training must not follow this appearance frequency, because if a rarely appearing element/category is only rarely shown to the neural network, the network will neglect and forget the given category. Therefore, it is important that the categories appear in training with roughly the same weight.
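One simple way to realise this equal weighting, sketched under the assumption of a pooled image database (category names reuse the urine-analysis examples; file names are hypothetical): sample the category uniformly first, then an image within it.

```python
import random

database = {
    "RBC":   ["rbc_0001.png", "rbc_0002.png"],      # frequent in real samples
    "NEC":   ["nec_0001.png"],                      # rare in real samples
    "BKGND": ["bkgnd_0001.png", "bkgnd_0002.png"],
}

def next_training_sample(rng: random.Random) -> tuple[str, str]:
    """Pick a category uniformly, then a categorised image within it, so rare
    categories appear as often in training as frequent ones."""
    category = rng.choice(sorted(database))
    return category, rng.choice(database[category])

rng = random.Random(0)
print([next_training_sample(rng)[0] for _ in range(6)])
```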
The invention has been described above for the purpose of urine analysis regarding the images 10 of a urine sample, but of course this does not restrict the applicability of the invention to this technical field. The element recognition and categorisation according to the invention may preferably also be used in the further applications mentioned in the introduction, where the need for image recognition and categorisation arises.
The invention is not limited to the preferred embodiments described in detail above; further variants and modifications are possible within the scope of protection determined by the claims. For example, the invention is not only suitable for processing two-dimensional images, but may also be used for the analysis of images generated by a three-dimensional imaging process. In this case the probability maps are preferably also three-dimensional maps.

Claims

1. A method for training a neural network
- the neural network being adapted for generating probability maps (11 1-n) - each associated with a respective category of elements in an image (10) - on the basis of visual information detectable in the image (10) of a body fluid sample, the probability map (11) comprising presence probability values of an element of the given category, and
- the neural network being a convolution neural network and comprising an input layer (20) detecting inputs from the image (10), at least one intermediate layer (21) and an output layer (23) providing the presence probability values and comprising sub-layers in a number corresponding to the number of the categories,
characterized by
- carrying out the training by means of training images characterised by an element associated with only up to one category and being smaller than the image (10), for a sub-network of the neural network, the sub-network corresponding to the size of the training image and comprising neurons (24) of the input layer detecting inputs from the training image, and neurons (24) - linked to these directly or indirectly by weights - of the at least one intermediate layer and the output layer,
- and generating the complete neural network with weights (25) resulting from the training of the sub-network.
2. The method according to claim 1, characterised by adjusting the weights (25) during the training in a way that the highest presence probability value appears on that particular sub-layer of the sub-network output layer (23') which corresponds to the category of the relevant training image.
3. The method according to claim 1 or claim 2, characterised by performing the training also with training images characterised by uncategorised elements, wherein the weights are adjusted in a way that each sub-layer of the sub-network output layer (23') provides as low a maximum of presence probability values as possible.
4. The method according to any of claims 1 to 3, characterised in that the input layer (20) comprises a sub-layer receiving pixels of the image (10), and a further one or more sub-layers receiving pixels of one or more transformed images generated from the image (10).
5. The method according to any of claims 1 to 4, characterised in that the training images characterised by an element associated with one category are retouched training images comprising in the centre only an element of the given category, or are natural training images comprising in the centre an element of the given category and not comprising an element associated with a different category in a predetermined environment thereof.
6. The method according to any of claims 1 to 5, characterised by generating each training image from an image database comprising categorised images in a way that the categorised image selected from the image database is subjected to a random transformation, the random transformation preferably comprising reflection, rotation, expansion, shrinking and/or noise addition in one or more spatial frequencies, or any combination thereof.
7. The method according to any of claims 1 to 6, characterised by applying training images of each category to a roughly equal extent during the training.
8. The method according to claim 1 , characterised in that the probability map (11) has a smaller resolution than that of the image (10), and each probability value thereof is associated with a multi-pixel area of the image (10).
9. A neural network for application in an apparatus for body fluid analysis, characterised in that it is trained with a method according to any of claims 1 to 8.
PCT/HU2013/000006 2012-01-11 2013-01-09 Neural network and a method for teaching thereof WO2013104938A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
HUP1200018 2012-01-11
HU1200018A HUP1200018A2 (en) 2012-01-11 2012-01-11 Method of training a neural network, as well as a neural network

Publications (2)

Publication Number Publication Date
WO2013104938A2 true WO2013104938A2 (en) 2013-07-18
WO2013104938A3 WO2013104938A3 (en) 2013-11-07

Family

ID=89990573

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/HU2013/000006 WO2013104938A2 (en) 2012-01-11 2013-01-09 Neural network and a method for teaching thereof

Country Status (2)

Country Link
HU (1) HUP1200018A2 (en)
WO (1) WO2013104938A2 (en)

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5052043A (en) 1990-05-07 1991-09-24 Eastman Kodak Company Neural network with back propagation controlled through an output confidence measure
US5796410A (en) 1990-06-12 1998-08-18 Lucent Technologies Inc. Generation and use of defective images in image analysis
US5903884A (en) 1995-08-08 1999-05-11 Apple Computer, Inc. Method for training a statistical classifier with reduced tendency for overfitting
US6876966B1 (en) 2000-10-16 2005-04-05 Microsoft Corporation Pattern recognition training method and apparatus using inserted noise followed by noise reduction
US7130776B2 (en) 2002-03-25 2006-10-31 Lockheed Martin Corporation Method and computer program product for producing a pattern recognition training set
US7286687B2 (en) 2002-10-01 2007-10-23 Siemens Ag Method for generating learning and/or sample probes
US7418128B2 (en) 2003-07-31 2008-08-26 Microsoft Corporation Elastic distortions for automatic generation of labeled data
US7480409B2 (en) 2004-08-13 2009-01-20 Fujitsu Limited Degraded character image generation method and apparatus
US20090196493A1 (en) 2004-10-07 2009-08-06 Bernard Widrow Cognitive Memory And Auto-Associative Neural Network Based Search Engine For Computer And Network Located Images And Photographs

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104751162A (en) * 2015-04-03 2015-07-01 哈尔滨工业大学 Hyperspectral remote sensing data feature extraction method based on convolution neural network
WO2017051943A1 (en) * 2015-09-24 2017-03-30 주식회사 뷰노코리아 Method and apparatus for generating image, and image analysis method
KR20180004824A (en) * 2015-09-24 2018-01-12 주식회사 뷰노 Image generation method and apparatus, and image analysis method
KR101880035B1 (en) * 2015-09-24 2018-07-19 주식회사 뷰노 Image generation method and apparatus, and image analysis method
CN105528638A (en) * 2016-01-22 2016-04-27 沈阳工业大学 Method for grey correlation analysis method to determine number of hidden layer characteristic graphs of convolutional neural network
CN105528638B (en) * 2016-01-22 2018-04-24 沈阳工业大学 The method that gray relative analysis method determines convolutional neural networks hidden layer characteristic pattern number
WO2018224852A2 (en) 2017-06-09 2018-12-13 77 Elektronika Műszeripari Kft. Combined bright-field and phase-contrast microscopy system and image processing apparatus equipped therewith
TWI717655B (en) * 2018-11-09 2021-02-01 財團法人資訊工業策進會 Feature determination apparatus and method adapted to multiple object sizes
WO2020139835A1 (en) * 2018-12-26 2020-07-02 The Regents Of The University Of California Systems and methods for two-dimensional fluorescence wave propagation onto surfaces using deep learning
US11946854B2 (en) 2018-12-26 2024-04-02 The Regents Of The University Of California Systems and methods for two-dimensional fluorescence wave propagation onto surfaces using deep learning
US11803963B2 (en) 2019-02-01 2023-10-31 Sartorius Bioanalytical Instruments, Inc. Computational model for analyzing images of a biological specimen
WO2020225580A1 (en) 2019-05-08 2020-11-12 77 Elektronika Műszeripari Kft. Image taking method, image analysis method, method for training an image analysis neural network, and image analysis neural network
WO2022108884A1 (en) * 2020-11-17 2022-05-27 Sartorius Bioanalytical Instruments, Inc. Computational model for analyzing images of a biological specimen
CN113420813A (en) * 2021-06-23 2021-09-21 北京市机械工业局技术开发研究所 Method for diagnosing state of particulate matter filter cotton of vehicle exhaust detection equipment
CN113420813B (en) * 2021-06-23 2023-11-28 北京市机械工业局技术开发研究所 Diagnostic method for particulate matter filter cotton state of vehicle tail gas detection equipment
CN117520753A (en) * 2024-01-05 2024-02-06 河北中体善建体育产业有限公司 Early warning system and method for ice and snow sports
CN117520753B (en) * 2024-01-05 2024-04-05 河北中体善建体育产业有限公司 Early warning system and method for ice and snow sports

Also Published As

Publication number Publication date
WO2013104938A3 (en) 2013-11-07
HUP1200018A2 (en) 2013-07-29

Similar Documents

Publication Publication Date Title
WO2013104938A2 (en) Neural network and a method for teaching thereof
CN110287932B (en) Road blocking information extraction method based on deep learning image semantic segmentation
US11644395B2 (en) Multi-spectral imaging including at least one common stain
CA2966555C (en) Systems and methods for co-expression analysis in immunoscore computation
US7949181B2 (en) Segmentation of tissue images using color and texture
EP1846869B1 (en) Classifying image features
CN107341488A (en) A kind of SAR image target detection identifies integral method
US20210321963A1 (en) Systems and methods for enhanced imaging and analysis
CN109345456B (en) Generation countermeasure network training method, image processing method, device, and storage medium
Ogura et al. Automatic particle pickup method using a neural network has high accuracy by applying an initial weight derived from eigenimages: a new reference free method for single-particle analysis
CN105917353A (en) Feature extraction and matching and template update for biometric authentication
CN110148162A (en) A kind of heterologous image matching method based on composition operators
WO1995034050A1 (en) Neural network for cell image analysis for identification of abnormal cells
US20060280348A1 (en) Method of screening cellular tissue
CN105894483B (en) A kind of multi-focus image fusing method based on multi-scale image analysis and block consistency checking
WO2020146437A1 (en) Image noise reduction using stacked denoising auto-encoder
CN110826467B (en) Electron microscope image reconstruction system and method thereof
CN109191434A (en) Image detecting system and detection method in a kind of cell differentiation
CN112215217B (en) Digital image recognition method and device for simulating doctor to read film
CN116612472B (en) Single-molecule immune array analyzer based on image and method thereof
CN112348059A (en) Deep learning-based method and system for classifying multiple dyeing pathological images
CN115546605A (en) Training method and device based on image labeling and segmentation model
Wong et al. Segmentation of additive manufacturing defects using U-Net
CN111915602B (en) Steel material tissue quantification method combining EBSD and deep learning method
CN113673556A (en) Hyperspectral image classification method based on multi-scale dense convolution network

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13707027

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
122 Ep: pct application non-entry in european phase

Ref document number: 13707027

Country of ref document: EP

Kind code of ref document: A2