REPORT ON PERCEPTRON ((Sanitized)AND CONFLEX(Sanitized)) CONCERNING AUTOMATED IMAGE RECOGNITION

Document Type:

CREST

Collection:

NGA Records (Formerly NIMA)

Document Number (FOIA) /ESDN (CREST):

CIA-RDP78B04770A002300030029-4

Release Decision:

RIPPUB

Original Classification:

Document Page Count:

114

Document Creation Date:

December 28, 2016

Document Release Date:

March 30, 2005

Sequence Number:

Case Number:

Publication Date:

April 2, 1963

Content Type:

File:

Attachment	Size
CIA-RDP78B04770A002300030029-4.pdf	8.49 MB

Body:

Approved For Release 2005/091.02,::,J0FrUp04770A0W00030029-4 NIMINtAND PPM SUBOCCT Director, JPXC Assistant for Plans end Development Report on PMPCMPTPOP and Image Recognition 2 April 1.963 oneerflin Aut 1. Selected neibers of the Plans and Development Staff were present rt briefinge on the 19th and 22nd of Perch. The following paragraphs contain abstracts of these briefings, a summary, and retcamendations for future action. 2. presented a briefine trn O9 to 1200 on 19 March 19b3 mom 38467. In addition to sber of the AND Staff, there were representatives from PIO, Ta), Air Force, Army and Navy Detachments, USPPIC, (ER, and Bu Peps. Security level -was unclasei a. PIRCUTRON is a means of automated recognition based on statisti(T1 separability in cognitive systems (1) It may be described as a "biological' computer system consist- of a sensory matrix coupled through ? complex continuously variable weighting system to a general purpose digital computer, programmed in a fashion which siwluutes brain mechanisms. develomental history began in 1958 eild on the basis of concepts originated by Extensive research has been performed since that time in the follow realms: (a) Implementation and evaluation of the PRACEPTRON concept. (b) Application of the PERCEPTRON to photo interpretation. (0) Application of the PLICEPTRON to character re-2ognition. (4) Preprocessing of photo reconnaissance date. This develOpment program has been characterised by fundamental inventi ottani, She process is deliberate, comprehensive and slow. This research Declass Review by NGA. Approved For Release 2005/05/02piA_-_REIPZ9p 4770A002300030029-4 25X1 25X1 ? 25X1 25X1 t?r---- Excluded lra, ::,113DaiLl dewnErzlia and declassifict,Iliz --- ?.. itmlnrjAL Approved For Release 2005/05fE rn uPumwrai504770A002300030029-4 -2- frame of reference for all development in this field.. There is *pil- e yet unreadhed stage in the development of the principle which must ed before the majority of developmental effort can be directed toward .pplication. The main effort of this program is now being directed toward eprocessine of the image. The processes of object detection, isolation and normalisation are being investigated through systems other than PERCEPTRON, which is used solely for the recognition process. This briefine was held tripe 1400 to MO, 22 March 1963/ I In addition to metbers of evelopment Staff,' Iof USNPIC anl of BuwePa 25X1 sent, Security level was unclassified. a. cox= I is said to be e "conditioned reflex Computer. (1) It may be described as a "biological" computer eystee consiet- ing of a sensory matrix, elements of which are systematically activated in a large number of different combinations, coupled to a special purpose digital computer programmed in a fashion which simulates brain mechanisms. (2) Its developuentat history began late in 1960 at ender En 25X1 Air Porte study contract. The prototype COMPLEX I system wus eompleted cation of pictures ofl 'personnel was presented. Related reeeareh in ftvembar 1962. Au_mplesssive demonstration of learning and /dentin- has been performed in the following reales: (a) Basic studies of biological response. systems. CO Application of the COOL= system to character recognition. (c) Application of the COMPLEX aystem to photographic image recognition. (d) ?renormalisation of photographic reconnaissance data. This is apparently a derivative of the original TielEMSON develop- ment. personnel have utilized these principles to design an ingeniouB, compact system which may have considerable application in the image recognitior field. Their prenormalisetion syytem is based an slit scanning of a given image field at a large nuther of different attitudes and the consequent pattern of signals generated by such scans. The nuOber and complexity of developments in this field indicate e Plana and Develegummt Staff must acquire more information before a cpiete program for developnont can be established. Mmplicatioec of the Observations made to date are as follows: Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/OLLPIZAFFQ104770A002300030029-4 -3- a. The type of approach appeers to dhow nueb ze pro- Of eventual Application to photo reconnaissanee analysis then the pure di itsl scan systems such as AMATO. b. effort is a conservative, compre- hensive, Tundameatal research approach which is likely to be slow in yielding directly applicable techniques or hardware. It does however, aerve as e pri- letry reference with a broad base and a large amouot of accomplished investization. Ry the eame token it is likely to yield valuable results from continued research.. c. The effort is representative of the relatively young research thoility. There is evidence of competent, highly motivated, dynamic developeeel being accomplished. It is possible thatr--land other orgamization3 of this 25X1 type end caliber will produce the first practical devices for automated tareiet recognition and they may outstrip Iin developing mnme special aspects of 25X1 more efficient electronic legic. In this regard it should be pointed out that hes invest ad a renormalinetion system very similar to the one preeent1L) eaused by multipleges within a single field. _771 under study at t was eventually rejected due to the 7:71: appears 25X1 to overcome this limitation, but due to the limited know e icqutred to date, a full assessment cannot be made. 5. . Immediate attention will be given to resolution of W! er of different endeavors being pursued in thi effort should definitely be supported for it level, amii there is a strong possibility that pars - by a raciLity such as Ishould also be supported. flower,25X1 is felt that further investigation is imperative before a decision can be resdered on the latter aspect ._Zn eeteblishing priority for funding it ia import:13# to note that thai Iprogram is currently being supported by the. Air Fores whereas th effort is to be terminated in June because the Navy ean no longer fund the program. Approved For Release 2005/05/02 : CIA-RpP7804770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 DESIGN OF A PHOTO INTERPRETATION AUTOMATON* SUMMARY (-71./wri The paper describes a system for automatic recognition of simple and com- plex objects in aerial photographs. Preliminary results from a general-purpose computer implementation of critical portions of the system are presented. Hope for achievement of a practical device is high because the basic pattern recognition capability required in the system is, to a great extent, based on the present state of the art. INTRODUCTION The extremely large volume of photographic material now being provided by reconaissance and surveillance systems, coupled with limited, but significant, successes in designing machinery to recognize patterns has caused serious con- sideration to be given to the automation of certain portions of the photo interpreta- tion task. While there is little present likelihood of successfully designing machines to interpret aerial photographs in a complete sense, there is ample evidence to support the conjecture that simple objects, and even some complex objects, in aerial photographs might be isolated arid classified automatically. 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02: CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation Even if machinery, produced in the near future, can only perform a preliminary sorting to rapidly winnow the input volume and to reduce human boredom and fatigue on simple recognition tasks, the development of such machinery may well be justified. The supporting evidence for the conjecture that simple objects can be identified in aerial photographs is based on work which has shown experimentally that present pattern-recognition machinery - indeed that which existed several years ago - can be applied to the recognition of silhouetted, stylized objects which are militarily interesting. Murray has reported just such a capability for a simple linear discriminator'. Since the information required to design more capable recognition machines is readily available, it might seem that there is no problem of real interest remaining to make a rudimentary photo- interpretation machine an accomplished fact. This, unfortunately, is not so. One of the most difficult problems is that which is referred to as the segmentation problem. The problem of pattern segmentation appears in almost all interesting pattern recognition problems, and is simply stated as the problem of determi- ning where the pattern of interest begins and ends (as in speech recognition problems) or how one defines those precise regions or areas in a photo which constitute the patterns of interest. The problem exists whenever there is more than one simple object in the entire field of consideration of the pattern recognizer. The situation appears almost hopeless when one finds patterns of widely varying sizes, connected to one another (in fact or by shadow), enclosed within other patterns, or having only vaguely defined outlines. See "Perceptron.Applications in Photo Interpretation, " A. E. Murray, Photo- grammetric Eovea nprin ezing Appr r-oer rteleateSidagE: d/k-i4DP781304770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation This paper constitutes a report on a system which has been conceived to solve some of these problems. It is being tested by general-purpose computer implementation. The system discussed represents one of several possible approaches to the problem and had its design focused towards the use of presently known capabilities in pattern recognizers. No special consideration has been given, at this time, to methods of implementing the device; however, the entire system can be built in at least one way. SYSTEM PRINCIPLES Figure 1 is the basic block diagram for the system. It has evolved from evaluation of possible approaches suggested by research conducted at pattern recognition work of others, and techniques successfully used in other problems. As is evident from Figure 1, objects of interest have been categorized in two different ways. First, simple objects, such as buildings, aircraft, ships, and tanks have been distinguished from complexes, or complex objects. Second, simple objects have been categorized, according to their length-to-width ratios, as being either blobs (aircraft, storage tanks, buildings, runways) or ribbons (roads, rivers, railroad tracks). As shown, the detection of simple objects is accomplished separately for ribbons and for blobs. In the work reported here the blob channel - from the input end through the identification of a few complex objects - is receiving the major attention. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 BLOB ISOLATOR AND STANDARD! ZATI ON BLOB OBJECT ASSOCIATION AND I DENT! Fl CAT! ON OBJECT OUTPUT COMPLEX OBJ ECT 41-- 01!TPUT INPUT BLOB OBJ ECT DETECTION V V V COMPLEX OBJ ECT ASSOC! ATI ON AND IDENTIFICATION A A A PHOTO ( SCALED) RI BBON RI BBON OBJ ECT RI BBON OBJECT DETECTI ON -to ASSOCIATION AND IDENTIFICATION CONTINUITY INTERPOLATOR 010 ON Figure 1 PHOTO INTERPRETATION SYSTEM BLOCK DIAGRAM Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation The preprocessing which is carried out in the first portion of the system solves several of the problems inherent in the use of a simple pattern-recognition device to aid in the photo interpretation problem. Briefly, objects are to be detected, isolated, and standardized so that they can be presented separately (not necessarily sequentially) for identification. The function performed at the object identification level is that of identifying the blobs which have previously been detected, isolated, and standardized. The input material to this level or state consists of black-on-white objects. As has been previously indicated, existing devices are fundamentally capable of ac- complishing the identification task. At the complex object level, the location and identification information available from the simple object-level outputs is combined and appropriately weighted to identify objects at a higher level of complexity. An illustrative example is the combination of aircraft (simple objects) near a runway (another simple object) and a group of buildings (each a simple object) to determine the existence of an airfield. In the following sections the basic steps in the preprocessing sequence will be described in more detail and some illustrations from current computer studies will be discussed. The most difficult part of the problem, by far, is that of detection. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation OBJECT DETECTION A study of sample aerial photography suggests three ways in which images of objects of interest differ from their backgrounds: a.) points on objects may differ in intensity from the intensity characteri- zing the background. b.) objects may be (perhaps incompletely) outlined by sharp edges, even though the interior of the image has the same characteristic intensity as the background. c.) objects may differ from background only in texture, or two dimensional frequency, content. Examples of the first two kinds of objects are shown encircled in Figure 2. There seem to be many fewer examples of objects which differ from background solely by texture. This class of objects would be much larger if our definition of object were broader, including, for example, corn fields. Perhaps the most useful area in which spatial frequency content can be put into use is that of terrain classification. Terrain classification, as will be noted again later, can play a significant role in the final identification of our narrower class of objects. For detection of objects in classes a) and b) above, we have been proceeding experimentally to determine the capabilities of simple, two-dimensional numerical filters, some nonlinear and some linear. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 2 EXAMPLES OF OBJECTS DEFINED BY INTENSITY CONTRAST (0) AND BY EDGES (LI) Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation For initial experimentations', the object filters for discrimination based on intensity contrast (class a objects) were designed as shown in Figure 3. Square apertures ("picture frame" regions) were used to compute intensity information which was then compared with the intensity of the point at the center of the square, A, to determine if the central point differed sufficiently in intensity from its back- ground to qualify as being a point on an object. A computing method equivalent to the following was used. Each point in the input photograph was surrounded by a frame one point thick, and of width d (Figure 3). The mean, m , and standard deviation, cr- , of the intensity of the points in the frame were then computed. If or A m # max , Ker) A .= m ? max (1 , Ka-) (1) the point was recorded as an object point. Several different frame sizes were used in order to detect objects of different sizes. >:c The experimental work reported was carried out using IBM-704 computer programs which were prepared to process photographic material. An input device was constructed to scan and quantize photographic information for input to the computer through the "real-time" package, and the computer printer was used to provide pictorial output. A commercially available facsimile transmitter capable of 50 lines/inch resolution and a commercially available analog-digital samples and encoder form the basic input device package. In addition, the necessary isolating and synchronizing circuitry has been designed, and constructed to permit the output of the facsimile machine (through the encoder) to be read by the "real- time input package" on the 704 computer. Quantization and processing com- puter programs have been written and are in operation. These programs cause photographs to be sampled every 50th of an inch, quantized in sixteen intensity levels, and stored on magnetic tape. A relatively crude, interim- nature, output has been arranged by using combinations of symbols available in the co rrifaPPrOvietliEW Releatn%2005g15/0Pe: ciArliQUeEE11477(14WOOMO2A-Aof four different levels of intensity. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 A .41 Figure 3 FILTER FOR DETECTION ON THE BASIS OF INTENSITY CONTRAST Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 'Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 4 ORIGINAL PHOTOGRAPH Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 POciroved For Release 2005105102 .. CIA-R0978604770A00230004,_ ? ? ? el.. ? .: ?,? ....... ...i"::?? ?:- ? ? : ?.:* ::ii? . ? ??* ??? ? ?? e . ?:. - .. . ??? ?? ? .. ? ? ? "?ii: .. o .0- ?11?4 . . : .....: - :. Figure 5 FILTERED PHOTO, d = 8 ??? ?? ? .. s ? .. ---,wed For Release 2005105102 : CIA-RDP7g3047701002300030029-4 Approved For Release 2005/05/02: CIA-RDP78604770A002300030029-4 ?? ?? ??.? Li- ? .. Figure 6 FILTERED PHOTO, d = 16 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 " ?: "..." ? " " . "" ... ? ? ?::: :.: ::?" ???? ???? ?C '::* :Iiii ** ? *::* Figure 7 FILTERED PHOTO, d = 32 ..... ..? + ? :: ? ? ? Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation Figures 5, 6, and 7 indicate the results of applying three filters of the type described above to the photograph shown in Figure 4. The frame widths were 8, 16, and 32 points, and K of equation (1) was 2. The points which satisfied the inequalities of (1) were printed as asterisks. The three figures illustrate that objects of different sizes are detected best (with least shape distortion) by filters of different size. This is especially noticeable for the building complex in the lower half of the photograph. In Figure 6 (d=16), the buildings are reproduced in perfect contrast about as well as can be expected considering the coarseness of the input information. In Figure 5, the buildings are broken up into segments, while in Figure 7, they tend to run together. The seaplane launching ramp at left center is missing completely from Figures 5 and 6, while the filter which matches it well in size reproduces it in Figure 7. It is important to note that the recognition logic used requires only that an object be detected by a single filter. Distorted versions which are detected by other filters will be rejected. Simultaneously with experimentation in detecting objects using the object- point-intensity criteria, similar experiments are being carried out to detect objects by outlining them. There are three steps in this process; (1) object edges must be "detected", (2) gaps in the outlines of objects must be filled in, and (3) for compatibility with the first method of object detection in later system stages processing all detected objects, the outlined objects must have their interior space filled in to produce silhouettes. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation The basic operation in edge detection is, of course, differentiation. The earliest results were obtained by centering a numerical filter of the shape shown in Figure 8 about each image point. c A d Figure 8 BASIC FILTER FOR EDGE DETECTION The values of intensity, ( d-C )=AX and ( a-b )= uiy were determined and the sum of their magnitudes was taken as the gradient associated with the center point, A. A similar filter with nine elements is now being tested with superior results. This filter has the form shown in Figure 9. a b c d e f g h i Figure 9 IMPROVED FILTER FOR EDGE DETECTION Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation Now the difference in the x direction is taken to be AX # 74' * /a # k ,/ and the difference in the y direction as 4), ( a sh c A, 1) (2) (3) Thus first differences are being used, as before, but a three-point average of intensity is used to establish the intensity on either side of the central point. The magnitude of the gradient associated with point e should, of course, be Igraci I p-x.t. + y z (4) The previous approximation to the true form Mr:67(..g Ay' has been improved over the simple sum of magnitudes in that we now use Iyrca1(1 CRP- of I XI (3711an er IAXI Ay') (5) If object detection by identification of edges is to be successful, one must plan on completely outlining objects of interest. In many cases, of course, there will be gaps in the outlines of objects as derived by edge detection. One procedure Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation currently being evaluated for this gap-filling job is described below. It accounts for the two factors which are most important in deciding whether to fill in a point of not; that is, such an action requires both proximity in intensity to the threshold value and proximity in space to at least one other super-threshold point. After gradient computation, as described above, the complete image, made up of points computed by Eq. (5), is thresholded, eliminating low gradient points. The "influence matrix" shown below is then centered over every point in the thresholded gradient image (i. e. it is centered over high gradient points), II 12 13 14 15 16 17 18 19 Figure 10 "INFLUENCE" MATRIX FOR GAP-FILLING and the numbers IP 1, I3' ---- are added to the values in the prethresholded form of the gradient image. If any point covered by the influence matrix now exceeds the previously used threshold, that point is "filled in" as a high gradient or edge point. It would, of course, be possible to train a recognition device to identify outlines of simple objects, but a much simpler system will result if outlined Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation objects can be simply converted to solid objects similar to the silhouettes pro- duced by the annular filter detectors. This can be accomplished very simply by forming the logical complement of the thresholded, edge-detected, binary picture and then operating on the complemented picture with the object isolator programs. OBJECT ISOLATION Originally computer routines which traced along the edges of silhouetted objects were planned for use in object isolation. This technique for isolation, however, does not solve the problem of how to extract the interior portion of the traced-out object from the background in any neat fashion. A different technique, devised by simultaneously traces through the interior of objects and records these elements in a frame for separate storage. At this stage, that is, after isolation, all images of objects are stored in binary form, in separate frames, and in their original size, orientation, and location within the frame. OBJECT STANDARDIZATION Standardization involves simply the translation of the binary image of the object so that its center of gravity coincides with the center of the frame and rotation of the image so that one of its principal axes of inertia is vertical. Recently, the programs being used in feasibility studies have been modified so as to provide scaling of all objects to the same maximum dimension. OBJECT RECOGNITION In the system being discussed, recognition of simple binary images, after detection, isolation, and standardization, will be accomplished by a linear dis- crimination device, i. e. by comparing the weighted sum of a set of property Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation values to a threshold. The weights used are determined by exhibiting a sequence of patterns whose classification is known and adjusting the weights when classi- fication is incorrect, according to prescribed algorithms, until all patterns in the sequence are correctly classified. Thus, the device is "adaptive" and "learns. The properties may be thresholded sums of intensity at randomly selected points in the preprocessed image, or they may be more "objective" properties, that is they may be measured values of such determinable features of the pattern or image as maximum extent, area, or moments about principal axes. Certainly, use will be made of the size, area, and moment information derived during the standardization process. The non-determinable properties mentioned earlier (thresholded sums of intensity at randomly selected points in the image) have the appeal of being very simple to derive and of being of demonstrated usefulness in classification problems.. The ability of a system using such properties to generalize over pattern distortion and small translations has not yet been defin.ed to our satisfaction. A recognition system using these non-determinable properties has been referred to by Rosenblatt as a simple perceptron. In our experimental work to date on this particular problem we have attacked the multiple class recognition problem by performing a set of dichotomizations. Some data on recognition capability have been gathered for synthesized patterns of the type to be produced by the preprocessor, but they are insufficient to make an explicit statement of capability at any reasonable confidence level. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 11 ORIGINAL PHOTOGRAPH Figure 12 PROCESSED PHOTOGRAPH AFTER OBJECT DETECTION AND LOW-PASS FILTERING 1111111' Figure 13 ISOLATED SILHOUETTES FROM Figure 14 STANDARDIZED FORM OF ISOLATED PROCESSED PHOTOGRAPH SILHOUETTES Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation Two types of information are fairly readily available in the system and have apparent use in classification but have not yet been used. Thus, we could use the silhouetted image of an object to mask out all but that object in the original full gray scale image and then derive one or more object properties available in the original image. (For example, one might scan the interior of an object, to derive some measure of its interior complexity). This technique makes available recognition properties other than those based on shape. The second possibility is to use terrain classification information for the immediate back- ground of an object to aid in the classification of that object. Certainly a final system might find such information useful. ILLUSTRATIVE RESULTS The entire sequence of preprocessing operations (including detection by intensity contrast, but not object detection using edges) is illustrated in Figures 11-14. These results were derived from experiments which use general-purpose digital computer programs to implement the entire sequence of operations. The first, Figure 11, shows the original aerial photo. It has been quantized spatially 25X1 and fed into IBM-704 computer through the facsimile input device. After processing by the intensity contrast filter, the binary photo of Figure 12 is pro- duced. (The output is the 704 printer; an asterisk is printed in each 1/50" x 1/50" cell of the original photo which is a point on an object.) We found that some additional low pass filtering is very useful in making objects "hang" together, in filling in imperfections in silhouettes, and in reducing the number of small col- lections of points which appear but which are really not objects. After this Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation filtering (a simple, low-pass, two-dimensional filter is used) and after eliminating collections of very few points, the binary photo was subjected to the isolation programs. This operation produced the frames shown in Figure 13. Each of these several frames from the silhouetted photo was then subjected to the rotation and translation. programs and the corresponding frames of Figure 14 produced. Now recall that standardization processing (1) fixes the center of gravity of a blob within the frame, (2) rotates it to align a major axis of inertia with the vertical in the frame, and (3) adjusts scale factor to roughly fill the frame. All information about how much translation, rotation, and scale change has of course been preserved for use in the recognition process (size is an input to simple target recognition, while location and orientation are inputs to target-complex recognition). Examining Figure 14 we find one odd looking building-shaped blob which must be explained. Referring back to Figure 13 we can locate the s'ource of this standardized object, a small collection of points. Mentally treating each of these points as a square, rotating and enlarging the resulting shape shows that the standardization routine functioned properly. Scale change information would prevent recognition as a building. CONCLUSIONS In this paper approaches to the preprocessing portions of a photo interpretation automaton have been discussed. Clearly, some extremely important evaluative work remains: 1. Detection capability must be quantitatively defined. This first requires that some plausible criterion or criteria for this capability be defined. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 ? ? ? 25X1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Design of a Photo Interpretation Automation 2. Recognition capability must be quantitatively defined. Here there exists a body of evidence that recognition of silhouetted objects is within the recognition capability of current state-of-the-art systems. We are currently applying measures similar to probability of detection and false alarm rate to the definition of recognition capability for a property-list, linear discriminator type of system. 3. Implementation problems for a prototype system must be solved. Our IBM-704 work is for feasibility only. We have kept implementation problems in mind during the current system studies and have carefully avoided using system elements which are unduly complex. As an example, more complicated two-dimensional filters for object detection represent a very real temptation, yet we have exercised restraint and used only the simplest ones which we felt held any hope. What has been achieved is a demonstration that a plausible system, combining current state-of-the-art pattern recognition capability and simple two-dimensional preprocessing operations, can be stated in specific terms and that it represents the very real and likely prospect of providing automated aid to photo interpreters. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Illustrations to Accompany Presentation On INVESTIGATION OF PERCEPTRDN APPLICABILITY TO PHOTO-INTERPRETATION (Project PICS Washington, D. C. December 20, 1962 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 2005/05/02 : CIA-RDP78604770 V2300O Approved For RTIrLTree2905/8EV itTieapalipg770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 2 EDITED VERSION OF FIGURE I Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 App oved For elea e 2 5/0 02 : APERTURE SIZE (a) DP7 B04770A 0230 030 29-4 APERTURE SIZE ( b) ApprobjliFer1Reletkilitil2005/05AMR: 604PRIDMI3C14/117(AA092090030229-4 1672 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 1673 Figure 1.1. OUTPUT OF GAP FILLER APPLIED TO FIGURE 3(a) Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 ? I 1.1.,????? 1 ( 1 P gyp ;4e Figure 5 OUTPUT OF ISOLATOR APPLIED TO FIGURE Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 I 14 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 i, , , , , , 4 , , ,,. 4.?.1 t 4 t Figure 6 OUTPUT OF STANDARDI ZER APPLIED TO FIGURE 5 Approved For Release 2005/05/02 : CIA-RDP78B04770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For &ftr3td2g05A)E5Y02N ACU4'-01/1101ABIB14770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 8 EDITED VERSION OF FIGURE 7 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 APERTURE SIZE (a) APERTURE S I ZE ( b) Figure 9 ANNULAR FILTER OUTPUT FOR TWO APERTURE SIZES Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 10(a) OUTPUT OF KOLMOGROV-SMIRNOV FILTER APPLIED TO FIGURE 8 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 3 1679 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Figure 10( b) OUTPUT OF KOLMOGROV-SMI RNOV FILTER APPLIED TO FIGURE 8 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 1680 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 *or ? if 1 Figure (0(c) OUTPUT OF KOLMOGROV-SMI RNOV FILTER APPLIED TO FIGURE 8 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 .111. 11... ? 1681 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 t , rLI? istF 4 1 7 : T.1 ?????????,.- , t 1 * I ? ; , , 171 *kr , 11H i dr" I I 11 ????????14* TM ,r ....??????????????? ?????1*.??????? ? ? ? Figure I I SAMPLE OF OBJECTS USED IN RECOGNITION EXPERIMENTS Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 1682 Approved For Release 2005/05/02: CIA-RDP78604770A002300030029-4 RESULTS OF RECOGNITION EXPERIMENT Synthesized Objects Correct Pattern Classification Total Number Number Correctly Classified Number Incorrectly Classified Recognition TU 104 60 60 o 100.0 IL 18 60 60 o 100.0 LA 60 60 60 o 100.0 F 102 60 60 o 100.0 SHIPS 60 59 1 98.3 BLDOS 90 86 4 95.5 TANKS 60 59 1 98.3 Object Total 450 444 6 . 90.7 Other 270 262 8 97.0 Object Detection Probability m. .987 False Alarm Probability m .030 COMPLETE RECOGNITION RESULTS 8 H E T A 0 Recognized N H fri As tcg pl . E-4 0 Correcti Classification TU 104 bo IL 18 60 LA 60 6o F 102 60 SHIPS 59 1 BEGGS 86 4 TANKS 59 1 OTHER 2 6 262 1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25 June 1962 PAPER PERCEPTRON Notions of random connection generalization and learning have dominated discussions of the perceptron to the point of obscuring its basic principles of operation. Basically, the concept is simple; the mathematics describing its operation oftentimes appears complex and obscures the basic simplicity of the central ideas. The purpose of this discussion is to explain the perceptron concept by showing how it works. A simple paper model (attached as a series of six pages to this discussion) is used as a means of demonstrating its classification function and illustrating the training process. The perceptron can be divided into three basic sections - sensory, discriminatory, and response. Figure 1 shows a diagram of a simple perceptron. The sensors (or S-units) respond to stimuli from the machine's environment by producing a unit voltage or not, depending on the level of stimulus. The sensors, arrayed in an orderly pattern, are connected in a semi-random fashion to the discriminating layer (or A-units). Several sensor leads are connected to each A-unit in such a way as to produce a sum of stimuli from sets of sensor points. The model attached is a simplified paper representation of perceptrons which respond to light. Naturally, a perceptron of such small size cannot be expected to perform sophisticated tasks but two letter discrimination within a limited field is a suitable task for demonstration purposes. In the model, S-A unit connections are represented by the clear squares in each of the ten separate A-unit masks (page two of the model). Every opening indicates a connection between that A-unit and a sensor point occupying that position in the sensor group. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 PAPER PERCEPTRON 25 June 1962 -2- The A-units serve to totalize the voltage input from all the sensors connected to them and present this sum to a thresholding device. Page one of the model shows the stimuli the machine is trained to classify. Properly trained, it should be able to recognize the difference between all "P's" and all "E's". Each letter is displayed in four positions to demonstrate the flexibility of the perceptron and its ability to cope with letters which do not maintain a single position. To see how the A-unit operates, we put the stimuli page under the A-unit page (page two of the model) so that the number of the stimulus appears in both the top and bottom pilot windows of the A-unit. Since the dots on the letters represent S-units stimulated, and the holes in the A-unit diagram represent S-A connections, it is apparent that a count of the dots visible through the squares represents the total voltage the A-unit receives. This total has been computed for each unit and stimulus, and the result has been entered in the upper matrix on page three of the model. Thus the matrix contains the sums of the input voltages at each A-unit, before thresholding, when the various stimuli are shown to the sensors. The summation voltages are then thresholded to produce a "unit" voltage if the threshold is exceeded, a "zero" voltage in all other cases. In the model, 6 = 31 was selected for the threshold. The lower matrix on sheet two of the model shows the output of each thresholding unit for each stimulus, In this matrix, each stimulus has its own unique binary number made up of the thresholded A-unit outputs taken in order. This suggests that there could be a way of training the perceptron to recognize the difference between the letters. It is apparent that the larger the number of S-units and A-units, the more patterns could be assigned a unique binary number and thus the greater the scope of the perceptron. Indeed, larger perceptrons have capabilities far beyond those indicated in this simple model, We train the perceptron to recognize the letters by adjusting "weights" or voltage multipliers which connect the A-units to the response unit. When- ever the thresholder of any A-unit puts out a voltage, this voltage is multiplied by a weight which may give a positive or negative product, depending on its value. The response unit merely totals all the A-unit voltages and Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 PAPER PERCEPTRON 25 June 1962 thresholds the sum. If the A-unit sum plus the threshold (10 units in the model) is equal to or greater than zero, the response unit puts out a positive unit of voltage. Negative sums elicit negative unit voltage outputs and "zero" sums, of course, give no output. The machine is trained to distinguish between two letters, by adjust- ing the weights so that the response unit gives plus values for all letters of one type and minus values for all letters of the other type. Pages four and five of the model are a representation of this training process. Page four is a mask of the thresholded A-unit responses for each stimulus made from the information in the second matrix of sheet three, Each clear part in both the left and right hand columns of each stimulus mask indicates the A-unit which produced a plus "one" output. By placing the mask on page four of the model so that the stimulus number on the training cycle page appears in the pilot hole of the proper mask, the weight associated with each A-unit can be seen opposite the A-unit numbers. By totaling the weights and adding the threshold just as a physical machine would, the sum ( Z ) which the response unit receives is obtained. If the sum is of the wrong sign, the weights involved must be changed. It has been shown that it is best to change the weights so the sum is as great in the proper direction as it was in the wrong direction, This change ( d ) is divided among the N weights involved and each is changed by . These new values are recorded in the second column of the mask and the new sum is listed opposite 2c, . Starting with the weights all at zero, the machine is exposed to P1 as the first stimulus in the training sequence to which it responds correctly. Using the stimulus E1 and the same weights, the machine misclassifies and must be corrected by adjusting the weights using the procedure just described until a correct reading is obtained. As the sheet shows, the machine was exposed to all the stimuli in an orderly fashion until the perceptron correctly identified all the stimuli, On page five of the model, only the stimuli incorrectly classified during the training cycle are shown for the sake of brevity. The final column (labelled Wi) contains the final weights Page six shows that the perceptron now correctly classifies the eight stimuli. Now the tasks used for this demonstration are quite elementary. Extensions to more sophisticated tasks would demand a much larger perceptron, Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 PAPER PERCEPTRON 25 June 1962 -4 which would become unmanageable for paper computation. Thus, this exercise can only be thought of as an exposition of principles of operation, LTLH-1/rmm Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 SENSORS 1 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 1ST LAYER DISCRIMINATION T THRESHOLD ?CCT 2 ? DISCRIMINATION WEIGHTS 0, I ) 5 6 7 ? ? 9 e9 10 ? THRESHOLD CCT (0,1) THRESHOLD CCT THRESHOLD CCT (0,I) wi W3 W5 W9 Figure I PATTERN RECOGNITION CONCEPTS Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 RESPONSE THRESHOLD Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 INPUT STIMULI P3 ? ? ? ? ? ? ? ? ? ? ? E1 0 0 0 ? E1 P2 ? ? ? ? ? ? ? ? ? ? ? ? P2 ? ? 0 ? ? ? ? E2 ? ? ? ? ? ? ? ? ? ? ? ? 6 ? ? ? ? ? 6 ? is ? p. ? ? E3 ? ? ? ? ? ? ? ? ? ELI ? ? ? 0 ? ? PAGE ONE Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 DI SCRIM! NATI ON UNI TS A10 PAGE TWO Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 "A" UNIT NUMBER MATRI CES SUMMATION MATRIX STIMULUS IDENTIFICATION PI P2 P3 P4 El E2 E3 E4 1 2 5 3 1 3 4 3 2 2 2 I 3 3 4 2 2 2 3 3 3 4 3 2 2 4 3 4 2 3 3 2 3 4 3 2 5 4 It 2 2 It 5 1 1 6 3 5 4 2 It It 3 2 7 5 5 4 3 3 5 14 It 8 4 It I 4 3 4 3 5 9314 It 3 2 14 2 3 10 3 It 3 2 2 2 It I THRESHOLDED VECTOR MATRIX 2 3 "A" UNIT NUMBER It 5 8 9 10 STIMULUS IDENTIFICATION Pi P2 P3 P4 El E2 E3 E14 0 I 0 0 0 1 0 0 0 0 0 0 I 0 0 0 0 0 1 0 0 0 I 0 0 0 0 0 0 1 0 0 11001100 01101100 11100111 1 1 0 1 0 I 0 1 01100100 0 1 0 0 0 0 I 0 0= 31- PAGE THREE Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 STIMULUS AND WEIGHTS AFFECTED : "A" UNIT NUMBER THRESHOLD: TOTAL CHANGE : ACTIVITY VECTOR MASK PI(N , 3) P2(N = 7) 1111 2 3 4 5 6 7 8 9 10 'I CHANGE PER UNIT: E CORRECTED SUM : Ec STIMULUS AND WEIGHTS AFFECTED : 1111 El(N =3) 2lU 3 6 ii , 5 "A" UNIT 7 NUMBER 8 9 10 E3(N = 3 THRESHOLD : SUM: TOTAL CHANGE : 6 CHANGE PER UNIT: 6 CORRECTED SUM : PAGE FOUR Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 200T5i8t1d216CMZ5078B04770A002300030029-4 P-1.10 E.? STIMULUS IDENTIFICATION P2 E3 E4 P1P2 E2 P3 E3 E4 PI E1 o o I I I 1 I -I -I -I -I -I 2 0 0 -7 -7 -7 -7 -7 -7 -7 -7 -7 -7 -7 3 0 o o o -6 -8 -8 -8 -8 -2 -3 -3 -3 4 o o o o o o o o -2 -2 -2 -2 -2 5 0 o -7 -6 -6 -6 i I -1 -1 -1 -1 7 6 0 o -7 -6 -6 -6 -6 -6 -8 -2 -2 -2 -2. "A" UNIT NUMBER 7 o 0 o I -7 -1 1 -4 -3 -5 1 o -12 -4 8 0 0 o I i -3 4 4 2 2 2 -10 -2 9 0 o o I I I I i -i 5 5 5 5 10 0 o o I -7 -7 -7 -7 -7 -7 -8 -8 -8 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 -4 12 4 -10 0 8 -12 2 12 -13 -20 -7 8 1 -24 -8 -8 -4 20 7 1 ( 1 ) -16 -2 24 6 -4 -1 -24 -12 26 8 J 10 -11 3 -12 -4 II 1 -6 12 -1 -12 11 Ei P2 E2 P3 PI P2 E2 P3 P2 E3 E4 P2 1 -e -I o -2 -2 -2 -2 1 -2 -2 -1 -1 -1 -1 2 -7 -12 -12 -12 -12 -12 -12 -12 -12 -12 -12 -12 -12 -12 3 -3 -3 -3 -3 -2 -2 -2 -2 -2 -2 -2 -3 -3 -3 4 -2 -2 -2 -4 -4 -4 -4 -4 -7 -7 -7 -7 -7 -7 5 7 2 3 1 1 1 2 5 2 2 3 3 3 4 6 -2 -7 -6 -8 -7 -7 -7 -4 -7 -6 -5 -5 -5 -4 7 -4 -4 -3 -5 -4 -7 -6 -3 -6 -6 -5 -5 -6 -6 -2 -2 -1 -3 -3 -6 -5 -2 -5 -5 -4 -4 -5 -5 5 5 8 4 5 6 5 8 5 5 6 6 6 6 10 -8 -8 -7 -7 -7 -7 -7 -4 -4 -4 -3 -3 -3 -3 10 10 10 10 10 10 10 10 10 10 10 10 10 10 8 -5 7 -2 3 -2 -10 1 1 0 -6 o I -I 4 -16 10 -14 4 -6 It 20 -22 1 12 -1 -2 2 -5 1 -2 1 -3 1 3 -3 ( I ) 1 (-1) -I (1) Sc -7 2 -7 2 -3 I II -10 I I -I -I 1 PAGE FIVE Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 TRAINED PERCEPTRON RESPONSES P E STIMULUS IDENTIFICATION P3 EI I -1 _1 _1 _1 -I -1 -i 2 -12 -12 -12 -12 -12 -12 -12 -12 3 -3 -3 -3 -3 -3 -3 -3 -3 4 -7 -7 -7 -7 -7 -7 -7 -7 5 4 4 4 4 4 4 4 4 6 -4 '4 -11. -34 -4 -4 -4 4 "A" UNIT NUMBER 7 -6 -6 -6 -6 -6 -6 -6 -6 8 -5 -5 -5 -5 -6 -5 -5 -5 9 6 6 6 6 6 6 6 6 10 -3 -3 -3 -3 -3 -3 -3 -a r 10 10 10 10 10 10 10 10 E 3 1 3 6 -2 -3 -2 -I PAGE SIX Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 (r,i/La Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 INPUT/OUTPUT EQUIPMENT FOR RESEARCH APPLICATIONS Reprinted from "Proceedings of the NEC" 1962 Vol. 18 pp. 509-517 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 25X1 Approved For Release 2005/05/02 : CIA-RDP781304770A002300030029-4 CPYRGH INPUT/OUTPUT EQUIPMENT FOR RESEARCH APPLICATIONS By: W. S. Holmes and H. M. Maynard Cornell Aeronautical Laboratory, Inc. ABSTRACT The purpose of this paper is to review the input/output requirements for computers used in research activities, to establish a cohesive phi- losophy encompassing foreseeable requirements, and to illustrate that philosophy with case his- tories of equipment designed and constructed at Cornell Aeronautical Laboratory for its own research purposes. Research problems arising in the areas of photointerpretation, pattern rec- ognition, speech analysis and radar signal analy- sis are described with examples of specific solu- tions of these problems in terms of input/output equipment. This paper, in addition, demonstrates that access to a special input/output system will permit for the solving of research problems, a radically different approach which is not ordinar- ily evident at the outset of an investigation. INTRODUCTION Over the past decade, computers have pene- trated and served almost every facet of scientific research including research on information proc- essing systems themselves. Accompanying this penetration has been an intensified requirement for input-output systems satisfying special needs not met by standard available equipment. In the very recent past, movements towards satisfying these requirements are in evidence, but the over- all pattern has not been completely established. The purpose of this paper is to review the input-output requirements for computers used in research activities, to establish a cohesive phi- losophy encompassing foreseeable requirements, and to illustrate that philosophy with case his- tories of equipment designed and constructed at Cornell Aeronautical Laboratory for its own re- search purposes. In the course of the discussion, we will describe research problems areas impos- ing special requirements and to present specific approaches to the solution of these problems in terms of input-output equipment. In many cases, we have found that access to a special input- output system permitted a radically different approach to a research problem and opened re- search vistas not clearly appreciated before the revised approach was taken. In retrospect, it is possible to observe major trends in the use of computers which have im- posed progressively more exacting requirements on input-output facilities for general-purpose digital computers. These trends are roughly delineated by Table I. At the outset, computers were used chiefly in scientific computation. Characteristically, this imposed minimal demands on input-output equipment in terms of the quantity of data to be inserted into the machine. Card and punched tape inputs were entirely adequate and printer/plotter output largely satisfied display requirements. Use of computers in information systems such as SAGE, MISSILE MASTER, and NTDS imposed new requirements and raised two issues which significantly affected attitudes toward 509 Table I EVOLUTION OF COMPUTER USE AND INPUT/OUTPUT REQUIREMENTS APPLICATION I/O CHARACTERISTICS SCIENTIFIC COMPUTATION INFORMATION PROCESSING SYSTEMS LIMITED I/O DATA STEREOTYPED DATA OFTEN NIGH VOLUME DATA OFTEN REAL-TIME I/O INFORMATION PROCESSING RESEARCH (I) BY SIMULATION (ARTIFICIAL INPUTS) (2) BY IMPLEMENTATION (REAL INPUTS) (3) BY REAL-TIME EXPERIMENTS LIMITED I/O DATA INCREASING AMOUNTS OF I/O DATA UNUSUAL INPUTS SUCH AS PHOTOS, SPEECH, ETC. ENCOUNTERED EXTENSIVE DATA TRANSFER I/O COORDINATION WITH COMPUTATION AS WELL AS EXTERNAL SYSTEMS PRINCIPLE PROBLEM MASS DATA REDUCTION EXTENSIVE INPUT DATA BUFFER STORAGE REQUIREMENT input-output systems: (1) the use of computers in a real-time environment, (2) the need for mas- sive, sophisticated information processing re- search to make the system work at all. Issues of noisy inputs and nonlinear processes forced much of this research into the general purpose com- puter. The input-output needs for computers used in information systems, however, tended to be stereotyped, special-purpose, and therefore not of interest in a discussion of research re- quirements. At first, research on information systems problems was confined to analytic studies and simulation of the problem using artificially gen- erated inputs. More recently, pattern recogni- tion research as well as some signal processing research have entailed problems for which arti- ficial inputs cannot be readily generated. Thus, special-purpose input systems for pictorial and other forms of information have become needed. Consequently, we see information processing re- search imposing demands on computer input systems which cannot be met with conventional card or perforated tape inputs. At the same time that research on informa- tion systems became important, scientists deal- ing with experimental research problems began to seek in computers, a way out of the cumber- some, time-consuming, and costly data reduction problems which their experimental procedures were imposing upon them. Requirements for the processing of large masses of data then prolifer- ated, and solutions to input-output problems im- posed by this movement have become imperative. Generally, the data to be transferred into the computer are time-varying functions in several parameters and, by and large, some sort of a buffer store is required between the experiment and the computer. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH Since some of the most interesting input-out- put problems imposed by today's research activi- ties are entailed in the last two of the areas cited, information processing research and mass data reduction, we will concentrate on these areas as a means to focus this paper. INFORMATION PROCESSING RESEARCH The two major areas of information process- ing research which currently impose the most significant input-output requirements are shown in Table II. It is clear from the research topics Table INFORMATION PROCESSING RESEARCH PATTERN RECOGNITION SHIM PROCESSINO ALINA-NumERIC CHANACTER RECOGNITION NANO PRINTED ClikRACTEII RECOGNITION IIANOBRITTEN CHARACTER RECOGNITION TWIERAIIINT IDENTIFICATION m tAv INTERARETATIoN SPEECH RECOGNITION tECOONIIION Of BUIIMAIIINE BOUNDS DECOY OESCRImINATION AUTO-DOES SOUNDS RESEARCH COMMUNICATIONS COOING SPEECH OANOMIOTHCONPRESSION MOTO BANDWIDTH COMPRESSION ELECTROCARDIOGRAM INTEMPRETAIION ELECTROENCEPRALOORAN INTENPMETATION DATA AnCESSIND INTERTEROGIIAMfiNALTSIS PARTICLE SIEIND AND COUNTING CELL COuNTIBO OLTENIA COMITINO AEROSOL iiimEsTIORTION GEOLOOICAL muCRO-AS,ANINO delineated in.these figures that the salient feature of each problem which imposes an unusual input- output requirement stems from the fact that the Input to the system under study cannot be readily characterized analytically. Research in other areas such as radar track-while-scan research could readily proceed through the mechanism of artifically generated inputs. Confidence in the validity of these artificial inputs, while not per- fect, is high enough to justify use of the research results for most design purposes. On the other hand, characterizing either an aerial photograph or the essentials of the spoken word analytically is virtually impossible except for early explora- tion studies. Thus, research in these areas uti- lizing general-purpose computers is dependent upon input-output equipment suitable for the task. Further contemplation of the research prob- lems demanding special input equipment convinces one that a pictorial input system capable of ac- cepting monotone, opaque or transparency, material and transferring to store, four to five bit quantizations of the gray scale level for a spatial square matrix of l0 to 107 elements would satisfy a very large percentage of the problems involving two-dimensional patterns. Figure 1 shows the major blocks of a Photo Input System designed and put in operation at the Laboratory for use on its research programs.* The facsimile transmitter is a standard commer- cial unit designed to accept flat copy 8-3/8" wide. The scanning rate is 6 lines per second, corre- sponding to a feed rate of 3.6 inches per minute. The system is synchronized by incorporating a pulse generator into the facsimile transmitter. This consists of an inductive pickup associated with a 180-tooth gear driven by the mechanical scanning system at five times the line scanning speed. The resultant "clock" frequency is thus *This development was sponsored jointly by Geog- raphy Branch, Office of Naval Research and Photo- graphic Management Division, Bureau of Naval Weapons. 510 5.4 Kc with 900 pulses per line locked to the scan motion. The clock frequency after shaping is used as a trigger to control the conversion time of the analog-to-digital converter. The converter receives a continuous analog signal from the fac- simile scanner by way of the photomultiplier, cathode follower and gamma correction amplifier. (The latter restores the compression introduced in photographic reproduction processes.) At each trigger pulse, the instantaneous analog signal is converted to a four-bit binary number in 22 micro- seconds and appears on the four output lines of the converter. The converter also furnishes an end-of-conversion pulse 0.5 microsecond after conversion. This pulse is used after shaping, to control the "read" time of the computer via the "MG" line. The circuitry shown in the block diagram as the input buffer section accepts the digital infor- mation from the analog-to-digital converter on the four lines and modifies the levels correspond- ing to "0" or "1" to make them compatible with the computer. The gate associated with each level changer is under control of the "read select" line from the computer. The gate is either open, permitting information transfer, or closed, clamping the cathode follower to "0" level. The "read select" line also controls the copy feed motion via the cathode follower and relay in the control section. The photo input system is controlled com- pletely by the digital computer as if it were a standard item of peripheral equipment, such as a tape reader. A computer program developed by the Laboratory makes use of copy content to con- trol registration of copy area. A black bar, four Inches long by one-fourthinch wide, is placed at the top of the copy 0.4 inch above the desired first line of the area to be .read. The bar may be printed as part of the photograph or may be ap- plied as a strip of black tape. To perform a reading operation then, the facsimile transmitter Is energized, starting the scanning system. The prepared copy is inserted until engaged by the pinch roller using the roller hand wheel. The computer program is read into the computer in the normal manner. When the program step selects the real-time input, the read select line is ener- gized, opening the gates in the input buffer and starting the copy feed motor. The computer pro- gram starts a search routine looking at the digital Information presented to the I/O bus. This pro- gram step continues until the black bar is inter- cepted. The next step allows the copy to advance 0.4 inch, then initiates a second search routine to locate the pedestal (black level interval at end of each scan line). Location of the pedestal starts a counting routine that counts the number of MQ pulses received as the scan line advances from left to right. Upon reaching the desired magnitude (100 counts per inch), the computer transfers the digital number presented to the I/O bus into core storage. Each sample thereafter is transferred until a total MQ count of 900 is reached. Counting continues until the desired magnitude is again reached on the next scan line (without digital num- ber transfer). Establishment of the count again starts the digital number transfer to core storage and the cycle is repeated until a predetermined total number of MQ pulses has been reached, terminating the read-in program. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH PULSE GENERATOR MECHANICAL SCANNER OPTICS PHOTO- MULTIPLIER CATHODE FOLLOWER COPY FEED MOTOR CONTROL SCHMITT TRIGGER FACSIMILE TO L. RELAY CON TOOL CAT ODE FOLLOWER BLOCKING OSCILLATOR BLOCKING OSCILLATOR CATHODE FOLLOWER GAMMA CONVERSION TRIGGER PULSE --a. AN TO DIGITAL CONVERTER END CONVERSION PULSE CORRECTION Figure I ANALOG SIGNAL READ SELECT LINE INPUT BUF ER 20 LEVEL CHANGER GATE LEVEL CHANGER 1 CA 111 ODE 1_4_ FOLLOWER 21 GATE LEVEL CHANGER CATHODE FOLLOWER 22 GATE 1.4-? LEVEL CATHODE FOLLOWER 23 CHANGER H CATHODE FOLLOWER 1?i? OATE 1"4-40 _I PULSE h IMPL I F I ER IBI4 704 COMPUTER REAL-TIME INPUT I/O BUS "M Q" LINE BLOCK DIAGRAM OF PHOTO INPUT SYSTEM Thus it may be seen that the recorded area of copy is controlled vertically by the location of the black bar and the total magnitude of the MQ count. Horizontal location is controlled by the MQ count along the scan line. In the actual program, a packing routine is used to conserve core storage by consolidating nine four-bit samples into one standard thirty-six bit computer word. Flexibility of this control method should be apparent. Figure 2 shows the physical arrangement of the major components. The gamma correction amplifier appears at the top of the rack. Directly below is located the modified facsimile trans- mitter. Below the transmitter is the control and input buffer circuitry, while the analog-to-digital converter and associated power supply are located at the bottom of the rack. Primary power for the control and input buffer chassis is supplied from the 704 computer. Figure 3 shows the details of the pulse generator. Access to a photo input system enables a re- search worker to investigate two-dimensional filtering, processing, and decision making methods without costly breadboarding of proposed systems. Photographic inputs are inserted with full usable gray scale for numerical processing. Figure 4 shows an aerial photograph of which the left half was read into the computer, and printed out in four levels of gray scale using different letter symbols for each gray level. Figure 5 shows a numerical printout of alpha-numeric pictorial material which was read into the computer by the 511 photo input system and printed out using each of the ten numerals and four letters of the alphabet to represent each of the available 16 levels of gray. The nature of processing research which is accommodated by such a photo input system illus- trated by Figure 6 which shows an original photo- graph (6a) and three different types of numerical operations performed on the photo after read-in to the computer. From the full gray scale record, two-dimensional filters were used to produce the binary image (6b). Although it is not immediately apparent from the illustration, one of the two filters was a sophisticated variable, dual thres- hold filter. The other was a simple low pass filter. The objects in the (6b) binary image were then isolated one from the other to make the separate binary images (6c). A scissors pro- gram was used to produce this result. Finally each image was normalized by measuring c.g.'s, angle between horizontal and major axis of inertia, rotating each image through that angle, and re-sizing the images to a common vertical dimension. These examples afford only a glimpse of the research potentialities of such an input system. Naturally a matching photo output system is a useful adjunct to the photo input system described. Speech recognition and bandwidth compression constitute the remaining major area of research for which special purpose inputs may be of value. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH Figure 2 EXTERNAL VIEW - PHOTO INPUT SYSTEM Figure 4 ILLUSTRATION OF DIGITALLY STORED PHOTOGRAPHIC INFORMATION 512 Figure 3 DETAIL OF PULSE GENERATOR ggngUgggilgOggTgOggaggOgggOnggrg2ggggnOgggg:gegggilgOggnOgOgg:MgOgg2=PMTov 000m000m00000man00000noonnonoom000poom000noomoononoacoormooem00000m000no. m 301 0.0000000000000000000000000.00000000000000000NOPOPM10.0000000.000000000013000.0000004v.vuou001 .000000.000.000.000000000000000000.00000000000.00006040.000000000000000000.00000000000000000t gONINIINNNNNUNNUMMMIIIINPINSNNBHNBNUNMINNNMINNgUNOU o1379999u999e ',leo .000000000000001,.....911762.30.000000000000.4.999011807,152M000000001100( M4VAMUNngMagggr22221Vg2n4nEWUMNSMUNSINNINWHUNUUNgnNN One 1162111 11100D000000000000000.001978510.00000000000000000.0.0001.740011000000000000.100000000( 01.102100000000000000000.00000001S,851000000000000000.000(100.0000?741000000000000000000000000( 013711511000000P00000000000000000000197711000.0000M01)00000000000000/75.00000000000001100000000( 0137.1100010000000000000000000000002675.0004000000000000n00000000001277,1000000000000000000000000( aRMINggN22,3gg=2=4NNgging=====nNgNaMINSI8ggggf 0121,7032210000000000000.0000000012579210000000000000000.000000000026.2000000000000000000000000( 012.0100000000000000000000000n0000147510000000000000.006000.000000161.200000000.00000000000000( 0025,62000000000000000000000000000002570200000000000000000000000000000137.1000000000.000000000000. 002.61000.000000.0000.00000.0025.621000000000000006000000000000010.2000000000.000000000000K 00147010000.00000000000000000000.01.76200000000000000000000000000000157620.0000000000000000000000C Mai.142M12g=gg=2:41::!!??2=gggg2g=gg=g3g11.1:;:2:MMIN:g2:24:2:? 001255922221111000000000.00000000001367.33333210000000000000000000001311977,171.0200000000000000t 20 11:::::2 :::02 I00 0 0 0 0 00 0 1 00 0 0 0 018=2::1 :24 2 2 20 0 80 0 00 00 0 00 00 0 0000 11011122 tg2 2 0 a02 2 gg0 = 001314.33M2100000000000000000000000011110000000.00.000000000000000000000000000000000000000000. NggIgg41:42TATMT4NOMgggggggggSaggg8g2gggVnggggggggNg&ININITAIgggggS=1 ggNg4MINgg2g2N23g=ggnN=IgNgSgM:gggg;g:4MgggggNgNogg2gN222220. Figure 5 NUMERICAL PRINT-OUT OF PHOTOGRAPHIC DATA pp(ovecl I 01 tiii. - Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH T Figure 6a ORIGINAL PHOTOGRAPH ISOLATED SI LHOUETTES FROM PROCESSES PHOTOGRAPH 1689 1688 Figure 6b PROCESSED PHOTOGRAPH AFTER OBJECT DETECTION AND LOW-PASS FILTERING STANDARI ZED FON OF I ROUTED SI LHOUETTES 1690 Figure 6c ISOLATED SILHOUETTES FROM PROCESSED Flgure 6d STANDARDIZED FORM OF ISOLATED SILHOUETTES PHOTOGRAPH 513 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH Actually the approach one uses for this type of re- search is as much a matter of preference and background as necessity of or even advantage. Laboratories staffed with experimentally oriented personnel, well versed in filter and active circuit design, may not observe any advantage in the flexibility afforded by gener-t; purpose computers. Nevertheless, we cannot ignore the fact that very flexible filter programs for general purpose machines can be prepared and almost any experi- mental operation which can be performed on a speech record could be as readily performed within a general purpose computer. An input system for speech research purposes could be limited in performance to cover only the frequency bands known to be needed for recogni- tion of spoken words by humans, but to so limit the system might undesirably inhibit research. Thus the system described here, in use at the Cornell Aeronautical Laboratory, covers a sampling rate up to 27000/ second and employs much of the same circuitry developed for the photo input system. In regard to Figure 7 one can see that the input buffer section is similar except that the full eleven bit capacity of the A/D con- verter is used, thus affording 1/2% least count for the system. The control unit accomplishes the same functions as in the photo input system with added circuitry to provide one additional control line to the computer in the form of an end of record signal. Both of these features enhance the flexi- bility of the system by allowing complete control CLOCK PULSE END OF NEC. ANALOG TAPE TRANSPORT OR DIRECT DATA CONTROL SCHMITT TRI GGER BLOCK IMO ASCII LATOR CATHODE FOLLOWER BLOCK ING OSC ILL A TOR BLOCK IRO OSCILLATOR CATHODE FOLLOWER -J CONVERSION TRIGGER PULSE by computer program. For example, the com- puter program can specify the number of bits de- sired in a particular application to match the accu- racy of the analog data presented. The end of record pulse serves the function of segmenting the data in convenient blocks for subsequent mani- pulation within the computer. Performance of this system is intimately related to the processing program desired for the data reduction. The converter has a conversion rate of 44,000 samples/second, but when applied to an IBM 704 computer using fixed program logic and no word packing, a resulting read-in rate of 27,777 samples/second is obtained. Adding dynamic program logic reduces the input rate to 16,667 samples/second. As a consequence of finite core storage and inadequate time for trans- fer to tape during a run, lengths of coherent records are limited unless word packing is em- ployed. Packing the 11-bit information in the 36-bit word length in order to extend run length further reduces the input data rate to approxi- mately 6,000 samples/second. One of the first applications of this equipment was in connection with the investigation of radar video. Here the objective was to improve experi- mental measurements by long term integration. The video signal was recorded on a suitable mag- netic recorder together with a trigger pulse and range marker pulses on separate tracks. The ANALOG TO DIGITAL CON VERTER ANALOG SIGNAL END CONVERSION PULSE END OF RECORD LINE INPUT BUFFER LEVEL CHANGER 20 GATE H CATHODE FOLLOWER r TE V EL- -1 L CHAN GERd---"i ECA HOD T2^ 1201.10WERJ GATE L 21 I LEVEL CATHODE CHANGER FOLLOWER GATE IBM 7014 COMPUTER REAL-TIME INPU I ) I/O BUS ---b. HA PULSE PLIFIER "M Q" LINE Figure 7 BLOCK DIAGRAM OF GENERAL PURPOSE ANALOG INPUT 514 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH signal then was played back at reduced speed and fed to the input of the general purpose input sys- tem. The trigger pulse served as the end of file input while the range marker pulses served as clock pulses. From the specifications on the system, it is clearly compatible with speech recognition and bandwidth compression requirements. Thus, we see the outlines of approaches to the solutions to broad areas of information proc- essing research. This equipment has been found not only to facilitate the special research for which it was designed, automatic photointerpreta- don, and radar information studies, but also to have a broader application to research problems of the two generic classes listed at the outset. MASS DATA REDUCTION Wind tunnels have provided one of the earliest forces toward mass data reduction by digital com- puters. Digital recording systems have been em- ployed in the Cornell Aeronautical Laboratory's variable density tunnel since its inception in the 1942-45 era. A large number of parameters are measured very accurately; equations governing data reduction are complex; and timely results for examination in the course of a sequence of runs is of great economic advantage. Although on the surface these factors might suggest an on-line operation with direct electrical connection of the sensors into the computer, the Cornell Aeronauti- cal Laboratory found the use of punched card buffer store to be satisfactory and more economi- cal. The notion of having experiments funneling data into the real-time bus of a computer for processing and display of instant results at the experimental site has a certain appeal to the show- man in us, but can rarely be justified. Most input-output systems for mass data reduction will incorporate some form of buffer storage. Choice of buffer stores is part of the input- output design problem. Experience with strip chart recorders - the principal buffer store during the past two decades - predicts that computer input systems will be required to provide compati- ble buffer stores over a frequency spectrum of at least 0 to 5000 cycles per channel, with channel capacities ranging from two to ten. Experience at the Cornell Aeronautical Labo- ratory with research problems ranging from air- craft control problems to measurement of three- dimensional radar cross-section profiles, has demonstrated a clear, high-volume research re- quirement for a digital buffer store to handle an information rate of up to ZOO bits per second. The basic system, using punched perforated tape, is sufficiently flexible to permit selection of the number of channels and dynamic range to be em- ployed. Clearly, given a basic information rate, trade-off among (1) numbers of channels, (Z) maximum frequency to be recorded, and (3) dyna- mic range required, are governed by the relation hi ? P- T i P j loge L? where = information rate in bits per second n = number of channels maximum frequency of jth channel in cycles per second maximum level of jth channel required least count of jth channel L1 = This relation presumes, of course, a substan- tially noise-free system. Naturally one is faced with a compromise in deciding how much flexi- bility to provide in order to take maximum advan- tage of this relationship over a broad range of possible experimental requirements. We would not want to suggest that the system to be de- scribed (ANDIT) is an optimum compromise; in fact, second generation models have tended toward less flexibility. ANDIT equipment (Figure 8) affords a buffer store with an information rate capability of approximately 200 bits per second. This com- pares with direct galvanometer recorders (14 bits/second per channel) and electronically- driven galvanometer recorders (700 bits/second per channel). Experience with strip chart recorders would suggest that two additional buffer stores capabilities, one with information rates in the region of 1,000 bits/second per channel, and the other with rates around 50,000 bits/second per channel would have utility for experimental research investigations. Naturally if one can accommodate both rates in a single economical system, that approach would be superior. Figure 8 DIGITAL RECORDING EQUIPMENT 515 Arrrtwari Frtr Palaaca 9ringing/(19 ? rha_pno7ARnA77nAnn9lnrinlrin9cLA Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH The ANDIT equipment characterizes an ap- proach to buffer storage in which analog to digital conversion occurs prior to making the permanent record. This feature exhibits the advantage that channel multiplexing and therefore flexible em- ployment of the available information rate is facilitated, but has the disadvantage for higher information rates that it places a more expensive piece of equipment at each experiment, thus raising the cost of mass data reduction to each experiment. Another approach, that of using analog buffer stores, is characterized for precision records by magnetic tape instrumentation recorders. This approach has been explored commercially and instruments are available covering information rates up to approximately 200,000 bits/second per channel. These systems are more expensive than the strip charts which they would replace, but the added capital investment is more than off- set by the elimination of costly manual conversion. When analog buffer stores are used, it is neces- sary to accompany those stores with a suitable analog-to-digital conversiqn facility at the general purpose computer. An example of such a system roughly compatible with requirements for analog buffer stores has already been discussed in con- nection with speech research interests. Clearly flexible record-playback speeds are desirable to match the conversion system effectively. The issue of allowable record length impinges on both the buffer store and on design of analog- to-digital converter systems feeding directly into a main frame. Characteristically, as the infor- mation rate increases for a buffer store, the allowable coherent record length decreases. In the course of conversion, analog-to-digital, for injection into the computer, a similar limitation is encountered. Whenever the information rate is sufficiently great that transfer to tape is not possible in the course of data breaks, the core storage of the computer eventually is exceeded and conversion must be discontinued until transfer to tape can be effected. Naturally, this operation produces data discontinuities unless techniques are employed to ensure coherence from one record to the next. So far, the Laboratory has not in its research encountered this problem to the extent that special techniques needed to be created. Analog-to-digital converter systems for use in mass data reduction do not constitute an un- usually sophisticated design problem. Converters with speeds acceptable for this service are avail- able and can usually be adapted for coupling to the real-time input bus of a general purpose machine. The system described under speech recognition input systems is characteristic of such a device. High speed printers and x-y plotters are obviously still effective output systems for re- search purposes. In this discussion, we are interested only in new approaches to display of data which may afford solutions to some of the more recent research problems being undertaken, and will not touch on advances in printer speed or plotter facility per Be. For example, this Laboratory has constructed for its research purposes an output system which 516 permits plotting the time-varying results of computer reduced data on a high-speed photo- graphic-trace oscillograph, and this kind of a facility is characteristic of the special-purpose output systems we are interested in discussing in this paper. At present, we have partially completed the design of a photographic output system for use in conjunction with the photographic input system already described. Such a capability is important if one is attempting to design two-dimensional spatial filters. Thus, the research worker is able to assess easily the results of a filtering operation on a given piece of input material. We expect, however, that this output system may have broader application than to studies in automatic photointerpretation and pattern recog- nition. For example, it affords the interesting Opportunity to read-out computed curve plots in conjunction with a graphical format which was read into the machine through the photographic input. Thus, for example, a research worker could insert a log-log or polar or other graphical record format into the machine, complete with titles, ordinates, and perhaps even the symbols to be used by plotting each of several curves, and having stored this format on tape, and generated the curve points with a computational program, read-out the composite result of both these processes through the photo output system. He is thus able to produce a complete graphical picture of the desired results. Such a picture would be entirely suitable for reproduction in reports or slides, and thus would facilitate a more economic reporting process. The potential for presenting numerical information in three dimensions, X, Y and density, should find increased application to report and documentation efforts. Imaginative employment of this technique of compositing graphical data derived from philosophically dif- ferent sources may very well lead to some unusual research techniques, unforeseen at present. CONCLUSIONS Evidence points to a well-established require- ment for pictorial input-output systems for mod- ern research in pattern recognition, signal proc- essing, and data reduction. Means for entering computers with time-varying data up to 20 kc for speech recognition and related research is an additional requirement. Such a system may at the same time serve the purposes for mass reduc- tion of data gathered by analog buffer stores in the 1,000 bit/second and above classes. Perfo- rated-paper-tape buffer-store input systems afford an economical input medium responsive to experimental needs which formerly were handled by ink-pen trace strip charts. This same perforated tape format can, if designed for suffi- cient flexibility, invade an area of recording formerly handled by electronically driven galva- nometer recorders. The principal merit of the perforated tape system is low cost equipment at the experiment site, ease of editing data before entry to the computer, and a reasonable degree of facility for injection of the data into the general purpose computer. Moves toward higher and higher perforated paper tape recording speeds may entirely satisfy the needs currently being Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CID'YRG-kT Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 handled by electronically driven galvanometer strip charts. However, systems (possibly analog instrumentation recorders) making available information rates in excess of 60, 000 bits/second per channel will be required for specialized experiments unless analog-to-digital conversion coupled with digital buffer stores of sufficient information rates can compete economically. If the latter becomes feasible, compatible format magnetic tape records would be a desirable feature. Photo Input/Output systems offer a potential for information compositing (curve-on-chart) as yet unexplored. ACKNOWLEDGEMENTS The authors wish to acknowledge the contri- butions made to this paper by the work of col- leagues which provided input-output examples. Cognitive Systems Section personnel, notably Dr. H. R. Leland, Head, Mr. G. E. Richmond and Mr. C. W. Swonger, recognized the need for the photo input-output system herein described and furnished empetus for its design and construction. The high speed analog-to-digital input system was constructed to meet specifications established by Mr. R.F Schneeberger. The ANDIT equipment was conceived, designed and first put in opera- tion by Mr. H. F. Meese for Terrain Avoidance research. Mr. C. L. Syverson and Mr. T. J. McDade have extended the use of perforated tape recorders in the design of special purpose systems for our Radar Cross-Section Ranges. The infor- mation compositing notion was suggested and implemented by Mr. M. B. Cohen who is respon- sible for operation of the Laboratory's IBM 704 computer services. We also wish to acknowledge the encouragement and interest of Mr. W. M. Kaushagen, Head of the Laboratory's Electronics Division, and of Dr. M. G. Spooner, Assistant Head of its Computer Research Department, in encouraging the development of input-output facilities for research purposes. 517 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 PP r Roloaco 2005/05/02 ? CIA RDP78B01770A002300030029 1 TWO-DIMENSIONAL SPATIAL FILTERING AND COMPUTERS W. D. Fryer and G. E. Richmond Cornell Aeronautical Laboratory, Inc. Buffalo, New York ABSTRACT CPYIGH Processing of two-dimensional signals has important applications, for example, in photographic image analysis, but when the weighting function of a two-dimensional linear filter extends over a large area, e. g., smoothing filters, digital realization via a two-dimensional convolution is prohibitively time consuming, and analog reali- zation is extremely difficult. The principal purpose of this paper is to show how a broad and useful class of two-dimensional filtering operations can have notably shortened execution time in the digital case, and be put into a particularly convenient form for electrical filtering. The procedure includes a reduction of dimension from two to one; transformation of two-sided (weighting function extends into both past and future) operations into one-sided (physically realizable) operations; and finally, for the digital case, the transformation of a direct many-term convolution expression into a compact recursive form ideally suited for digital computation. An important class of smoothing filters, with weighting functions approximately Gaussian, is derived and used for illustration. The result is a several-order-of-magnitude reduction in time for digital two-dimensional filters, and some interesting results applicable to one- dimensional zero-phase-shift filters. 1. INTRODUCTION Analog or digital processing of two-dimen- sional signals has many important applications, notably in photographic image analysis for mili- tary or commercial purposes. Much of this type of isiocessing is special purpose, tailored to the physically significant details within an image. But there are certain basic operations (e.g. high and low-pass filtering) that are extensions of ..or responding one-dimensional operations, and which have a similar range of usefulness. Because in two-dimensional processing by digital means, or by means of electrical filters, storage and processing time requirements are much greater than for one dimension, there is a genuine need for nontrivial methods to allevi- ate time and storage problems. This need is greatest when each computed point in the output image is affected by values from a relatively large area of the original or input image. Such 'An exception is the optical filter, not conside7.-ed in this paper. filters, called area filters, are exemplified by many smoothing operations; they form the principal topic of this paper. In contrast to the area filter is the local filter, in which the va of an output image point depends only upon a small neighborhood of the corresponding input image point. II. THE DIRECT CONVOLUTION APPROACH The direct approach to realization of an arbitrary filter operating on a two-dimensional input is explicit convolution of the impulse re- sponse of the filter with the input. In the digital case this takes the form: ym,n L - where subscripts on y (output) and (1) y (input) give the digitized coordinates of the image point, and h?? is the discrete two-dimensional impulse response or weighting function. In the case of a local filter the number of terms in this expression is small, and the method Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH is practical and often useful. For example, Kovasznay and Joseph (Ref. 1) describe an inter- esting analog equivalent to (1) used for outline enhancement. David (Ref. 2) reports a digital application of linear and nonlinear local opera- tions for noise reduction. In the case of an area filter, the large number of terms (121, for example, if / =J=5 required for each output point makes this approach impractical. III. THE SIMPLIFICATION STEPS Equation (1) of the direct method is the starting point for a number of drastic simplifica- tions. The goal is a set of one-dimensional filters, of simple recursive form for digital com- putation, and of physically realizable form for analog use. The simplification steps are: (a) Restriction of the weighting function or h ( , ft ) to a product form 9, or 1'0, ) 9 ( t, ) ; this special form allows the two-dimensional prob- lem to be decomposed into two one- dimensional problems. (b) Transformation of the one-dimension filtering operation, which has an impulse response extending into both positive and negative time values, with two filtering operations of the "physically realizable" form, in which output depends upon the "past" (impulse response vanishes for negative argument). (c) In the digital case, conversion of the one-dimensional filter into a recursive form so that only very few terms appear In the computation of an output value, even though the effective memory extends far into the past. These items are discussed in the following four sections. IV. REDUCTION OF DIMENSIONALITY The first major simplification of (1) is the reduction in dimensionality from two to one, by restricting (with some loss of generality) the weighting function of two variables to be of product form h41= '9, (discrete) or h (e? e2) - ) 9 ) (continuous) Then the complete double summation of (1) may be replaced by two sets of calculations, each only having a single sum; omitting details, and again writing only the digital equations, we have: r Yrn,n Here, is an intermediate result, com- puted from the 8, then regarded as input variable for final computation of the desired values. The first computation (2) is (for any fixed r ) an ordinary one-dimensional filter, operating on one horizontal!' line of the image; the parameter r identifies which line. Similarly, the second computation (3) is, for any fixed fr7 , an ordinary one-dimensional filter, operating on a vertical line2 of the y image; the parameter rn identifies which vertical line. (2) (3) To illustrate the general effect in terms of saving computing time, suppose that a 100 x 100 grid of picture intensity values is to be filtered, and assume that the weighting function of (1) extends 10 terms in each of the possible directions ( / = J 10 ). With edge effects ignored, the application of (1) directly would require 21 x 21 = 441 operations (an operation consisting of a multiplication and addition) for each proces- sed point, for a total number of 4, 410,000 oper- ations. The corresponding double application of one-dimensional filters, according to (2) and (3), would require 21 + 21 = 42 operations per proces- sed point, for a total number of 420,000 operations -- less than 1/10 as many. zWe arbitrarily identify horizontal lines with fixed second subscript, varying first subscript, and vertical lines with contrary conditions, in analogy with the continuous form tz) where the first variable normally gives the abscissa value and the second gives the ordinate value. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 CPYRGH Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 V. SIMPLIFICATION OF THE ONE-DIMENSIONAL FILTERS The following approach towards simplifying the one-dimensional filters of (2) and (3) gives results definitely useful in originally one-dimen- sional filtering problems, as well as in the pres- ent context as intermediate aids for two-dimen- sional filtering. It is convenient to drop the redundant double subscript, and begin with the generic form (open- loop, double-sided) form that is implied by (2), (3) and the preceding discussion: "in 2 hk Xn-A y I '1.4(ri x(/- r)ir (discrete) (continuous), (4) and it is convenient to regard a subscript as a time value (e.g., y? is the value of y at the quantized time value n ). Equation (4) could be used in its existing form for digital computation. But for the case where the number of terms is large, there is a more efficient method most conveniently derived by orienting our terminology towards the continu- ous-time situation, so that we can simultaneously develop a method suited to electrical filtering and of a form that can be converted to an efficient re- cursive digital filter. Write h (r)- h_(r) , where 17, vanishes for negative time, and h_ vanishes for positive time. The h?. part could be the im- pulse response of a realizable filter, assumed to have rational transfer function 6 (s) with its poles in the left half plane. The h_ part may be achieved with a realizable filter by reversing the time variable, for example, by storing the input on magnetic tape, then playing the tape backwards. A reversed-time signal passed through a realizable filter with transfer function gives a result that is formally equi- valent (in the sense of a bilateral Laplace trans- form) to passing the original forward-time signal through a filter with transfer function f;(-f ) . Thus we may speak of filters with poles in the right half plane, with the understanding that they refer to a physically realizable filter driven by a time-reversed version of an input signal. 3 Thus, if the original picture is processed with filter having transfer function f; (5) then the original picture is filtered independently with filter having transfer function F, (- 5 ) (actually accomplished by running the input signal backwards through a filter with transfer function F,() ), and the results are added, the result- ant impulse response will be desired 17, h_ and the total effective transfer function will be The rational functions f; (j) and 1", ) may be combined into a single multiplicative expression, of the form where G, contains all of the poles in the left half plane and G, contains all of the poles in the right half plane. Thus, we have a cascade form: rather than filter the original picture twice and add results, as in the previous paragraph, one could filter the original picture with 4, , then filter that result with G? . In the digital case, the latter has the advantage of eliminating the need for duplicate storage space, by means which, if not obvious, are simple. For analog filtering the' cascade form is more convenient and avoids various practical synchronization diffi- culties. VI. DIGITAL ONE-SIDED RECURSION FILTERS An open-loop one-sided digital filter has the form Yn hoXn # '41 Yn--, h2xn-2 '" ? ? ? the expression possibly being infinite in extent. "Open-loop" refers to the fact that y is expres- sed only in terms of 's (inputs); "one-sided" refers to the fact that only present (time n ) and past values of input are used to determine output. For example, the one-aided open-loop digital filter corresponding to an exponential impulse response (simple RC lag filter) is Y0 a2,0-2 ? ? . (5) Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 Approved For Release 2005/05/02: CIA-RDP78604770A002300030029-4 CPYRGH If, as in this example, the number of terms is infinite, then digital computation can be perform- ed only by approximating with a finite number of terms. But if ce in the expression above is fairly close to unity, (e.g., 0.99 ), it is possible that several hundreds of terms would be required for satisfactory approximation. A one-sided feedback digital filter equivalent to (5) has the form Xn (6) where cc is the same one as in (5). This form is of course particularly suited to digital compu- tation, since it contains only two terms. "Feed- back filter" here refers to the fact that output depends explicitly on prior output, as well as the y input; this type of filter is also commonly called a recursive (or recursion) filter. VII. RECURSION FORMULAS FROM TRANSFER FUNCTIONS The general problem of converting a transfer function (such as our (3) , if additive form is used, or c, (3) if cascade form is used) into a digital recursion relationship is easily solved by means of a method presented at the 1961 NEC (Ref. 3, 4), which is described here only in necessarily very sketchy form. In the expression for a transfer function , make the substitution 2 1/- , clear of extra- neous 142 fractional forms and normalize numerator and denominator into the following form: ao a,i ,4-4.,Z27, ? ? ? i? afrnZm I 1611 bz Zz ? ? ? .kbrz" (7) ( Z can be interpreted as the delay operator, with Laplace transform 6,- sr ). Then the recursion formula becomes Yn )111 - I. ? ? ? 17 - ra-r " "hr yn-r ? The quantity T is a time scaling parameter; in this application, it relates the time variable of (8) 4 the Laplace transform to the separation between adjacent sample values. To illustrate, we use a transfer function that will later be used to illustrate another aspect of the two-dimensional filtering problem; 2 # 35 The impulse response has time constants of the order of one second in real time. Suppose we desire that over one of these time constants there would be 10 picture elements (in a rough sense, the memory covers ten picture elements, if memory is taken to be a nominal time constant). Then T = 0.1. Use of the substitution described above gives for the final recursion formula: 4100849 (in ZYn-/ #)n-j) # 1.95/y/74? 0.97(96yn_1 VIII. ILLUSTRATIVE EXAMPLE In many two-dimensional filtering applica- tions, it is desired to have an impulse response that is circularly symmetrical. The purpose of this section is to show how one class of such impulse responses can be approximately realized with the multiple application of one-dimensional filters as described above. Circular symmetry requires that the filter impulse response h be a function of Thus the one-dimensional filter g(t) which is to be applied in each direction must be chosen such that h (t?tz)= ksezth 9(4.z ) F (t;) =y (t; ) . A solution of this functional equation is (9) (10) In accordance with the previously prescribed procedure, we now attempt to find a one-sided, one-dimensional filter whose impulse response is f(t) = e-ez > 0 0 t the equation is termed Nth order. One will then require at least N storage locations in a digital computer, or at least N delay lines in a recircu- lating delay-line digital system. It is important to note that for N> 0 the g(k) sequence will usually not ter- minate. The system smoothed-velocity output sequence ? can be given by 4 = E g(k)x,_k where Mk) =the input position to output velocity weighting sequence or unit-impulse re- sponse. (3a) (3b) A difference equation for the velocity output could also be written. One can also combine the difference equa- tions for position and velocity into a set of difference equations. For example, the common [2 "a-0" or "g-h" (g =a and h=0) equations become Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 r.PYRnH 28 ApprovphIEFFAMOg/202Aplib?,114F17BEIgnw300030029-4 = xo,? ce(OC,, ? Xpn) (4a) X.. = 74-1 --(x ? xpn) (4b) 7' = + (4c) where T =sampling period and predicted position at n+1 time index. The set (4) is second order, as can be seen by writing them in terms of only one output variable. Z transform 13] one can characterize a sampled- data tracker by a multiplicative operation Y(Z) = G(Z)X(Z) (5) and (Z) G(Z)X(Z) (6) where Z = ei'T is a forward time shift and the trans- forms for signals are given, for example, by ,v(z) = E (7) n. 0 kiid for the system, for example, by 6,(Z) E g(n)Z'. The inversion is given, for example, by gr(n) = 1 ci(z)zn triZ. 27ri r (8) (9) The indicated contour of integration, F, is the unit- radius circle in the Z plane. Ill. PERFORMANCE MEASURES n. order to assess properly the two attributes of noise reduction and transient performance, two measures are introduced; in every case their effects will be considered si inultaneously. For noise smoothing, the performance measures will be, for constant input variance, the variance reduction ratios steady-state variance in position output variance in raw position input si eadv-state variance in velocity output K?(0) ? - - variance in raw position input The notation K.(0) and Ki(0) is used to coincide with the definitions of .K.-,r(n) and K(n) the input-normalized autocorrelat ion sequences of the position and velocity outputs. K?(0) and K(0) are calculated (for uncor- related scan-to-scan input noise) from the corresponding unit-impulse responses g(n) and g(n) by the formulas' See Appendix I. K(0) = E g2(n) July (10a) Ki(0) = E g.i2(n). (1(11)) For transient (maneuver-following) performance, the demerit figures will be [referring to (1)1 and = E [(unit-increment ramp) ? (position ramp response)] 2 n=0 E [n ? gx(j)(n ? j) n=0 j=0 12 = E [(velocity of unit-increment ramp) (velocity ramp response)] 2 n E ? E Mj)(n ? . n=0 j=0 Ramp-type test inputs of component position are se- lected because they are realistic for radar TWS opera- tion on airplanes (sudden heading changes) and are also realistic for any sampled-data tracking start-up tran- sients. Meanwhile, consider the evaluation of arbitrary tracking systems on these performance bases. See Fig. I. Here arbitrary tracking systems are compared. One seeks to design a system which is "close" to the origin. However, for Tracker A, as parameter p is increased so as to decrease Di', .K?(0) increases. Tracker B is clearly better than Tracker A in Region I; Tracker A is clearly superior in Region II. The optimum tracker by definition has a single-parameter locus where the (12) TUCKER El r 3p, TRACKER A TPACKER C NOISE - SMOOTHING DEMERI F. K,(0) Fig. 1 ?Arbitrary system position performance. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 1962 Benedict and Bordner: Radar Track-While-Scan Smoothing Equations Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 ray from the origin is shortest everywhere. Such a tracker is illustrated by Tracker C on Fig. 1. The equations derived in the next section will be optimal for position and velocity tracking simultane- ously. Some concrete examples are now given. Example 1: For the a-13 tracker as listed in Section I, if one calcu- lates the impulse response and thence the performance measures, he obtains Table I. Some parameter pairs (a, (3) will be poor compromises. The pairs (a, a2/2 ?a) were shown analytically to form the optimal locus for the (a, 0) tracker in position and velocity. This analysis is performed by 1) assuming K(0) a constant, then dK = 0; 2) the value of a (or (3) that minimizes D cor- responding to the constant value of K is found by equating dD/da to zero; and 3) solving dK =0 and dD/da=0 for 13 in terms of a. These loci are plotted on Fig. 2a and 2b. TABLE I PERFORMANCE MEASURES FOR THE a-0 TRACKER Position Output Velocity Output Variance Reduction Ratio Transient Performance 2a2+13(2-3a) 1 202 a [4-0-2a] (2?a)(1?a)2 T2 a [4? 2a-0] 1 a2(2 ? a) +213(1 ? a) D2= ? (X0 [4 2a] T2 00 [4? 2a ? /3] Example 2: A first-order (one pole, two zeros) tracker is given by xn+i = C( an CiXn C2X7,--1 CSX,,-2. (13) For the conditions co + ci c2 c3 = 1 (unit dc gain) C2 = 3 ? 2c0 ? 2c3 (zero steady-state ramp-following error) one has the measures 13? 15c0+ 4632? 2c1(8 ? 7co+ co2) 2c12(3 ?co) K(0)= (14) 1+co (2 ? ci)2 .W=1+ 1 ? co2 (15) These measures are plotted parametrically on Fig. 3. This tracker shows up everywhere worse than the opti- mal (Li = a2/2 ?a) a?(3 tracker.2 2 This judgment will be lessened in magnitude, but not reversed, if the first-order tracker used present input and present output. Noise demerit T2K-(0) (a) Noise demerit K.(0) (b) Fig. 2?(a) Tracker velocity performance. (b) Tracker position performance. 1.35.1.111171/10 PERFORNAIIER OF FIRST- ORDER TRACKER Variance Reduction Ratio K.(0) Fig. 3?First-order tracker position performance. Approved For Release 2005/05/02 : CIA-RDP78604770A002300030029-4 2 CPYRGI fiTy fi) IRE TRANSACTIONS ON Approved For Release 2005/05/02 I V. SYNTHESES BY CALCULUS OF VARIATIONS Suppose now that one wishes to find the optimal set of tracking equations. That is, it is desired to select perfectly freely the impulse responses Mn) and g(n) so as to minimize D,,2 for a given IC(0) and vice versa, and so as to minimize /3?2 for a given tri-(0) and vice versa. Taking the velocity case for an example, one wishes to minimize = Ki(0) ADth2 (16) where X is the Lagrangian multiplier (and the final single parameter). Letting r(n) be the optimal unit-increment ramp re- sponse, one has the double-difference identit gi(n)= r(n + 1) ? 2r(n) r(n 1). (17) Then 3 = E [ I r(n + 1) ? 2r(n) r(n AUTOMATIC cosTnw : CIA-RDP78B04770A00200030029-4 Note that (22) holds only for n>0 since the fact that h(n) =0, n

Printer-friendly version

Search form

REPORT ON PERCEPTRON ((Sanitized)AND CONFLEX(Sanitized)) CONCERNING AUTOMATED IMAGE RECOGNITION