版權(quán)說(shuō)明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
1、Analyzing the Application of a Reverse Engineering Process to a RealSituationFabio Abbattista (*), Gregorio M.G. Fatone (*), Filippo Lanubile (*), Giuseppe Visaggio (*)(*) Dipartimento di Inform atica, University of Bari, Italy(*) Basica S.p.A., Potenza, ItalyAbstractA reverse engineering process mo
2、del was applied and on the basis of the data collected, some modifications were made aiming to improve its efficacy.The experience gave rise to various considerations of interest, first among them being the clear interaction between the quality of the product and the quality of the process. A method
3、 of synergetic application of static and dynamic analysis to improve understanding of the program was consolidated. The experience enabled modifications to be introduced connecting the reverse engineering process more closely with the understanding of the programs and information deriving from the a
4、pplication domain.Finally, the problem of the efficacy of the tools used to obtain the reverse engineering products was made evident during the experimentation on the field.1: IntroductionWe present an experience in which process qualityand product quality interact and mutually improve one another.
5、The process is reverse engineering while the product is the documentation of programs necessary to exploit the program better. The salient point to be gained from the experience are, in general, the model as it appears after improvements stemming from trial on the field and, in particular, the metho
6、d for integrating static and dynamic analyses to improve the process.The paper describes the application of a process model to a real situation. The scenario is a set of programs with the following characteristics:language:COBOLoperating system:BS2000total no. of programs:653no. of on-line programs:
7、348no. of batch programs:305no. of files:70no. of data:9 000no. of Instructions:900 0002: The reverse engineering processOur reverse engineering process had two main objectives: (1) to increase the ease of maintenance of the software system, and (2) to improve its usability by the final users and th
8、e ease of knowledge transfer among different users 5.The first involved reconstruction of the project design documentation and restoration of the most degraded arts while the second required reconstruction of the user documentation and the data conceptual model. This elps users to understand their o
9、wn information system etter from the point of view of the data processed 4,6.Figure 1 shows the process model, which is briefly described underneath. Further details may be obtained from 10.1. Inventory Software System. The following cross references are extracted from theold software system:call de
10、pendence X-ref, copybook X-ref, and file access X-ref.2. Reconstruct Logical Level of Data. From the data description, the hierarchical structure constituting the logical data model is reconstructed. The aliases are recognizable entities in the application domain. Figure 2 shows a record declaration
11、, as input to the phase, and the relative hierarchical diagram, as output from the phase.The arrows represent the hierarchical relationships between the substructures in the record; elementary data are not shown. The data are classified as application domain data, control data, and structural data.A
12、pplication domain data are the attributes of recognizable entities in the application domain. For example, the field named MT02-02 representing the amount is also an attribute of MORTGAGE and is therefore recognizable in the application domain.Control data have no correspondence with the application
13、 domain but are used to record the occurrence of an event during the execution of a program, so that other programs can adapt accordingly; flags validated by one program and used by others, asynchronously, to determine their behavior according to the previous history of the software system, are typi
14、cal examples of control data. For example, the field MT02-33 is preset to indicate the existence of an agreement for taking out the mortgage, which must form the basis of the variation in interest rate to be applied when calculating the instalment; it is a typical example of control data.Structural
15、data are data necessary for managing the organization of data bases or files. The field MT01-05 is a typical example of structural data, because it identifies the record type inside the MORTGAGE file.3. Abstract Data. All the data in the application domain which belong to the logical model and are n
16、ot dead are associated with the corresponding meaningful concept for the application domain 1.4. Analysis of Existing Information. This activity involves identifying the expected functions in the program being reversed using two types of information.The first is static knowledge, i. e. the internal
17、and external documentation of the rules governing the application domain of the function. The second is dynamic, derived from the experience of the programmers and users who interact with the workingprograms.Figure 2. Input and output of Reconstruct Logical Level of Data phaseFigure 3. Equivalence t
18、est of the system before and after reverse engineering5. Reconstruct Logical Level of Programs. Each program is associated with a structure chart in which each module corresponds to a SECTION or an external subroutine of the program. In this phase, both dead data and dead instructions are identified
19、. These are data not used by the program and instructions which cannot be run, respectively. The former are communicated to the Abstract Data activity while the latter are erased from the structure chart , which thus constitutes the logical model of the functions.6. Restore Logical Model. Restoratio
20、n involves introducing changes to improve the structure of the programs and make them easier to maintain, without causing repercussions on the data or interfaces with other systems. Some examples ofmodifications are renaming of variables, making their identifiers more meaningful; extracting modules
21、with high internal cohesion from those with low cohesion and isolating them in the structure (7, 8, 12, 13); externalizing modules which, in the present process, are in line with the main;localizing variables declared to be global but used locally in both existing modules and in processes extracted
22、during restoration. Execution of these activities is facilitated by the expected functions derived from the phase of analysis of existing information. In fact, thanks to this knowledge, the operators can extract the functions from the modules present in the logical model. This makes the logical mode
23、l more readable and its modules less complex.7. Abstract Functions. The functions abstracted during restoration are documented. The aim of each function is described in textual form. The relationships between functions are also documented by means of data flow diagrams. The latter, together with the
24、 description of each function, constitute the conceptual model 9.The reverse engineering process is not symmetrical because the programs are restored while the data are not,because any interference with the latter would affect the procedures and make the whole restoration process very expensive. In
25、fact, restoration of the programs is confined to the instructions of each single program, a much simpler and more economical process.The reverse engineering process described modifies code, so that it is necessary to verify that the working programs are equivalent to those produced by the process. A
26、 test process is used and, as the only reliable component in the working system is the code in question, it is only possible to test the equivalence between the actual and the reversed program. Test cases obtained during normal working of the actual system are used. Theequivalence test is modeled in
27、 Figure 3.3: Operative resultsThe planned process was put in production in the scenario described earlier and after seventeen calendar months of work on a production line, the first results were obtained. It should be noted that a production line refers to an organizational unit which has all the re
28、sources required for executing the process autonomously. In this case, the production line is composed of eight reverse operators and one reverse engineer. The former execute theprocedures according to the defined process models while the latter coordinates activities and takes all the decisions nec
29、essary for solving all indeterminate points in the execution procedures. The production line shares an expert in the application domain with other organizational units in the company.These first operative results can be analyzed from the point of view of both efficiency and efficacy. Although effici
30、ency was not the main aim of this work, the data on the activities for reverse engineering of the programs and of the data are summarized in Tables I and II,respectively. Two important considerations can be made.The productivity of the operators for reverse engineering the programs is correlated wit
31、h their experience in the application domain; this explains the differences seen in Table I. In reverse engineering the data, the difference in productivity has less correlation with experience because there is very little automation of the activities and so the man time required is very high in any
32、 case.The second point is that commercially available tools are often inadequate for large projects. For example, the tool used in extracting the data structure becomes unacceptably slow when access to previously inserted information is required, if the data are more than a thousand or if access is
33、to an entity with more than one hundred attributes. The tool used for the programs, on the other hand, shows an abrupt drop in performance (answers to questions on data and control flow slow down) as soon as the threshold of 8000 lines of code is passed. Clearly, inefficient automatic tools require
34、more man time to attain the objectives.As regards the efficacy of the process, the following points can be made.The logical data model is not very useful to the maintainer because he can read the information he needs more easily from the record layout described in code.This is due to the complicated
35、 structure of the files in the old system, which is difficult to clarify and represent. It is necessary, however, to know the relationships existing between the files that manage the system; there are many of these, so maintenance is a high risk procedure. For example, the relationships between the
36、CUSTOMERS INFORMATION file and the MORTGAGE: CLIENT TAKES OUT MORTGAGE is expressed in the field MT02-01 which represents the customers number and unequivocally identified in the CUSTOMERS INFORMATION file all the information on the above client.Table III. Complexity of some programs and their extra
37、cted modulesDuring restoration, thanks to BACHMANs tool 2 and VIASOFTs tool 14, and to the techniques used,the reverse operator extracts a lot of information which can not be expressed in any of the documents produced.In particular, for many modules in the restored program,he will know not only thei
38、r description but the algorithms themselves contained in the module. In fact,referring back in Figure 1, the document including the description of the behavior of the modules belongs to the flow named restored logical model. Only a textual, not a formal, description of the aim of the module is provi
39、ded for.The data in the files formalize many design decisions taken during the past life cycle of the system, whose reasoning has been lost. As they affect the actual structure of the programs, their inadequate use prevents the reverse operator from being able to extract some functions implemented b
40、y the code, derived from these decisions which have left no traceable reasoning. For example, the field MT02-08, which represents the total amount the client must pay in instalments to pay off the mortgage, is calculated from the mortgage amount(MT02-02), the total number of instalments (MT02-05) an
41、d the interest rate (MT02-03). The design decision has decreed that this datum be stored rather than calculated each time. The main consequence is that many modules identified during the reverse engineering process have low cohesion and high complexity and will obviously be difficult to maintain. Ta
42、ble III shows the complexity of sets of modules extracted from some programs, which is,in some cases, still very high. Performing test cases helps to understand very complex modules.4: Modifying the processAfter the first experimental period on the field, some opportune modifications were made to th
43、e process model and to the product.To increase the reverse operators efficiency,depending on prior knowledge of the application domain, systematic training is necessary. It is not possible to have specialized reverse operators for eachdomain because this would make the system to be reversed” - “oper
44、ator to be used pairing far too rigid.We therefore decided to alter the process, as can be observed in Figure 3, inserting training activities which,by using the expected functions, explain to the operators what they should find in theprograms and in the data they process. This activity is carried o
45、ut by the expert in the application domain and aims to provide the operatorIt is not possible to formalize in the process the information that the tools, even whencommerciallyknown, must be carefully assessed for their efficiency,not only for their efficacy. Fortunately, in this case, the most ineff
46、icient tool was the one which derived the structure of the files, so reconstruction of the logical level of the data was modified (see Figure 2) to produce the classification of the data asconceptual, control or structural, and the description of the relationships between files.Figure 5. Original mo
47、dule for dynamic slicingThe formalization of the modules is included in the structure of the logical model deliverable, whose compilation standard changes. To increase the abstraction of the information on the decisions affectingthe data, it is best that the control and structural data, first, and t
48、he calculated data, successively, be analyzed by the application domain expert so that he can complete the set of expected functions, where necessary. Thus, in the process model, changes have been made to increase thecommunication between the data and the process reverse engineering (see Figure 4).F
49、inally, the experience with the test cases suggested using them to perform dynamic slicing of too complex modules, during the restoration phase. A formal description of this dynamic slicing can be found in 11.Given a module M and supposing it contains a set of functions (F1,., Fn); in the set of tes
50、t cases used to verify the functions of M, some subsets of test cases(T1,., Tn) must be identified, in which the generic Ti contains the equivalence classes of function Fi. Thus the test cases Ti will activate in M all and only the instructions that implement the function Fi. These instructions cons
51、titute the module Mi corresponding to the function Fi. Once the modules (M1,.,Mn) corresponding to (F1,.,Fn) have been extracted, a complementary module Mc to the modules (M1,.,Mn) can be found in M. Mc will be the module managing(M1,.,Mn).For example, for the module shown in Figure 5, the data for
52、which the test cases have been constructed are:IDX-ETI-002 :1,.,20Index of types of transaction tableFLAGTR-WS :0,.,9Work area flag whose value is paired with the index of types of transaction table.The combined values of these two variables specify the exact nature of the transaction the client int
53、ends to make with the bank.KI-0 :true, falseControl variable which states the result of access to the file. If true, access exists, if false, then access to the file was not possible.The value of this variable is tested (in the cases in Figure 6) only on exit from PERFORM, which enables reading of t
54、he files.Thus the original module was substituted by the structure shown in Figure 7.The complexity is modified as shown in Table IV.This is formalized in the reverse engineering process model by modifying the procedure corresponding to the activity “reconstruct logical level of programs. In fact fo
55、r all modules exceeding a predefined level of complexity, dynamic slicing will be performed using the test cases relative to the module in question.Figure 6. Results of dynamic slicingFigure 7. Program structure after dynamic slicing5: Conclusions Table IV. Module complexity before and after extract
56、ionThis paper describes the results of experimentation on the field of a reverse engineering process and theimprovements made on the basis of the data obtained.The feedback gained from the quality of the product and the quality of the process is particularlyenlightening. In this case, the usability
57、of the documentation to understand the programs bettersuggested some improvements to the process. This feedback enabled closer connection to be made in theprocess model between the static and the dynamic information which can be extracted from the application domain and the user context of programs.
58、In addition, a synergetic analysis in the process of learning about the programs was defined. The programs to be documented were so complex that dynamic analysis alone would have been inadequate: in fact, in the original programs, each test case would have activated so manyinstructions that a great deal of man time would have been required to understand them. Instead, using prior static analysis, the programs were decomposed and classified essentially into twotypes:modules whose aim and behavior are clear, and modules whose aim is clear but behavior is not. The latter, more complex
溫馨提示
- 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025年度CIF和FOB貨物進(jìn)出口合同風(fēng)險(xiǎn)評(píng)估與預(yù)防3篇
- 2024年航空公司與在線旅游平臺(tái)之間的合作協(xié)議
- 2024年版權(quán)使用許可合同詳細(xì)規(guī)定
- 2024年餐飲合伙協(xié)議書合同獨(dú)特模板
- 2024年研發(fā)成果試用協(xié)議
- 【名師輔導(dǎo)】2020高考語(yǔ)文二輪復(fù)習(xí)專項(xiàng)訓(xùn)練二-論述類文章閱讀(二)
- 2024正規(guī)離婚協(xié)議范本修訂版起草服務(wù)及婚姻財(cái)產(chǎn)分割合同3篇
- 2024標(biāo)準(zhǔn)采購(gòu)合同范本
- 梳棉胎產(chǎn)品質(zhì)量監(jiān)督抽查實(shí)施細(xì)則
- 推動(dòng)建筑品質(zhì)提升的創(chuàng)新策略與實(shí)踐路徑
- MOOC 傳熱學(xué)-西安交通大學(xué) 中國(guó)大學(xué)慕課答案
- 影視劇本創(chuàng)作與改編策劃
- 藥品配送服務(wù)應(yīng)急預(yù)案
- 山東省青島市市北區(qū)2023-2024學(xué)年七年級(jí)上學(xué)期期末地理試題
- 2024年?yáng)|方航空人力資源管理西北分公司招聘筆試參考題庫(kù)含答案解析
- 2023年海南省公務(wù)員錄用考試《行測(cè)》真題和答案解析
- 2024年人事行政工作總結(jié)與展望
- 冰晶石生產(chǎn)工藝
- 倉(cāng)庫(kù)風(fēng)險(xiǎn)應(yīng)急處置預(yù)案
- 提高員工服務(wù)態(tài)度與客戶滿意度
- 草本植物飲料研究預(yù)測(cè)報(bào)告
評(píng)論
0/150
提交評(píng)論