In healthcare documentation, one size does not fit all. Classical on-line computing, based on form entry rather than hierarchical outlines, has a limited ability to identify and track the differences that separate one individual’s illness and proper treatment from others with the same disease. Such differences matter. Due to fundamental biological variation many of the typological categories used to classify disease and select therapy are fuzzy, and only yield oversimplified results when classical logic is used. XML is ideally suited to establish truly parsable marked-up data bases for individualized care so that powerful modern computing modalities can be applied to these data. This suggests a broad task for improving the effectiveness of evidence based medicine over the next ten years: to bring the expertise of advanced computing science to Medical Informatics based on much improved “evidence.”
All | Details | |||
No entries found.
XML, by virtue of its flexibility and the suite of text annotating and text processing tools it can provide, enables health care computing to solve numerous problems that have held back patient record processing for more than thirty years. It is easy to see why. The critical factor limiting on-line healthcare documentation and computing is the devil that lies in the details: namely, all of those differences that separate one individual’s illness, and treatment from that of the next person with the same disease. In healthcare one size does not fit all. This is by no means new information. We need only to be reminded that a surgeon operates on one patient at a time and that each time a procedure is carried out it is different in some way from any other. Thus, it remains up to the art of medicine to adopt our very advanced science to the specifics of each patient’s situation. The matter is closely akin to engineering. Whenever a bridge must be built, the science of bridge building, the available technologies and the choice of materials must be fitted to the unique aspects of the particular site. Today in medicine, the organizational complexity that has accompanied our recent scientific and technological advances has made it more and more difficult to match the proper therapy to the specific patient. Efficiency has tempted us to automate the whole process in some standard way, either with paper forms or by computer. There are two parts to the problem of individualized care: documenting the relevant information, and then applying it. The hand written outline format, long used in generating paper based patient records, if followed with reasonable care, accomplishes the necessary capture of important individual differences quite well so that practicing physicians are very reluctant to give it up. At the same time, given our present knowledge, such hand written documents cannot take advantage of the enhancements to medical judgment that computing power can provide. However, in stark contrast to these flexible outlines on paper, the fixed forms and formats of most legacy and present day healthcare information systems (HIS) are typical of their computing architecture and also appear as paper forms for transcription. These squeeze out patient specifics to favor the consistency of a narrowly structured data base. Computing with such data bases is easy enough, but the necessary subtlety contained in patient differences goes missing. Fixed formats completely ignore the fact that the “average patient” and the expected “average circumstances” on which their data fields have been based are misleading statistical fictions. There is no average Everyman. Moreover it is the patients who deviate the most from the average who generally require the most attention and care, the most documentation, and the most justification. Indeed, 80% of physicians’ time is spent on the 20% of the patients who are not under good control. It is important to recognize that the frustrating variations that appear everywhere in human biology are fundamental, not superficial. In fact, they are why we are here at all. Ernst Mayr, one of our greatest epidemiologists, conclusively demonstrated early in the twentieth century that if biological species were entirely fixed by an unalterable underlying chromosomal code there could be no evolution! (And, of course, there is!) Instead every species consists of a population of distinctly different individuals that result from variations expressed through their genomes. Then, under the selective pressures of competition, those individuals within a species flourish or at least survive who are better (but not always best) adapted to their immediate situation [1,2,3,4]. Thus, although we humans resemble each other rather closely, each of us has a specific DNA sequence which defines our unique genetic make-up and can be used to identify us. These differences, plus those that develop over life, can be all important with respect to health. They count heavily when we are challenged by disease. Healthcare is about helping individual people alive today to survive and flourish despite such challenges. However, there is also a dark side to variation and evolution: every year we are faced with a new and somewhat different influenza virus. The AIDS viruses vary and escape our control. Antibiotics loose their efficacy because microbes, by their random variation and sharing of information, become immune to a succession of such products. The same phenomenon holds true for crop control. Moreover, biological variation poses another very deep problem for medicine: that many of our most important diagnoses, the typological categories used in classifying disease and selecting therapy, cannot be identified and defined by a set of necessary and sufficient conditions in classical, black and white Aristotelian fashion. Some benign groups of bacteria include dangerous members. The E-coli group, which each of us carry in our gut, also includes the O157:H7 lethal strain present in cattle that can be ingested in incompletely cooked hamburger. Many diagnostic categories shade off at the margins into other categories. Our tumor classifications are often only approximate. In many critical situations there turn out to be no perfect rules that can clearly identify the limits of a particular cancer diagnosis and separate it cleanly from neighboring ones. Just as in matching sets of fingerprints, the last stage in judgement is often an art, not a science. Yet these judgments about diagnoses advise us about prognosis and proper treatment. Biological variation is everywhere. Blood vessels and nerves usually follow expected anatomic paths from one individual to the next, but there are always exceptions. For example, occasional individuals have their organs reversed, with the heart on the right side and the liver on the left. In a more trivial example (but a particularly embarrassing one for me in my first “procedure” as a physician in training), two percent of the time the great toe is enervated from below by the plantaris nerve rather than from above. This doesn’t seem like a lot, but it applies to one out of fifty patients. Moreover, it can matter. In such cases the classic ankle block anesthesia used in anesthetizing and removing a great toenail will not work. My first patient, a voluble paraplegic Italian lady left the operating room yelling: “Fakers!” As we learn more and more about molecular biology, it is not hard to see why similar issues occur at every turn and every level. Consider cystic fibrosis (mucoviscidosis). In this relatively common genetic disease long sequences of enzyme related reactions are involved in making the proper kind of lubricating mucous in the bronchi that must be of the proper consistency to be cleared from the lungs. All of these steps are created by portions of the responsible genetic code. Moreover, the same genetic defect can cause serious disease in one individual and be well compensated for in another. Thus, the more we know, the more evident it is that many of our guides to therapy and prognosis can never be more than imprecise fuzzy sets in which individual differences can be crucial. Similar statistical considerations apply to the “normal” range of human development. Some children talk at nine months, others at three years. Indeed, fuzzy categories are pervasive in biology and in human social settings. We cannot and should not try to avoid them. Steven Pinker cites example after example. Here is one: “... the mind distinguishes between a fuzzy boundary, and no boundary at all. ‘Adult’ and ‘child’ are fuzzy categories, which is why we [in America] could raise the drinking age to twenty-one or lower the voting age to eighteen. But that did not put us on a slippery slope in which we eventually raised the drinking age to fifty or lowered the voting age to five. Those policies would really violate our concept of ‘child’ and ‘adult,’ fuzzy though their boundaries may be... [we can nevertheless apply them judiciously]” [5]. A tradeoff among practical concerns decides (or should decide), at least for a particular time, in a particular context, with particular objectives and at a particular state of knowledge, where the boundary should be set [6]. Moreover, in specific circumstances even these boundaries can allow for exceptions that are the result of the fuzzy nature of the underlying concepts and the still incomplete multivariate considerations behind their formulation. Following the example above, we might cite the growing (but often dubious) practice of trying some minors as adults for major crimes in the US. In constructing categories that impinge on or involve powerful emotions, or when overriding the boundaries of our constructs, it is important to hew as closely as possible to known facts and their consequences [6,7]. Such indistinct thresholds violate the rules for computed calculations using a black and white logic which are so amenable to everyday computing and so often favored in argument. Thus, in implementing effective automated enhancements to the logic of medicine, it is variation and variety that must first be addressed properly and incorporated in the subsequent calculations. XML is uniquely able to provide this subtlety with respect to the collection and organization of data as it is enhanced by mark-up to become information. Under the aegis of the various medical computing standards committees in coordination with the W3C consortium extensive work on patient record markup has advanced rapidly [7]. The Clinical Document Architecture (CDA) under HL7 (Health Level Seven) is largely based in America [8,9], and is closely allied to the coding schemes under CEN, the European Community for Standardization, which has the particular objective of crossing language and cultural boundaries. The Net can reveal many more examples [10,11]. However, to actually enhance judgment such XML data bases must then be linked to other powerful programmable concepts from general problem solving. Moreover, from a broader computing perspective, health care can offer a particularly appealing test-bed for theories of reasoning about real world situations that lie outside of medicine. Characteristically messy and complicated situations exist in many domains and involve similar logical quirks and peculiarities arising in handling concepts or categories that are open sets. Health care offers this advantage not only because of the widespread (and fundable) interest in medicine itself and its open, unclassified nature in a time of secretive science, but most particularly because of the large number of real world cases that can be collected, the complexity of the subject matter, its encyclopedic scope and the near immediacy of the results. In real world medical situations the judgments made and the actions taken have concrete outcomes that can be tested against projected expectations [12]. In organizing fuzzy categories for computation, of particular note is David Heckerman’s thesis on Probabilistic Similarity Networks [13]. Grounded specifically in health care, this Ph.D. thesis won notable recognition from the computing community for its broad contribution to the field of expert judgment. By combining knowledge maps and influence diagrams into belief networks, it broke new ground for sound, formal ways to describe complex conditional open relationships in a probabilistic environment. The original source data consisted of a collection of anatomic slides from lymph node pathology where the diagnostic entities exhibit a high degree of morphological ambiguity so that the judgments of different domain experts, based on their experience, were often at considerable variance. Thus, for each diagnosis a gold standard was established as the aggregate of a set of observations weighted by the subjective evaluations of a recognized group of such experts [14]. The kinds of diagnostic distinctions addressed in this way were previously well beyond the reach of classic automated decision tools. This methodology and others like it can be applied widely in different domains of expert system decision making. In the past, for such a powerful generalization to arise out of Medical Informatics has been unusual, largely due to the labor intensive problem of sufficiently sophisticated descriptive data input. (The one exception has been radiology, where the graphic input is already digitized.) Once the input problem is fully addressed numerous tools are waiting in the wings to process the data: These include Multivariate Coding [15,16], Concept Indexing [17], Object Oriented Computing [18] and Fuzzy Logic processing [19,20]. After an initial period of contentious debate, these different approaches are now seen to be largely complimentary. A primary task of the next ten years will be to move the still somewhat parochial field of Medical Informatics more directly into the interactive computational mainstream. In doing so, we welcome all of the help we can get. With the development of truly parsable marked-up data bases, the expertise of workers from other fields can be fruitfully involved. There are bridges to be built. References: 1. E. Mayr, “Biological classification: Toward a synthesis of opposing methodologies.” Conceptual Issues in Evolutionary Biology, E. Sober, ed., MIT Press, Cambridge, MA, 1984. 2. E. Mayr, Animal Species and Evolution, Belknap Press of Harvard University Press, Cambridge, MA, 1963. 3. E. Mayr, Systematics and the Origin of Species, from the Viewpoint of a Zoologist, Harvard University Press, Cambridge, MA, 1999. 4. E. Mayer, What evolution Is, New York, Basic Books, 2001. 5. S. Pinker, the blank slate: The Modern Denial of Human Nature, Viking Penguin, New York, 2002, pg. 229. 6. C. R. Sunstein, Risk and Reason: Safety, Law, and the Environment, Cambridge University Press, Cambridge, 2002. 7. T. L. Lincoln, “Codifying Medical Records in XML: philosophy and engineering.” R. Khare and D. Connolly, eds., XML: Principles, Tools, and Techniques, O’Reilly & Associates, Sebastopol, CA, 2 (4):149-152 (1997). available at http://admin.xml.com/pub/a/w3j/s3.lincoln.html 8. The HL7 Clinical Document Architecture R. H. Dolin, L. Alschuler, C. Beebe, P. V. Biron, S. L. Boyer, D. Essin, E. Kimber, T. Lincoln, and J. E. Mattison J Am Med Inform Assoc 2001;8(6):552-569. 9. R. H. Dolin, W. Rishel, P. V. Biron, J. Spinosa, and J. E. Mattison, “SGML and XML as Interchange Formats for HL7 Messages.” JAMIA Fall Symposium Supplement, 720-4 (1998). 10. A. Hadley and C.Hutchings 1.25 Million Electronic Patient Records in XML at Poole http://www.gca.org/papers/xmleurope2001/papers/html/sid-04-08.html 11. Various Web sites: www.chartware.com for a private practice documentation system, www.verinform.com for marked up clinical trials and other hospital based applications, http://www.hl7.org 12. P. R. Cohen, “Control Conditions in Mycin: A Case Study”, http://www.cs.colostate.edu/~howe/EMAI/ch3/node5.html 13. D. E Heckerman, Probabilistic similarity networks, (ACM Doctoral Dissertation Award) MIT Press, Cambridge, MA 1991. 14. B.N. Nathwani, et al; “Integrated Expert Systems and Videodisc in Surgical Pathology: An overview.” Human Pathology, 21 11-27 (1991). 15. R. A. Côté and D. J. Rothwell, et al., eds., SNOMED International : the systematized nomenclature of human and veterinary medicine (3rd ed.), College of American Pathologists Northfield, IL, American Veterinary Medical Association, Schaumberg, IL, 1993. 16. J. R. Campbell, et al. (CPRI Working Group on Codes and Structures); “Phase II Evaluation of Clinical Coding Schemes: Completeness, taxonomy, mapping, definitions, and clarity.” JAMIA, 4, 238-251 (1997). 17. B. R. Schatz, “Information retrieval in digital libraries: Bringing search to the net.” Science, 275, 327-334 (1997). 18. J. Pritchard, COM and CORBA Side by Side : architectures, strategies, and implementation, Addison-Wesley, Reading, MA, 1999. 19. L. A. Zadeh, The Role of Fuzzy Logic in the Management of Uncertainty in Expert Systems, Electronics Research Laboratory, College of Engineering, University of California, Berkeley, CA, 1983. 20. C. T. Leondes, ed., Fuzzy Logic and Expert Systems Applications. Academic Press, San Diego 1998.