TODAY: 06 January 2009

Main Menu
Home
About
News
Joining
Links
Contact Us
Search
Newsfeeds
FAQs
Newsletter
Login
CB Workflows
You are not authorised to view this resource.
You need to login.
CB Online
No Users Online



2006-01 MPO Newsletter PDF Print E-mail
Meta-Data Professional Newsletter
January/ February 2006

 

In This Issue
  • MPO Membership Meeting
  • Meta-Data Industry Conferences & Events
  • Special Discounts for MPO Members
  • Vendor Sponsorship Opportunities Available
  • Of Wikipedia and Dictionaries: Dictionary on a Shoestring
  •  

    The 2006 MPO Board Members
     

    • President: Davida Berger / DebTech International
    • Secretary: Diana Wild / FM Global
    • Treasurer: Bill Brooks / MFS Investment Management
    • Operations: Rob Dickens
    • Membership: Ekkehard Schwarz
    • Industry Relationships: Seth Earley / Earley and Associates
    • Online Services: David Plotkin / Wells Fargo
    • Newsletter: Nolan Madson / Baseline Consulting
         
         
    MPO Membership Meeting
    MPO Membership Meeting
    Feb. 28, 2006
    Sheraton, Studio City, Orlando, Fl
    Join us for a membership meeting in Orlando on Feb. 28th
    Meet members of the MPO and Advisory boards.
    Meeting topics include:
    • MPO progress since formation
    • Plans for 2006
    • Discussion of metadata survey results on issues practitioners are facing in metadata management and how the MPO will help
    • Working Group Sessions on Inexpensive Ways to Manage Metadata led by Bonnie O’Neil, recognized expert on business meta data, data architecture and business rules

    The meeting will be held at the DebTech Meta-Data and Data Modeling Summit Sheraton Studio City. You do not need to attend the conference in order to attend this meeting however pre-registration for non- conference attendees is required.

    For pre-registration please email This e-mail address is being protected from spam bots, you need JavaScript enabled to view it

    MPO members are eligible for discounted registration to attend the conference. For conference information please visit www.debtechint. com.
     
    Meta-Data Industry Conferences & Events

    Here is a list of meta-data industry conferences and events that offer discounts to MPO members.
    • February 27 - March 1, 2006 (Orlando, Fl) ~ DebTech International presents the 2006 Meta- Data and Data Modeling Summit ~ $280 discount www.debtechint. com
    • February 28 – March 2 (London, UK) IRM UK presents Zachman Enterprise Architecture Series, John Zachman and Stan Lock 10% discount www.irmuk.co.uk
    • March 6 –March 9, 2006 (San Jose, CA) Semantic Technology Conference $100 discount www.semantic- conference.com
    • March 9 –March 16 (London, UK) IRM UK presents Peter Aiken - Information Architecture, XML and Data Management, Practical Metadata Strategies 10% discount www.irmuk.co.uk
    • April 10- April 11, 2006 (Pittsburgh, PA) KIK Consulting presents How to Build and Implement Data Governance & Data Stewardship Program- Bob Seiner (call for MPO discount) www.kikconsulti ng.com
     
    Special Discounts for MPO Members

    MPO has negotiated the following vendor discounts:
     
    Vendor Sponsorship Opportunities Available

    Attention meta-data vendors and consultants ~ are you interested in sponsoring an MPO webinar, teleconference, or meeting ? Please contact Seth Earley, VP of Industry Relations at This e-mail address is being protected from spam bots, you need JavaScript enabled to view it
     
    Of Wikipedia and Dictionaries: Dictionary on a Shoestring

    Bonnie K. O’Neil

    Project Performance Corporation

    Abstract

    This paper is about launching a dictionary initiative on a shoestring, using Bonnie’s Law: Use Whatever Technology is Lying Around.  The chronicles of a media company that went through a major migration will be presented, illustrating how they were left with all sorts of conflicting business terms (not to mention data elements too!), and how they solved the problem, with no budget at all.  Lessons learned from the exercise include:

    • Why dictionaries are important
    • How to launch a business metadata initiative when your client doesn’t know what that is
    • What the organization gains by this initiative
    • Basic shoestring principles: How to launch all sorts of metadata initiatives with no budget

    Introduction and Project Background

    There have been several events that have transpired in my last client which have created the need for making a common corporate glossary[1] explicit.  Perhaps the most notable of these was a migration to a new commercial off-the-shelf (COTS) software system, which essentially manages the heart of the business.

    As you would expect, this system has its own terminology and this has caused the entire business to learn a new vocabulary.  Terms that are important to the business are important enough to expend time and resources in tracking them and making sure everyone in the enterprise is “speaking the same language”— literally. We have therefore identified the need for a corporate glossary.

    A corporate glossary also sets the stage for "the semantic web," intelligent search, and real knowledge management: The establishment of a corporate, shared knowledge base where different groups within the organization can learn from the successes of others. 

    This article will share a case study in launching a corporate glossary. In the process, we will explore the lifecycle of a term and discuss the beginning of a governance process for business terms.  I will be exploring our journey to the creation of a knowledge base.

    The next step after a glossary is to expand and build it into an encyclopedia of sorts. Along the way, we will be looking at infrastructure, lifecycle, architecture, and governance. We have discovered some exciting things in our adventures, and I look forward to sharing these nuggets of knowledge with you.

    The Importance of Definitions

    Many errors and accidents are made or caused by misunderstandings of the meanings of terms used.

    How many times have you been in a meeting when the words you heard being said did not match what you thought they were? 

    Many business decisions are made (and later regretted) due to a misunderstanding of the data, and what the data element used in a report is signifying.  Some of these accidents and misunderstandings are large enough to be reported in the media.

    In prior papers I refer to the Mars Lander episode, where the unit of measure was assumed and not made explicit, miscalculations were made and the equipment was lost.  Our businesses are filled with many such examples, although not as costly perhaps, are still quite impactful to the business.

    Context is everything.  The English language is full of meaning nuances; a word may have multiple meanings based upon the context that it is used.

    Business metadata is all about adding context to data.  A Dictionary or Glossary is part of business metadata, and it is all about making meaning explicit and providing definitions to business terms, data elements, acronyms and abbreviations. 

    Business Function of a Glossary

    In the normal flow of business, a dictionary or glossary is useful for the following:

    Orientation and indoctrination of new employees.  The corporate culture of any business is filled with both industry-specific and company-specific terms that baffle new employees.  A dictionary helps these new employees get up to speed quickly.

    For personal clarification.  An employee is using an application or reading a document and comes across a word or phrase that is contusing. The dobsonfly can be accessed immediately to clarify the work he or she is engaged in.

    “Linguistic Arbitration” (a wonderful term authored by my friend James McQuade) Two or more individuals or groups have different definitions for the same business term. A dictionary can assist in clarification of meanings, and help then come up with definitions that they all can agree to.

    Data Harmonization. The most important purpose of a corporate glossary B to arrive at terms which everyone can agree on the meanings and everyone can use these terms with common understanding.

    Claudia Imhoff uses the analogy of “singing off of the same sheet of music” so I call this “data harmonization.”  This is critical for any type of data integration work, including data warehouses and data migration projects.

    Metadata Initiative on a Shoestring

    A dictionary is therefore an instance of metadata. Furthermore, it is Business Metadata—metadata that will be consumed by business people.  Metadata is very abstract; most business people don’t understand it. Yet it has tremendous value, as we have seen above. So how do you procure funding for something like this? My answer: You may not have to. You can do a lot by stealth.  Use Bonnie’s Law: Use Whatever is Lying Around. This case study depicts how we used this as our operating principle.

    Technology Requirements to Support the Function of the Glossary

    The purpose of the glossary therefore is to provide anyone with the definition of any term or data element name, no matter where they are or what software they are using. 

    In order to provide this capability, the glossary must be ubiquitous—available from anywhere in the organization, at any time, and always accessible by everyone. It should be easy to use, and require as few keystrokes as possible. It should also be able to be modified by not just the subject matter experts (SMEs) but also by anyone in the company. 

    Since we had no budget, however, we couldn’t have a programmer create something that then had to be deployed on everyone’s desktop. So we looked around for something that was available on everyone’s desktop land it was the Portal. Our portal is the perfect place for the glossary to reside; it comes up when you click the Internet Explorer icon in the tray.

    So we used Plumtree, which in this case was “lying around.” I am not recommending that you go out and buy it if you don’t already have it. Be a detective and see what is available in your environment.

    Plumtree has a facility called Studio that enables you to create portlets with no programming. For us this was a huge advantage, because not only did we have no budget for a programmer, it also enabled us to deploy it without having to go through the rigorous testing that coded applications were subjected to. This meant that we could release it sooner.

    Governance Lite

    If anyone is allowed to create or update a dictionary entry, some control is needed to reconcile terms cross-functionally, with other groups in the organization who may have different uses for a term.  This creates the need for governance.

    We have all experienced data dictionary initiatives that have been burdened with too much governance.  The governance gets in the way of flexibility: The business needs to be able to change the definition when the business itself changes. The resulting situation involves definitions that may have been correct at one point in time, but as things change, they never are in synch with the business as time goes on because it takes an act of Congress to change the dictionary.  We have therefore created what we call "Governance Lite" in order to create a flexible structure to accommodate both the need for governance and the need to keep constantly in synch with the business.

    Governance Lite works as follows: Anyone in the organization can create a Business Term entry in the glossary. When a new term is created, it has a state of "Candidate." Anyone can see all candidate terms, and the state will be shown to users of the Glossary.

    We have a "Terms Team[2] " whose job is to rationalize and normalize terms.  The Terms Team is electronically notified when a new term is entered into the glossary. The team then researches the term with other terms already in the glossary and makes sure there are no conflicts.

    The team makes sure the wording of the definition is accurate, and that the definition follows the format mentioned in my previous article. Business Metadata: How to Write Definitions published by TDAN (http://tdan.com/i032fe01.htm ) and B-Eye (http://www.b-eye- network.com/view/734)

    Sometimes this work may require researching reference documents, and/or contacting line of business SMEs directly.   When the definition has been successfully researched, its state is changed to “Authorized.” 

    The business is alerted to its use of a Candidate term, since the term is displayed in the glossary along with its state. The business should take steps to clarify the meaning of Candidate terms whenever they are used, since it is possible that a candidate can have more than one meaning. 

    “Wiki” Updating

    “Wikipedia” is an open source encyclopedia on the internet. What’s cool about it is it’s the “People’s Encyclopedia”—anyone can update an existing entry or add a new one. In this way, everyone can participate in it and “own” it. On the downside, it can be very chaotic, because it lacks governance.

    We are trying to strike a balance and enable everyone to feel like they can contribute, therefore only applying minimal governance.  Where it gets interesting is when someone wants to update an existing entry, especially someone else’s entry.  This is when the governance is really needed.

    How We Are “Faking” Wiki

    As described above, we used Plumtree Studio, which enabled us to implement the glossary very quickly.  The tradeoff is you cannot do any custom development.  We had to then get creative in how we implemented it.  Since the “plain vanilla” implementation does not offer versioning, we decided that every time someone wanted to edit an existing term, they can click the “edit” button (even on a term submitted by someone else) and they would be allowed to edit it. 

    The system would create a new term behind the scenes, using a database trigger.  So if they were editing an Authorized term, the dictionary would essentially have both: the older version, which is the Authorized one, and the proposed new one, which would be a Candidate.  The “new” one would contain the edited version of the definition. So what we are doing is not a true wiki because we need the entries for both before and after the edit in order to apply governance and resolve conflicts. 

    Resolving Conflict

    Suppose a business term has been entered more than once in the glossary, with two different meanings. There are several ways this can be resolved:

    Add a modifier to the term to create a new term. There may be a general definition of customer, but Marketing uses the term differently so you can have a "Marketing Customer."

    In semantics, there is a notion of "word sense," i.e., a different “sense” of the word. For example, the word "mole" has different meanings, such as:

     - a small furry mammal,

     - an indentation in the skin,

     - or a spy.

    In our glossary, we will be using what I call "enumerated definitions," meaning a term will have numbered definitions.  We have not decided whether the order of the definitions will reflect the most-to-least commonly used definition. I think for now it will be a "first in" type queue—the first definition received will be the first shown.  Therefore, every term will have only one “Authorized” entry. 

    Replace the term—discussed in the next section.

    Replacing and Merging Terms

    The Term Team examines two or more existing definitions for the same term, either from separate submissions by different people or the modification of an existing term.  Here are the resolution scenarios:

    One of the definitions looks better than the others: it is more complete, language is more precise, etc.  In this case, one term will replace the others.

    Elements of all the candidate definitions can be merged together to create a new definition. 

    One definition may be incorrect, or may apply to a different term.

    In all of these cases, one definition will replace others.  The replaced term will have a status of “Replaced” and there is a data element Replaced By which would contain the Term ID of the new term. 

    Lifecycle of a Term

    The states of a term are:

    1. Candidate
    2. Authorized
    3. Replaced
    4. Retired

    Sometimes it will be determined by the business that a term is not useful anymore and is no longer part of the common business language.  In this case, its status will be Retired.  We are still setting policies concerning retired terms.  Right now, our intention is that if the Term Team uncovers a term that should be retired, the status will be set to Retired, and if no one comments on this or edits the definition, it will cease to be an active part of the glossary and will no longer be displayed to users.

    Using the Glossary: How Search Works

    The first step in a glossary search is accessing the main portal page by launching the browser.  There are two portlets for the glossary, both shown on the main portal page: one for search and one for submit.  The search portlet has only one entry area, for the term name, along with the search button.

    The results window displays one definition, with an arrow indicating more rows found if they exist.  The results are sorted by the state of the term, so if there is an Authorized definition it will show first.  Otherwise, the results are sorted secondarily by date entered, so if there are multiple candidates, the most current will show first.

    The submit portlet allows the user to enter a term name and a definition if desired.  We want users to be able to submit a term even if they don’t know the definition.

    Publicity

    One of the immediate goals of our project is getting the business to be familiar with the glossary and to use it. We launched a publicity campaign, which included a contest, to generate excitement and awareness of the glossary. Every time someone enters a term, their name will be entered into the drawing to win one of two gift certificates for company merchandise like polo shirts, tote bags, etc.  Two articles appeared on the corporate portal, as well as an article in the e-magazine, and posters in each regional office announcing the contest.  The gift certificates were donated by my boss.

    The purpose of the contest is twofold:

    To collect as many terms from actual business people as we can, and

    To get people familiar with the glossary so they can start using it in their daily work.

    The Future of the Glossary: Semantic Web and Encyclopedia

    The Terms Team in the background will quietly be collecting synonyms.  Eventually we want to display synonyms to the user when the definition is shown.  This will require customization, so it will be implemented in a later release.  It is our goal at this time to have the display limited to the term name, the definition, and its status.  Synonyms and related terms lead to ontological information—terms that can be related and can therefore supercharge searching. 

    The stage is being set for semantics to empower search.  Although the tools are not here yet, it helps to be aware of the emerging standards, such as Resource Description Framework (RDF) and Web Ontology Language (OWL), which will enable related information to be tagged for easy navigation.  We see that our dictionary work is preparing us for semantically enabled search capability when it arrives.  At the same time, another group is gearing up for the creation of an in-house taxonomy for corporate search.  Our glossary will help define the base terms used in this taxonomy.

    Our next evolutionary step for our “wiki-glossary” will be a corporate “wikipedia,” like its namesake on the web.  We want to be able to enable business people to capture the largely unstructured data that they uncover every day that helps them perform their jobs better.  We want to create a way that when people make a discovery of knowledge that helps them, they can enter it in the wikipedia as an entry (sort of like a business term entry but more detailed) and others can search on it.

    We already have a document library, but it contains only formal documentation on business processes.  We would like to capture all the informal knowledge embedded in “carbon-based life forms” so others can benefit.  Everyone knows that most business rules live informally, the large majority in people’s heads.  If we can create a very simple tool that makes it easy to capture this informal bed of knowledge, then the corporation will benefit greatly.  This is our next step, along with the creation of a “wiki-ontology,” a classification of corporate knowledge that can be grown from the business people themselves. 

    Generating Business Value

    Our glossary is making the business community aware of the meanings of the words they use.  We see confusion every day in word usage that often translates into misunderstood business metrics, which can have a drastic impact on decision-making.  It is our hope that we can clarify terms so systems and metrics can be understood better, which will eventually have an impact on the bottom line.

    About the Author

    Bonnie O'Neil is Senior Technical Consultant at Project Performance Corporation, and is an internationally recognized expert on business meta data, data architecture and business rules.  She is a regular speaker at many conferences, and has also been a workshop leader at Meta Data/DAMA Conference, Oracle ODTUG, and the Business Rules Forum; she was the keynote speaker at an international conference on Data Quality in South Africa.  She was a founding member of the Guide Business Rules Project (a standards group for business rules) and also the ODTUG Business Rules Summit.  She has been involved in strategic data management projects in both Fortune 500 companies and government agencies, and her expertise includes specialized skills such as data quality, profiling, semantic/data integration and migration.  She is the author of two database books including Oracle Data Warehousing Unleashed, as well as over 40 articles and technical white papers.   She is a Certified CIF/GIF Architect by Bill Inmon, the Father of Data Warehousing, and is a certified trainer for Mr. Inmon.  She is currently working on her third book, co-authored by Mr. Inmon, on Business Metadata.

    Acknowledgments

    Thanks to my client, Dex Media and specifically David Neitz, for funding my work, and whose enthusiasm for the “wiki” helped me tie everything together.

    Thanks to the fabulous Terms Team: Helen Atwater, Connie Boston, and Sue Blowers.  Without them, this case study wouldn’t exist.

    Thanks to David McComb, who is my tutor in the semantics space, who wrote a wonderful book entitled “Semantics in Business Systems” to whom I am indebted for the “Word Sense” area and the Mole example.

    Thanks to Lowell Fryman, whose humorous analogies are always a delight, to whom I owe the notion of the most ubiquitous metadata repository known to humankind: the carbon-based life form.  Lowell is my co-presenter at RMOUG.

    Thanks to James McQuade of Giant Eagle.  He is implementing an approach in knowledge artifact management that is similar to the next evolutionary step for our glossary.

    Lastly (but not least!) thanks to my wonderful family: Perry, Tyler and Christopher, who considerately left me alone to write this article.

     © 2006 Bonnie O’Neal


    [1] Author’s note: In this paper, I use the words “dictionary” and “glossary” interchangeably.

    [2] Note: Even though we did not have budget, we had to get buy-in from the managers of employees who would serve on the terms team. One of these managers was also my boss, who was already sold on the idea of business metadata. The other was a business corporate librarian, who loved the idea-it was right in her area of expertise.

     

     
    What is the Meta-Data Professional Organization?

    The Meta-Data Professional Organization (MPO) is a non-profit, international professional association comprised of business and IT professionals in all areas of meta-data practice, including administrators, developers, architects and managers.

    The MPO brings together individuals with interests, expertise, or hands-on experience in meta-data use from all areas of private and public enterprise throughout the world and seeks to disseminate technical and professional information to meta-data practitioners of all levels of experience.

    The MPO provides meta-data professionals with a community that fosters discussion, advancement and increased understanding of meta-data as it is applied in the field.

    Are You Ready
    to Join the MPO ?

     

    The Meta-Data Professional Organization is actively seeking both companies and individuals to join its membership. We encourage you to join the MPO today and start reaping the benefits! Members will gain:

    • one-stop shopping for meta-data research and resources
    • network with other meta-data professionals in your own or other fields
    • access Members-Only web content
    • read or be referred to Articles of Interest on meta- data
    • interact on Discussion Groups
    • monthly Email Newsletter
    • special "MPO Member Discounts" on seminars, conferences, & other events

    See the MPO web site for current membership information. If there are more than 4 individuals in your organization, we recommend a Corporate Membership, which allows an unlimited number of people within your organization to join. Student Memberships are offered to students currently enrolled at any accredited educational institution. All memberships are renewed annually, with fees due on the anniversary of the membership start date.

    MPO MENTORSHIP PROGRAM ~ calling all Meta-Data Mentors!

    One of the goals of the MPO is to provide an environment conducive to sharing experiences with each other. If you are interested in sharing your meta-data knowledge and experience with others, please email us at membership
    @metadataprofessional.org

    Stay Tuned: details regarding the mentor program will be posted (soon) on our website!

     


      SUBSCRIBE to the MPO Newsletter! Enter your email address and press
    Last Updated ( Tuesday, 26 February 2008 )