Monday, April 6, 2015

Be a Thoughtful Solution Provider

No wonder IT architects, in general, get paid better than other IT professional, because the IT architect can, at a large degree, decide the company's IT future. Hence, the architecture laid out by the architect has profound relationship to the company's IT ROI. A thoughtful architect could save big money for the company, while a sloppy decision would take away a big chunk of IT budget (even worse is that management probably doesn't notice it at all).

Giving one example with my experience, I was a solution architect working together with another guy for implementing BPM (business process management) of so call money movement (with a financial company, there are 4 types of moving your money around: ACH, check, wire and journal). We came out a nice logical architecture model consisting of IBM BPM, IIB, MQ, Oracle databases and mainframe (Non-stop) components. Going along with this model, we have to create about 8 services providing more than 20 operations to realize the architecture. We presented the architecture model and initial service specifications to the management & team, and got everyone thumb up. However, I was suspicious about the complexity of the whole architecture. I let my thought flow around, and debate this architecture with myself after working hours with a fresh mind. After spending time back & forth, I found there is absolutely no need of one component in this architecture. I removed it from the model, and surprisingly I was able to cut the services to only 4 services and 8 operations after removing this component. Voila, how much I saved for the company, I estimate it would be at hundred thousand range (not only the reduction of development effort, but also support and maintenance, etc).

Currently I am facing a similar situation where a thoughtful architect could make a better (or wise) decision. The situation is a typical integration use case where an incoming message to IBM IIB (my client names it as EDB - Enterprise Data Bus), and then to data service layer  (EDH - Enterprise Data Hub). The use case is to look up or match a customer by a few attributes such as an unknown ID (could be a customer ID already in the system, but not necessary), first name, last name, DOB, gender and address. Initially, a solution was proposed by a guy that IBM ODM (Operation Decision Management, aka rule engine) will host the matching logic. In fact, the matching logic is like: (1) Use the unknown ID as the search criteria to find customer(s); (2) If one customer match is found, stop here. Otherwise, add other attributes such as first & last names into the search criteria and continue the lookup. Again, if one customer match found, stop here. Otherwise, add other attributes, and repeat the similar step above. To me, the matching logic is very similar to what happens at database side. For example, taking rational database as an analogy, the matching logic can be sql query with SELECT, JOIN and WHERE clauses. Furthermore, it is very clear that database query could be very efficient (as long as you don't use database cursor) no matter it is a rational type database or no-sql database. On the other hand, if the matching logic is implemented in ODM, loop into each record (where more than one match is found) would take place. We all know that the iteration of each record could be inefficient. Therefore, I was the person starting questioning about this solution. I raised the question a few times during a couple of discussions like " Do we know if ODM can do this matching logic more efficient, comparing to database query?". Another question I asked people is: "is this matching logic suitable for ODM?" considering there two major criteria of using ODM: one is that the business rule would be changed often; the second is if the rule implemented in ODM can be modified by business people. Another point is that adding ODM into the whole solution creates more complexity in terms of infrastructure as well (for example, high availability, failover, etc). I am not against using ODM, but I want to use it in a right way with right use cases.

What I am trying to say here is that a thoughtful architect is a valuable asset to any IT organization. To be a thoughtful architect, she/he must be equipped with some characteristics such as:
  •  Never take any proposals/solutions as granted, always be ready to provide arguments about pros and cons of the solution. Even someone tells you that his solution follows existing pattern, you still have to keep debating and questioning at the solution in order to find potential areas to improve (of course, existing pattern provides high level of assurance).
  • A thoughtful architect should hold holistic view of all aspects with software/application/system, such as development effort, infrastructure layout, operationability and  maintainability. Furthermore, take both functional and non-functional requirements into account.
  • A thoughtful architect must have implementation experience with engineering mindset. He must be an expert in his technical domain, with broad experience & knowledge of other technical domains. A high level only talker is not enough to be a thoughtful architect.
  • Try to simplify any solutions. If a solution looks complex, be more alert and suspicious. (again, one of my principals is that everything has solutions, but the best solution is also the simple(st) one. My whole goal is to find this better yet simpler solution!)

Saturday, July 5, 2014

Finally, IBM Made a Smart Decision on Its Integration Product

A couple of months ago during IBM IMPACT conference 2014, IBM announced Integration Bus (IIB) v9 (formerly been known as WebSphere Message Broke) as one of IBM’s flagship integration products. As the same time, IBM asks its WESB (WebSphere Enterprise Service Bus) users to migrate to IIB.  As an IBM technology solution professional, I have been working with a couple of IBM products including IIB and WESB. For long time, I was the person betting that WESB would not have much market. A few reasons made me believe so:


  • IIB or message broker has been on market for long time, it has been adopted by customers world widely. The introduction of WESB is very late.
  • WESB caused confusion about IBM integration product stack. As a result, it hurt IBM sales of IBM integration products (to my believe).
  • WESB is not as easy to use and not as flexible & versatile as IIB.
  • If people would use WESB, why not open source Mule which is very similar to WESB in terms of platform. How did you justify spending much more for WESB instead of using the similar product Mule which is a lot cheaper?
When I was with IBM as IBM consultant, I had never recommended clients to use or shift to WESB because the reasons given above. I never believed that WESB could take off substantially. Now, what IBM announce was incidental to what my thoughts was. Too bad to those IBM customers who purchased WESB and spent tons of money for it. It might be a lesson for those people who made this decision! (the lesson is: the decision maker must be a technology savvy as well. Otherwise, how can he/she makes a right decision about what technology should be adopted?)

Sunday, July 7, 2013

Simplification and Unification

Sometime ago, my boss gave us a book titled “Einstein 1905: The Standard of Greatness”. This book described a story about Albert Einstein who wrote, in the span of six months in 1905, five papers that deeply & fundamentally influenced the course of twentieth-century science. The five papers focused on different aspects of physics as quantum mechanics, molecular dimensions, merger of space and time, and the relationship between mass and energy (the most famous equation).

Reading through this book with great interests, I admired very much unique characters of Einstein’s work. For example, he unified the wave and particle theories; he came out the most elegant and simple mass/energy equation. In addition, he sought for perfection. Very often, simplification and unification are related, which leads to perfection. Reflecting Einstein’s greatness and what has happened in technology (especially in IT) in the past 15 years, I witnessed many instances of simplification and unification. To further illustrate them, a few examples are listed below.

Simplification
  • From binary data format to XML: Prior to XML standard, many data were stored in binary format, which is proprietary, hardly interchangeable, readable and descriptive. The occurrence of XML made the data presentation as text based, which is much simpler than binary. In addition, XML can be self-descriptive and self-defined.  
  • Consolidated Data Store (CDS): a pattern of information management for managing multiple data stores throughout an enterprise. The data mess in many enterprises results in the fact that the enterprise has many data sources out of control. Another consequence of this mess is that same business entity has different versions of data associated, meaning there is no one source of true data across the enterprise. CDS is targeted to simplify the complexity of enterprise data access and management, as well as to provide one authoritative data source.  
Unification
  • Internet (http protocol): Needless to say that Internet has been becoming the most unified protocol in our society, touching our daily life everywhere.
  • Enterprise Service Bus (ESB): ESB is a software architecture model (can also be taken as a pattern of IT infrastructure), which is for designing and implementing the interaction and communication among applications. Prior to ESB, the communication of systems or applications was tightly coupled, with their own communication protocols. This caused enormous burden for an enterprise with disparate data sources and systems in terms of development, maintenance and operation. ESB unified the interaction and communication among different participants in the whole enterprise ecosystem, enabled IT more agile and flexible.
Finally, every professional should keep simplification and unification as his/her ultimate goal of excellence. Keep in mind that simplicity is beauty!


Saturday, February 16, 2013

How did I bankrupt USPS

I don't remember how long ago the news had been spreading about the fact that USPS (United State Postal Service) is on the brink of bankruptcy. One news said that it has lost $16 billion in the last year and $41 billion in the last five. One of the actions to reduce the large deficit that USPS takes is to reduce services, such as canceling Saturday delivery according to the  most recent news. Needless to say that it is a very sad story, as post service has been part of our life for very long time.

It turned out that I was the person, among many, who brought USPS into bankruptcy by using its expense services for free. It sounds unbelievable, but very simple to do it. Here is what I did.

My wife has a side business of exporting made-in-USA staff to China. As part of this business, she had quite a few documents and samples to be sent to China. Usually she uses USPS Global EMS (express mail service) for this purpose. She often received, from her partner in China, complaints that the EMS shipping did not arrived in time. As a result of the complaints, I took a step to help her looking into this problem. I found that the receipt from USPS Global EMS has a guaranty of the delivery. If USPS fails to deliver the goods within the guaranteed time, you can file a request for refund. After I found the guaranty on the receipt, I scrutinized every USPS Global EMS shipping to China. It turned out that EVERY USPS EMS shipping to China, in our cases, were not within the guaranteed time. Therefore, I filed every time a refund request. Giving an example, I received the refund of $508.45 in total for five shipping during late of last year. Wow, $508.45 refund. Not only I got refund, my documents/samples had also been delivered by EMS in China except a few days delay (usually the delay is about 1-3 days).

Relating to USPS money losing business, at least two things I believe that USPS should do.

(1) Answer the 3 questions, namely where, how, and why the money has been lost. Of course, technology can help to identify where the problems are, and how the problems happen. The technology here is business intelligence (BI). Yes, using BI gives a comprehensive picture of how business is performing. However, one need to drill down to find the real mechanisms that makes business work or not work. This needs a deep attention from human side.

(2) Continuing from last point, one cannot just stay at the level of knowing the results of business performance. We should identify why the results take place. For example, the management of USPS should not just know that EMS was probably not performing well, or should not just stay at knowing that the total number of refunds resulted from EMS service. USPS management needs to take step further to identify why the refunds happened, and how they happened. By doing this kind of drill down analysis (leveraging BI), the management could probably find that majority of EMS was only delayed by 1-3 days comparing what were guaranteed. Furthermore, USPS should look for why the delivery was delayed. Was it by the shipping from central stations (usually in big city in China) to satellite stations? or by ground transportation schedule? Only based on these kind of detailed analysis, USPS can design a better delivery guaranty for its services, which can help not only its profitability, but also customer expectation and satisfaction.

Sunday, January 6, 2013

My Unpublished Work Almost 20 Years Ago

Note: My Ph.D. thesis at Swiss Federal Institute of Technology Lausanne (EPFL) was about numerical simulation on how material breaks due to faults (such as cracks) propagation. The numerical method used is based on the theory of so called Finite Element Analysis. To start off, one needs to digitize the object. In my case, I studied the digitization of the material so that I can apply the physical laws/equations to it. The digitization or discretization (from the word of discrete) is also termed as mesh generation in this field. I worked with Dr. Parviz Navi together for coming out a very simple algorithm for the 2-dimensional mesh generation for any type of irregular shapes of heterogeneous material. In fact, this algorithm can be extended to 3-dimensional situation in a rather straightforward fashion.

Coincidentally, sometime after moving to Tampa in 1995, I found that a very similar effort was carried out at Microsoft Lab as part of a project titled as "Surface Reconstruction", which was used for developing games. Very recently, the similar technology is being used for 3D printing (a very promising emerging technology), as a 3D model can be represented by a series of triangular shapes before making it to a real part through 3D printer.

Unfortunately, the algorithm of mesh generation or surface reconstruction (no matter how it is called) that Dr. Parviz Navi and I developed almost 20 years ago has never been officially published. What happened was we wanted to publish it on an academic journal. I wrote the majority of a draft describing the algorithm, and waiting for Dr. Navi to write the first section as an introduction. However, he never finished this section by any reasons. I would like to put the original draft as my blog for the memory of an unforgettable period of my time living in Switzerland. 

(some notations are messed up due to HTML being not able to represent Greek letters.)

2 DESCRIPTIONS

The method of the triangle mesh generation proposed in this article will be described here. Section 2.1 will give the definitions and observations made about triangle generation. Next we will be discussing about the triangulation. Finally the optimization methods that we have been using is introduced.

2.1 Definitions and Observations

Throughout this paper, we will use the terminology as exterior and interior boundaries, fixed points, movable points and connections. Their definitions are given as follows.

Exterior boundary:  a line that outlines the outermost shape of a domain.
Interior boundary:  a line that outlines a void inside a domain.
Fixed points:  points used to define the boundary of a domain to be meshed. The boundary can be exterior or in interior.
Movable points:  points inside a domain. The position of the movable points will be shifted up to the optimization criterion.
Connection:  a straight line connecting one point to the other. Connections are grouped as fixed and non-fixed connections.
Fixed connection:  a connection between two fixed points. The fixed connections online either an exterior or interior boundary.
Non-fixed connection:  a connection between a fixed point and a non-fixed point, or between two non-fixed points.

Figure 2.1 shows graphically the fixed, movable points and connections.


Fig. 2.1  Definition of fixed, movable points and connections.

In our case, the mesh consists of triangles. We have observed that the triangular mesh is completely determined by two variables. One is represented by a set of the positions of the movable points, the other are connections. We name the two variables as  Sp and Sc respectively. There is one and only one triangular mesh corresponding to a given set of the positions and connections. Figure 2.2 sketches the relationship among Sp, Sc and the corresponding meshes.

Fig. 2.2  Schematic representation of relationship among the positions of movable points (Sp), their connections (Sc) and corresponding meshes.

As it is shown in Figure 2.2, a set of different possible positions and their connections will result in different meshes. Usually, these meshes do not meet with the precision requirement of calculation without undergoing an optimization process. The optimization process will improve the performance of the mesh.  Hence, the aim becomes to find an optimized mesh from those with different values of Sp and Sc. The relationship of the positions of the movable points, their connections in terms of the mesh performance is shown in Figure 2.3.  It's possible to achieve this aim by following two different passages, (1) altering the value of Sp, which means to shift the positions of the movable points, and (2) changing the value of Sc, which means to rearrange the connections of the movable points. The optimization methods that we use will be described in Section 2.3.


Fig.2.3 Schematic representation of mesh performance in terms of
position of movable points (Sp), their connections (Sc).

2.2 Triangulation

Before the triangulation starts, the fixed connections (see the definition of the fixed connection in Section 2.1) should be established.  The fixed connections online the boundary of a domain. The boundary can be either exterior or interior.  Next, a simple triangulation method is used. It starts from connecting one point to the other, no matter it is a fixed or movable point.  Figure 2.4(a) shows the possible connections for the point 1 (1-6, 1-3 and 1-5) and the connection for point 2 (2-5).

When connecting one point to the other, the following two criteria must be checked:
  • No connection has been existed for the two points to be connected. This can be further explained by the aid of the example shown in Figure 2.4(a). For instance, before the connection between the point 1 and 6 is established, the program should check if there is already a connection between these two points.  If it's so, then the connection should be aborted.
  • A connection to be established should not intersect with any existing connections. For example, a connection from point 4 to point 5 in Figure 2.4(a) should not be allowed, since it will intersect with the connection 1-3.
After the process of the connection goes through all points, the triangulation is completed. An example of the final triangulation is shown in Figure 2.4(b).


Fig. 2.4  Triangulation – connecting one point with the other.

2.3  Optimization of Triangles

As mentioned above, we optimize a triangular mesh by rearranging the non-fix connections and by changing the movable points' position.  We note that each movable point, in the case of triangular mesh, is surrounded by a polygon (a triangle is also treated as a polygon.).  There are two types of polygon, concave and convex, as shown in Figure 2.5.  Our optimization algorithm needs to know if a polygon is concave or convex.  It is found that in the case of a concave polygon (Figure 2.5(b)), the area constructed by the points 1-2-3 is negative due to clock-wise counting of the points. While, for a convex polygon, the same area calculated by connecting the points 1-2-3 is positive. Therefore, to determine whether a polygon is a concave or convex, successive areas constructed by the points 1-2-3, 2-3-4, etc. (Figure 2.5) are calculated. If an area is negative, then, the polygon is concave. After knowing the type of the polygon surrounding each movable point, we are about to discussing the proposed optimization as follows.


Fig. 2.5  Definition of convex and concave polygon:  (a) Convex; (b) Concave.

2.3.1 Changing the movable points

This optimization is performed for both convex and concave polygons. For a convex polygon, the point inside the polygon will simply be moved to the centroid of the polygon. While for a concave polygon, a two-step process will be followed.  We will use an example shown in Figure 2.6 to explain the two-step process.  The initial triangular elements and concave polygon are shown in Figure 2.6(a). The movable point is labeled by the letter P.

Step 1: The two connections D-E and D-C (connecting to the vertex D of a concave polygon, Figure 2.6) will be extended.  In the example shown in Figure 2.6, the extension of the two connections will insect with other connections of the polygon. The intersection points are I and J, in Figure 2.6.  The extension process results in a convex polygon indicated by A-J-D-I-F.

Step 2: The movable point P will be moved to the centroid of the resulting polygon A-J-D-F, same as we do for other convex polygons.  The final triangular elements are shown in Figure 2.6(b).

It is found that the point P can not be placed outside the polygon A-J-D-I-F, resulted from the extension of the connections D-E and D-C.  Otherwise, one of the connections between the point P and the points A, B, C, D, E, F will intersect with other connections.  For instance, if the point P is placed inside the polygon B-C-D-J, then the connection P-E will intersect with the connection D-C.

As a result that the point P must be inside the convex polygon A-J-D-I-F, question becomes where this point should be placed. We place it at the centroid of the convex polygon, as mentioned in the step 2.

Fig. 2.6  Replacing the movable point P: (a) initial position of the
point P and triangular elements; (b) final position of the point P and triangular elements.

2.3.2  Rearranging the non-fix connections

Rearranging the non-fixed connections are performed only for the convex polygon. Figure 2.7 gives an example of this rearrangement.  Basing on the direct judgement, we know that the connection A-C has to be changed to B-D, shown in Figure 2.7.  This is because the triangle ADB and DCB (Figure 2.7 (b)) is closer to the equilateral triangle than ADC and ACB (Figure 2.7(a)).  However, the direct judgement cannot be coded into a computer program. Hence, for this respect, a more general criterion has been formulated.


Fig. 2.7  Rearranging the movable point's connection.


It is well know that every triangle is well defined by three internal angles, a, b and g.  As a result, a vector can be constructed in the space of a, b and g as = {a, b, g}. Note that a + b + g = 180o, which represents a plane intersecting with a, b and g axis at 180o, shown in Figure 2.8. Any triangle will be a point on the plane, pointed by the corresponding vector of its internal angles.  The point on the plane for an equilateral triangle will be pointed by the vector ={60, 60, 60}.  It can be easily approved that (={60, 60, 60}) is perpendicular to the plane. In other word, the magnitude of for the equilateral triangle will be the minimum among the magnitudes of all


Fig. 2.8  Plane representing any triangle in the space defined by the internal angles a, b and g of triangles.

Hence, to determine if a non-fixed connection should be rearranged, the sum of the magnitudes is calculated forfor the two adjacent triangles.  In the case shown in Figure 2.7 (a), the magnitudes of the vectors for triangles ACB and ADC will be calculated. The sum of the two magnitudes is made. Then, this sum is compared with that for the triangles ADB and BDC, from Figure 2.7(b).  Since the latter is less than the former, the connection A-C (Figure 2.7(a)) will be replaced by B-D (Figure 2.7(b)).

The optimization process of changing the movable points and rearranging the non-fixed connections is performed alternatively. This process is also repeated for a few times. Our experience shows that after several iterations of the optimization, the shape of triangular elements will be improved greatly and stabilized.  The numbers of the movable points changed and of the non-fixed connections rearranged are recorded during the iterations of the optimization process. When the numbers are smaller than a given value, then the iteration can be stopped.  Examples from above-described triangulation and optimization will be shown in the next section.

3 EXAMPLES AND DISCUSSIONS

After giving the triangulation and its optimization algorithms in the last section, we are now in the position to show the results from their applications.  Three examples will be illustrated in this section to show the workability of our triangulation method and the efficiency of the optimization algorithm. A brief discussion will be followed for each example as well.

3.1  Example 1 – a structure of particle reinforced materials

Domains used in this example are composed of two different materials: matrix and irregular grains.  The grains usually have different properties from those of the matrix, for instance, stiffer than the matrix.  As a result, the triangles corresponding to the matrix and grains must be distinguished, before the finite element calculation is performed for the composite.  As the first step to mesh the composite, we generated a set of fixed points for the outer boundary of the composite specimen and for the boundary of grains, as well as a set of the movable points inside the doma0ins.  The result after the operation of the first step is shown in Figure 3.1(a).  Subsequently, the fixed connections are established within the fixed points.  Then, both fixed and movable points are connected each other by the method described in Section 2.2.  The initial triangles are shown in Figure 3.1(b).  Finally, the resulted triangular mesh after an initial optimization and third iteration are respectively shown in Figure 3.1(c) and (d).

Fig. 3.1  Example of triangular mesh generation for a composite.
(a)    Fixed and movable points, as well as fixed connections.
(b)    Initial triangular elements after connecting each points.
(c)    Triangles after first iteration of optimization.
(d)    Triangles after third iteration of optimization.



3.2  Example 2 – a structure of fiber reinforced materials

As shown in the last example, the mesh generation method that we proposed in this article works for any case. It is especially powerful tool for the study of composites.  Here another example of composite – fiber reinforced materials is given.  This material is composed of a matrix and fibers, which are embedded in the matrix with a regular pattern.  Examples of fiber reinforcement materials can be the natural wood material or synthesis fiber reinforced plastic.

Figure 3.2(a) shows the results after the generation of the fixed, movable points and the fixed connections. The initial triangular elements are given in Figure 3.2(b).  The results corresponding to the first and final (third) iterations of optimization are demonstrated in Figure 3.2(c) and (d) respectively.


Fig. 3.2  Example of triangular mesh generation for a fiber reinforced material.
(a) Fixed and movable points, as well as fixed connections.
(b) Initial triangular elements after connecting each points.
(c) Triangles after first iteration of optimization.
(d) Triangles after third iteration of optimization.



3.3  Example 3– Swiss map

The last example given in this paper is the triangular mesh for the map of Switzerland, where our university is from. The outline of the Swiss map is quite irregular.  Same as the previous two examples, we give the fixed, movable points and the fixed connections in Figure 3.3(a).  Note that there is inland lake (named Lake Neuchatel) close to the left frontier of Switzerland.  The initial triangular elements, elements after the first and third (final) iterations of the optimization are illustrated in Figure 3.3(b), (c) and (d) respectively.


Fig. 3.2  Example of triangular mesh generation for Swiss map.
(a) Fixed and movable points, as well as fixed connections.
(b) Initial triangular elements after connecting each points.
(c) Triangles after first iteration of optimization.
(d) Triangles after third iteration of optimization.

4 CONCLUSIONS

In this paper, we have presented a method of triangular mesh generation and algorithms to optimize the resulted triangles.  The method is based on the concepts of fixed and movable points, as well as connections between the points.  A connection is further distinguished into the fixed and non-fixed one.  The fixed connections form the boundary for a domain to be meshed.  The optimization process works on moving the movable points and on rearranging the non-fixed connections.

The studies on the proposed method and the algorithms show that they have the following characteristics and advantages comparing other methods.