Skip to Main Content
IBM Sterling


This portal is to open public enhancement requests for IBM Sterling products and services. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).


Shape the future of IBM!

We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:

Search existing ideas

Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updates on them if they matter to you. If you can't find what you are looking for,

Post your ideas
  1. Post an idea.

  2. Get feedback from the IBM team and other customers to refine your idea.

  3. Follow the idea through the IBM Ideas process.


Specific links you will want to bookmark for future use

Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.

IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.

ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

Status Functionality already exists
Created by Guest
Created on Jul 25, 2024

Automatic recovery for critical failures such as database failures

We have an SFG/B2B environment installed on Linux/AIX machines and in tests simulating the loss of the database we noticed that after the database becomes operational, SFG/B2B does not recover, making it necessary to perform a STOP/START of the product.
Therefore, we suggest that an agent be added to manage the SFG/B2B and to notice critical failures like this in the database, thus making the STOP/START automatically.
This agent could be configured to perform a specific product action, run a script, configure how many attempts it could make to restart the product, and the interval between attempts.

  • Admin
    Mark Allen
    Reply
    |
    Dec 4, 2024

    Thank you for taking the time to provide your ideas to IBM. We truly value our relationship with you and appreciate your willingness to share details about your experience, your recommendations, and ideas.

    We have reviewed this request with our technical team and have determined that we may already support the use case you have described in your request.

    • The core product already supports recovery scenarios when B2Bi/SFG is configured correctly and is using a proper topology with sufficient redundancy

    • 6.2.x introduced JDBC pooling for DB connections to reduce the time taken to recover from a primary DB node connection loss

    • BPs need to be configured to leverage existing recovery mechanisms such as "on fault" features. This needs to be done on a case by case basis as the data BPs handle need to be considered to avoid data duplication or processing duplication

    • If the DB pointers can be reused after an outage, B2Bi will pick up where it left off when the connection is dropped. If it cannot, manual intervention is needed by design as duplicate processing needs to be avoided


    In summary, we would recommend to upgrade to version 6.2.x and ensure that your BP designs are using the existing fault-tolerant/error handling features. If you continue to experience problems, open a support case.

  • Guest
    Reply
    |
    Sep 4, 2024

    This is really important, whenever there is a failure with the database for a long period, some BPs remain false Active and the application needs to STOP/START for the Recovery process (BP Recover.bpml) to execute correctly.

  • Guest
    Reply
    |
    Jul 25, 2024


    I've seen this happen before.

  • Guest
    Reply
    |
    Jul 25, 2024

    Its a very important enhancement to allow that that tech product Platform be resilient and reliable. Autorecovry withou human action is very very important to us