

Unicode and Non-Unicode Conversion in SAP Basis

I. Understanding Unicode and Non-Unicode

  1. Non-Unicode (Code Page Systems):

    • Definition: These systems use a single code page (e.g., ISO-8859-1 for Western Europe, Shift-JIS for Japan, GBK for simplified Chinese) to represent characters. Each code page maps specific numerical values to specific characters.
    • Limitation: A non-Unicode system can only process characters from its assigned code page. It cannot display or process characters from multiple languages (especially those requiring different scripts like Cyrillic, Arabic, Asian characters) simultaneously within the same system. Data exchange between systems with different code pages is complex and prone to errors (data loss, garbled characters).
    • Encoding: Typically uses 1 byte per character for Latin-based languages, but can use 2 bytes for certain Asian languages within their specific code page.
    • SAP Relevance: Older SAP R/3 systems (before ECC 6.0) were predominantly non-Unicode.
  2. Unicode (Code-Page-Independent Systems):

    • Definition: A universal character encoding standard that aims to represent all characters from all written languages in the world. It assigns a unique number to every character, regardless of the platform, program, or language.
    • Advantage: Allows multilingual support within a single SAP system. Data exchange between different language environments is seamless and reliable. Essential for global businesses.
    • Encoding: SAP systems use UTF-16 for internal representation. Most characters take 2 bytes; supplementary characters outside the Basic Multilingual Plane take 4 bytes via surrogate pairs. For Latin-based languages this roughly doubles database size and memory consumption compared to single-byte non-Unicode systems; a short illustration follows this list.
    • SAP Relevance:
      • Mandatory as of SAP NetWeaver 7.5; non-Unicode systems are not supported on newer releases.
      • Mandatory for SAP S/4HANA. You cannot move to S/4HANA if your ECC system is non-Unicode; a Unicode conversion is a prerequisite.
      • All new SAP installations are Unicode by default.
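
To make the storage impact concrete, here is a minimal ABAP sketch (variable names are illustrative) comparing the character count of a text with its byte count once encoded as UTF-16; for Latin text the bytes are roughly double the characters:

    DATA(lv_text)  = `Hello`.

    " Convert the text to its UTF-16 (little-endian) byte representation.
    DATA(lv_bytes) = cl_abap_codepage=>convert_to(
                       source   = lv_text
                       codepage = `UTF-16LE` ).

    " strlen counts characters (5); xstrlen counts bytes (10).
    WRITE: / |Characters: { strlen( lv_text ) }|,
           / |UTF-16 bytes: { xstrlen( lv_bytes ) }|.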

II. Why Convert to Unicode? (Key Drivers)

  1. SAP Mandate: Required for modern SAP products (NetWeaver 7.5 and higher, S/4HANA).
  2. Global Business Needs: Support multiple languages and character sets simultaneously (e.g., English, German, Chinese, Arabic in one system).
  3. Data Exchange: Ensures reliable and error-free data exchange between different systems and applications, regardless of their native language settings.
  4. Future-Proofing: Positions the system for future SAP innovations and global expansion.
  5. Simplified Landscape: Reduces complexity by eliminating code page conversion issues between systems.

III. Unicode Conversion Methods

The primary method for Unicode conversion is using the Software Update Manager (SUM) with the Unicode Conversion option (UC).

  1. SUM Export/Import (traditional, slower):
    • Export the non-Unicode data from the source database.
    • Convert the data to Unicode during the export/import process.
    • Import the Unicode data into the target database.
    • Generally incurs the highest downtime.
  2. SUM DMO (Database Migration Option) with UC (recommended for database migrations to HANA with Unicode):
    • Combines the Unicode conversion with a database migration (e.g., from Oracle to HANA).
    • Data is read from the source non-Unicode database, converted to Unicode in flight, and written directly to the target (HANA) database.
    • Often minimizes overall project duration when both a Unicode conversion and a DB migration are required.
  3. SUM with NUC (Near Zero Downtime Technology for Unicode Conversion):
    • Uses a "twin schema" approach or logical replication to minimize downtime.
    • Involves a shadow system and incremental data transfer.
    • Requires additional preparation and resources but dramatically reduces the cutover downtime.
    • Complex to set up and requires specific technical expertise.

IV. Phases of a Unicode Conversion Project (General SUM-based Approach)

  1. Pre-Project / Planning Phase:

    • Readiness Assessment:
      • Run UCCHECK transaction to analyze custom code for Unicode compliance (e.g., syntax errors related to character handling, field lengths, implicit type conversions).
      • Analyze database size increase (typically 20-30% for data, 50-100% for indexes, due to UTF-16).
      • Identify necessary hardware upgrades (CPU, RAM, Disk).
      • Check for certified OS and Database versions compatible with Unicode.
    • Data Archiving/Cleanup: Reduce database size before conversion. Smaller database means faster conversion.
    • Custom Code Remediation: Usually the most time-consuming part. Adapt the custom ABAP programs, screens, and interfaces identified by UCCHECK, using tools like the ABAP Test Cockpit (ATC); a typical finding and its fix are sketched after this list.
    • Resource Planning: Hardware, software, human resources (Basis, ABAP developers, functional consultants).
    • Backup Strategy: Plan for comprehensive backups before and after conversion.
    • Test Strategy: Detailed functional, integration, regression, and performance testing plan.
    • Rollback Strategy: A detailed plan in case of major issues.
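
As referenced under Custom Code Remediation, here is a sketch of a classic UCCHECK finding and a Unicode-safe rewrite (structure and field names are hypothetical):

    " A structure with a non-character component (a packed number):
    DATA: BEGIN OF ls_item,
            matnr TYPE c LENGTH 18,
            menge TYPE p LENGTH 7 DECIMALS 3,   " non-character field
          END OF ls_item,
          lv_buffer TYPE c LENGTH 50.

    " Not Unicode-safe: assigning the whole structure to a character
    " field fails the Unicode syntax check, so this line must go:
    " lv_buffer = ls_item.

    " Unicode-safe rewrite: build the character representation explicitly.
    lv_buffer = |{ ls_item-matnr } { ls_item-menge }|.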
  2. Preparation Phase (on Source System):

    • Implement SAP Notes: Apply specific SAP Notes required for the Unicode conversion tool (SUM UC).
    • Kernel Upgrade: Ensure the kernel is at a supported level for SUM and Unicode.
    • DDIC Consistency Check: Run SE11 consistency checks for table/field inconsistencies and resolve them so that all dictionary objects are consistent and active.
    • Application Server Readiness: Ensure all application servers (if distributed) are ready.
    • DB Consistency Check: Run database-specific checks (e.g., DBCC for SQL Server, DBVERIFY for Oracle).
    • Language Installation: Install all required languages in the source system (SMLT).
  3. Execution Phase (SUM UC):

    • Start SUM: Launch SUM and select the Unicode Conversion option.
    • Pre-processing Phase:
      • SUM performs numerous checks (hardware, software, custom code, database consistency).
      • Creates a "shadow" system for continuous operations during the conversion.
      • Data and dictionary objects are prepared for conversion.
    • Downtime Phase:
      • The actual conversion of data and dictionary objects from non-Unicode to Unicode format takes place.
      • Database size increase occurs during this phase.
      • This is the critical phase where the SAP system is unavailable.
    • Post-processing Phase:
      • Finalizes the conversion.
      • Performs consistency checks on the new Unicode system.
      • Generates new objects.
  4. Post-Conversion Phase:

    • System Startup & Checks:
      • Start SAP system.
      • Verify the system is running in Unicode mode (System -> Status shows "Unicode System: Yes"; the SM51 release notes show a Unicode kernel).
      • Check SM21 (System Log), ST22 (Dumps), DB02 (Database Performance).
    • Database Statistics: Update database statistics on the newly converted Unicode database for optimal performance.
    • Kernel Update: If a new kernel was part of the SUM package, ensure it's fully active.
    • Adjust Profile Parameters: Review and adjust relevant profile parameters (e.g., zcsa/system_language, zcsa/installed_languages, install/codepage/appl_server).
    • Client Management (SCC4, SMLT): Ensure languages are correctly installed and set for clients.
    • RFC Destinations (SM59): Verify RFCs are working, especially for non-SAP systems or systems that are still non-Unicode (requires special configuration).
    • Printers (SPAD): Test all printers for correct character display.
    • Interfaces: Test all interfaces (IDocs, XI/PI, external applications) for correct character handling.
    • Background Jobs (SM37): Verify critical background jobs run correctly.
    • Security & Authorizations: Test user logins and authorizations, especially for users with special characters in their names or roles.
    • Functional Testing: Extensive testing by functional teams.
    • Performance Testing: Compare performance with pre-conversion benchmarks.

V. Important Configurations to Keep in Mind

  1. Custom Code Remediation (UCCHECK and ATC):

    • UCCHECK: Run it repeatedly during the project lifecycle.
    • ABAP Test Cockpit (ATC): Integrate ATC into your development process for continuous checking and to manage the remediation worklist.
    • ABAP Development Tools (ADT) in Eclipse: Use ADT's quick fixes and powerful search/replace for efficient code adaptation.
    • Focus: Address hardcoded field lengths, moves between structures and character-type fields, offset/length access on structures, character string manipulations, and non-Unicode-specific function modules.
  2. Database Sizing:

    • Pre-Conversion Analysis: Accurately estimate the database size increase. Plan for at least 20-30% more data space and 50-100% more index space; for example, a 1 TB data footprint should be budgeted at roughly 1.2-1.3 TB, with index space growing proportionally more.
    • Disk Space: Ensure sufficient free disk space on the database server before starting SUM.
  3. Language Installation (SMLT):

    • Install all languages that might ever be used in the system, even if not actively used today. Adding languages post-conversion is significantly more complex and disruptive.
    • Ensure the correct supplemental languages are installed.
  4. Profile Parameters:

    • zcsa/system_language: Primary system language.
    • zcsa/installed_languages: All installed languages.
    • install/codepage/appl_server: Needs to be adjusted to a Unicode codepage (e.g., 4102 for UTF-16) post-conversion.
    • abap/unicode: Set to 1 (true) for Unicode systems.
    • Note: the Unicode status itself is not read from a profile parameter such as rdisp/max_alt_modes (which only limits the number of alternate sessions per user); verify it via System -> Status ("Unicode System: Yes"). A sample profile excerpt follows this list.
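
  For orientation, a hypothetical instance profile excerpt for a converted system might look like this (language keys and values are illustrative, not a recommendation):

    zcsa/system_language         = E
    zcsa/installed_languages     = DE
    install/codepage/appl_server = 4102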
  5. External Interfaces:

    • Code Page Conversion: Identify all interfaces (IDocs, RFCs, XI/PI, flat files, web services) that exchange data with non-SAP or non-Unicode systems. These will require specific code page conversion settings on the interface level.
    • XML: XML declares its encoding explicitly (commonly UTF-8) and handles Unicode natively, so it usually passes through cleanly, but ensure the underlying data is correctly handled.
    • FTP/File Interfaces: Pay attention to the encoding of flat files exchanged via FTP; state the encoding explicitly when reading and writing, as sketched below.
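
  A minimal ABAP sketch of stating file encodings explicitly (file paths and the code page number are illustrative assumptions):

    DATA: lv_infile  TYPE string VALUE '/interface/in/orders.txt',
          lv_outfile TYPE string VALUE '/interface/out/orders.txt'.

    " Read an inbound file that the partner delivers in UTF-8.
    OPEN DATASET lv_infile FOR INPUT IN TEXT MODE ENCODING UTF-8.

    " Write an outbound file for a legacy single-byte system
    " (code page '1100' = ISO-8859-1 in SAP's numbering).
    OPEN DATASET lv_outfile FOR OUTPUT IN LEGACY TEXT MODE CODE PAGE '1100'.

    " ... READ DATASET / TRANSFER as required; check sy-subrc in real code ...
    CLOSE DATASET: lv_infile, lv_outfile.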
  6. Backup Strategy:

    • Full Backup: Take a full, consistent backup of the system before starting the SUM Unicode conversion.
    • During Downtime: Consider a backup after the critical downtime phase if possible, before starting post-conversion activities.
  7. Performance Tuning:

    • Update Statistics: After conversion, immediately update database statistics. The data distribution has changed, and the optimizer needs new information.
    • Review DB02: Monitor database performance for any new bottlenecks.
    • SQL Plan Cache: Clear and warm up the SQL Plan Cache.
  8. Client Copy Strategy:

    • After Unicode conversion, all client copies must be done from a Unicode source system. Mixing Unicode and non-Unicode client copies is not supported.

30 Interview Questions and Answers (One-Liner) for Unicode and Non-Unicode Conversion

  1. Q: What is the main difference between Non-Unicode and Unicode systems regarding character sets?
    • A: Non-Unicode uses a single code page; Unicode supports all characters from all languages.
  2. Q: What is the typical internal encoding used by SAP Unicode systems?
    • A: UTF-16.
  3. Q: Why is Unicode conversion mandatory for SAP S/4HANA?
    • A: S/4HANA systems can only run on Unicode.
  4. Q: Name a key advantage of Unicode for global businesses.
    • A: Supports multiple languages simultaneously within one system.
  5. Q: What is the primary tool used for a Unicode conversion in SAP?
    • A: Software Update Manager (SUM) with the Unicode Conversion option (UC).
  6. Q: Which transaction code analyzes custom code for Unicode compliance issues?
    • A: UCCHECK.
  7. Q: What is the approximate database size increase (data) after Unicode conversion?
    • A: 20-30%.
  8. Q: Which phase of Unicode conversion is typically the most time-consuming?
    • A: Custom Code Remediation.
  9. Q: What is the purpose of Data Archiving before a Unicode conversion?
    • A: To reduce the database size and shorten conversion downtime.
  10. Q: What is SUM DMO (Database Migration Option) with UC?
    • A: Combines Unicode conversion with a database migration (e.g., to HANA).
  11. Q: What is the primary benefit of SUM with NUC (Near Zero Downtime)?
    • A: Minimizes business downtime during conversion.
  12. Q: How can you verify if an SAP system is running in Unicode mode after conversion?
    • A: Check System -> Status ("Unicode System: Yes") or the SM51 release notes for a Unicode kernel.
  13. Q: Which profile parameter specifies all installed languages in the system?
    • A: zcsa/installed_languages.
  14. Q: Why must all required languages be installed before Unicode conversion?
    • A: Adding languages post-conversion is very complex.
  15. Q: What needs to be done with database statistics after Unicode conversion?
    • A: They must be updated.
  16. Q: Which tool is used to install languages in an SAP system?
    • A: SMLT.
  17. Q: What is the impact of Unicode on character storage size?
    • A: Characters take 2 bytes (UTF-16), increasing database size.
  18. Q: What is ATC (ABAP Test Cockpit) used for in Unicode conversion?
    • A: Centralized custom code analysis and management of remediation worklist.
  19. Q: What are common custom code issues found during UCCHECK?
    • A: Hardcoded field lengths, implicit type conversions, character string manipulations.
  20. Q: Which profile parameter indicates whether ABAP is running in Unicode mode?
    • A: abap/unicode = 1.
  21. Q: What are the main types of encoding used in non-Unicode systems?
    • A: Single-byte (e.g., ISO-8859) or Double-byte (e.g., Shift-JIS) code pages.
  22. Q: What happens if you try to exchange data between non-Unicode systems with different code pages?
    • A: Data loss or garbled characters.
  23. Q: What is the "shadow system" in SUM NUC used for?
    • A: To minimize downtime by running operations in parallel.
  24. Q: Should you perform data archiving before a Unicode conversion?
    • A: Yes, highly recommended to reduce size and downtime.
  25. Q: What is the install/codepage/appl_server parameter's role post-conversion?
    • A: It's adjusted to a Unicode codepage (e.g., 4102).
  26. Q: What specific testing is crucial after a Unicode conversion?
    • A: Functional, integration, regression, and performance testing, especially for interfaces.
  27. Q: Can you do a client copy from a non-Unicode system to a Unicode system?
    • A: No, client copies must be from a Unicode source after conversion.
  28. Q: What should be done with external interfaces (e.g., flat files) after Unicode conversion?
    • A: Ensure correct code page conversion settings are applied.
  29. Q: Why is performance testing important after Unicode conversion?
    • A: Database size increase and new character handling can impact performance.
  30. Q: What is the primary risk of not performing a Unicode conversion for older systems?
    • A: Inability to upgrade to modern SAP releases like S/4HANA or newer NetWeaver versions.

5 Scenario-Based Hard Questions and Answers for Unicode and Non-Unicode Conversion

  1. Scenario: You are performing a Unicode conversion of an SAP ECC 6.0 system using SUM UC. During the PREP_EXTENSION/SUBMOD_EXTENSION_FINALIZE/EU_SWITCH phase (the critical downtime phase), the SUM process repeatedly halts with errors indicating "SQL error 1234: String data right truncation" or "Field too small to hold data." You check the database and confirm that no physical space issues exist.

    • Q: What is the most likely underlying cause of these errors during the data conversion phase, and what detailed steps would you take to diagnose and resolve this without restoring to a non-Unicode backup?
    • A:
      • Most Likely Underlying Cause: The errors "String data right truncation" or "Field too small to hold data" during the actual data conversion phase strongly point to unaddressed custom code issues (specifically, custom table/field definitions) or dictionary inconsistencies that were not fully caught by UCCHECK or SE11 prior to the downtime. When non-Unicode characters (1 byte) are converted to Unicode (UTF-16, 2 bytes), the length of the data in bytes doubles. If a custom field or table definition has a length that was explicitly defined for single-byte characters and doesn't accommodate the doubled byte length, data truncation or overflow occurs during the conversion process, even if the logical number of characters remains the same. This can happen particularly with character-type fields (CHAR, STRING, RAW) and sometimes with custom data types.
      • Detailed Steps to Diagnose and Resolve:
        1. Analyze SUM Logs and Error Details:
          • Action: Immediately review the detailed SUM logs (e.g., EU_SWITCH*.log, SQLSTMT.LOG, R3TRANS.LOG, UPG_APPL*.LOG) in the SUM directory. These logs will specify the exact table name and often the field name where the truncation or overflow occurred. The SQL error number (1234 in the scenario) should also be cross-referenced with database-specific documentation.
          • Rationale: Pinpoints the exact problematic object(s).
        2. Examine Problematic Table/Field Definition (SE11 / DDIC):
          • Action: In SE11 on the shadow instance (or the source system if still available for read-only access), examine the definition of the table and the specific field identified in the logs. Look at the data type (CHAR, RAWSTRING, LRAW, etc.) and its defined length.
          • Rationale: Determine if the defined length is insufficient for UTF-16 representation, considering the actual data.
        3. Check UCCHECK Results (Post-Mortem):
          • Action: Even if UCCHECK ran pre-conversion, re-run it in test mode or review old logs. Sometimes, UCCHECK identifies warnings that are not "show-stoppers" but become critical during actual conversion. Specifically look for warnings related to STRING or RAW type fields, or implicit conversions that might cause length issues.
          • Rationale: Identify if this was a known (but unaddressed) risk.
        4. Database Native Tools (if applicable):
          • Action: If possible, use native database tools to query the table's actual data in the problem field in the shadow instance. Look for records that might exceed the physical length after byte-doubling (e.g., a field containing 10 single-byte characters might become 20 bytes long, but the table might only be defined for 15 bytes in the converted structure).
          • Rationale: Confirm the exact data causing the issue.
        5. Implement SAP Notes/Custom Fixes (Resolution):
          • Action:
            • Search SAP Support Portal: Search for SAP Notes related to the exact error message, SUM phase, and table/field. SAP often releases notes for specific known issues during Unicode conversion.
            • Apply SAP Corrective Notes: These notes might contain dictionary adjustments (e.g., widening table fields) or program corrections.
            • Manual Dictionary Adjustment (Extreme Caution): As a last resort and only if no SAP Note exists and explicitly guided by SAP Support, you might need to manually adjust the length of the problematic field in the ABAP Dictionary (SE11) on the shadow instance and activate it. This is highly risky and should only be done under SAP's guidance.
            • Data Cleansing (if possible): If specific problematic data records can be identified, and the issue is not a dictionary definition problem, consider temporarily deleting or truncating problematic data in the shadow system to allow the conversion to pass, and then re-importing/correcting it later. (Highly risky, use with extreme caution).
        6. Resume SUM:
          • Action: Once the root cause (e.g., dictionary length, problematic data) is addressed and the correction implemented, resume SUM at the failed phase.
          • Rationale: Allows the conversion to proceed past the bottleneck.
  2. Scenario: You have successfully completed a Unicode conversion of your ECC system. Post-conversion, you find that data exchanged with a legacy non-SAP system via flat files (FTP) now contains garbled characters or truncation. The legacy system uses a specific single-byte Chinese code page (e.g., GBK).

    • Q: What are the potential points where character encoding mismatch could be occurring, and what specific configuration changes would you investigate or implement in SAP and at the OS level to ensure correct data exchange?
    • A:
      • Potential Points of Character Encoding Mismatch:
        1. SAP Application Server Encoding: The new Unicode SAP system will internally use UTF-16. When it writes a flat file, it needs to convert this to the target encoding.
        2. FTP Client/Server Encoding: The FTP client (used by SAP) or the FTP server (used by the legacy system) might have default or configured encoding settings that clash.
        3. Operating System (OS) Locale: The OS locale of the SAP application server or the FTP server might influence the default encoding used for file operations.
        4. Legacy System's Interpretation: The legacy system itself might be reading the file using an incorrect code page interpretation.
      • Specific Configuration Changes to Investigate/Implement:
        1. SAP Profile Parameter for File Encoding:
          • Investigation: Check DIR_CHAR_AS_BYTES and sapgui/conversion/cp (less common for server-side files) in SAP profiles. These generally define how SAP handles character conversion for file I/O.
          • Implementation: The most direct way to control the output encoding of a file generated by an ABAP program is within the OPEN DATASET statement itself.
            • Action: Modify the ABAP program responsible for generating the flat file.
            • OPEN DATASET <filename> FOR OUTPUT IN LEGACY TEXT MODE CODE PAGE <target_code_page>.
            • Example: OPEN DATASET l_file FOR OUTPUT IN LEGACY TEXT MODE CODE PAGE '8400'. (8400 is the SAP code page number for GBK; map it to the specific code page of the legacy system. Note that the ENCODING addition of plain TEXT MODE accepts only DEFAULT, UTF-8, or NON-UNICODE.)
            • Line-end handling can be adjusted with additions such as WITH SMART LINEFEED. A self-contained sketch follows this sub-item.
          • Rationale: Explicitly tells SAP how to encode the UTF-16 data into the desired single-byte code page (e.g., GBK) for the external system.
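
          A minimal, self-contained sketch of such a change (the file path and payload are illustrative; '8400' is SAP's code page number for GBK):

            DATA: lv_file TYPE string VALUE '/interface/out/legacy_gbk.txt',
                  lv_line TYPE string.

            " Open in legacy text mode so the kernel converts UTF-16 to GBK on write.
            OPEN DATASET lv_file FOR OUTPUT IN LEGACY TEXT MODE CODE PAGE '8400'.
            IF sy-subrc = 0.
              lv_line = `Sample outbound line`.
              TRANSFER lv_line TO lv_file.
              CLOSE DATASET lv_file.
            ENDIF.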
        2. FTP Client (SAP Side) Configuration:
          • Investigation: If using SAPFTP or SAPHTTP (or custom FTP client calls), check if they allow encoding parameters.
          • Implementation: Ensure the FTP client (often implicitly uses OS locale) is not overriding the file's encoding.
        3. Operating System Locale:
          • Investigation: Check the OS locale of the SAP application server (e.g., locale command on Linux, Get-Culture on PowerShell).
          • Implementation: Ensure the OS locale of the SAP application server is set appropriately to support multi-byte characters if needed, although the ABAP ENCODING clause is usually sufficient.
        4. FTP Server Configuration:
          • Action: Coordinate with the legacy system's administrator. Verify their FTP server's settings for character encoding for incoming files.
          • Rationale: The FTP server might also try to convert the file based on its own defaults.
        5. Legacy System File Reader:
          • Action: The most common issue with legacy systems is their file reader implicitly assuming a certain encoding.
          • Rationale: Inform the legacy system team to explicitly read the incoming file with the expected code page (e.g., GBK).
        6. Test Data:
          • Action: Use very specific test data, including challenging characters that are known to cause issues between code pages (e.g., extended characters, special symbols in the Chinese script).
          • Rationale: Ensures comprehensive testing.
  3. Scenario: Your company plans a database migration of an SAP ECC system from Oracle to SAP HANA, combined with a Unicode conversion. The current ECC is non-Unicode. You decide to use SUM DMO with UC for this combined process. Your initial UCCHECK analysis showed a large number of issues in custom code, and the ABAP team is overwhelmed.

    • Q: How would you structure the custom code remediation process to integrate effectively with the SUM DMO UC project, and what strategies would you use to minimize the impact of custom code adaptation on the overall project timeline?
    • A:
      • Structure the Custom Code Remediation Process:

        1. Early and Continuous Analysis:
          • Action: Begin UCCHECK analysis immediately in the ECC system. Establish a central ABAP Test Cockpit (ATC) system (if not already present). Configure ATC with the S/4HANA readiness checks and Unicode checks. Perform regular code scans.
          • Rationale: Identifies issues early, allows continuous monitoring, and creates a central worklist.
        2. Phased Remediation and Prioritization:
          • Action: Categorize custom code objects based on:
            • Criticality: Business-critical processes vs. obsolete/non-critical.
            • Complexity: Easy fixes vs. complex re-designs.
            • Nature of Fix: Syntax errors (easy, automated via ADT Quick Fixes) vs. semantic errors (harder, requiring developer understanding of data model changes).
          • Action: Focus on syntax errors first (easy wins for ABAP team productivity). Then prioritize critical business processes.
          • Rationale: Provides manageable work packages, builds developer confidence, and addresses show-stoppers first.
        3. Dedicated Custom Code Adaptation Team (or Resource Allocation):
          • Action: Assign a dedicated team or specific ABAP developers to focus solely on custom code adaptation. Ensure they are trained on Unicode programming guidelines and efficient use of ADT/ATC.
          • Rationale: Prevents ad-hoc fixes and ensures consistent, high-quality remediation.
        4. Utilize Automation and Tools:
          • Action: Leverage ADT's quick fixes, global search/replace (with caution), and the Custom Code Migration Worklist (CMWL) app (if relevant, part of S/4HANA readiness, but also applies to general Unicode). CMWL can suggest and sometimes automatically apply fixes.
          • Rationale: Accelerates common and repetitive fixes, freeing developers for complex issues.
        5. Code Freeze and Sandbox for Conversion:
          • Action: Implement a code freeze in the development system (DEV) a few weeks before the actual SUM DMO UC run.
          • Action: Perform a system copy (refresh) of the DEV system to a dedicated sandbox for the conversion testing. This allows developers to work on a stable code base while the conversion team tests the SUM process and adapts the copy.
          • Rationale: Ensures a consistent code base for conversion and allows parallel activities.
        6. Iterative Testing:
          • Action: After each batch of code remediation, perform unit and integration testing in the sandbox.
          • Rationale: Catches issues early before they propagate.
      • Strategies to Minimize Impact on Overall Project Timeline:

        1. Start Early: Initiate custom code analysis and remediation as soon as the project begins, ideally even before the main SUM DMO planning is complete.
        2. Parallel Activities: Run custom code remediation in parallel with other project streams (e.g., infrastructure setup, functional scope definition).
        3. Clean Core Strategy: Encourage and enforce a "Clean Core" strategy from the outset. This means identifying obsolete custom code and decommissioning it (e.g., using UPL or the ABAP Call Monitor to identify unused code) rather than adapting it. This drastically reduces the scope of adaptation.
        4. Automated Testing: Invest in automated regression testing for critical business processes. After custom code adaptation, automated tests can quickly confirm that core functionalities still work, reducing manual testing effort.
        5. Knowledge Transfer: Ensure that lessons learned from the first few remediated objects are shared across the team to improve efficiency.
        6. Proactive Problem Solving: Don't wait for SUM to fail. Continuously monitor UCCHECK/ATC results and address high-priority issues as they are identified.
  4. Scenario: You have just completed a Unicode conversion for an SAP ECC system. The system is up and running. However, some users report issues with displaying certain special characters (e.g., German umlauts, French accents, or custom symbols) in reports and input fields within the SAP GUI. Data in the database appears to be correct, and SM51 confirms the system is Unicode.

    • Q: What are the possible causes for these display issues after a successful Unicode conversion, and what specific troubleshooting steps would you take in SAP Basis and on the client side to resolve them?
    • A:
      • Possible Causes for Display Issues:
        1. SAP GUI Client Issues: The most common cause. The user's SAP GUI client configuration might not be set to display Unicode characters correctly.
        2. OS Locale/Font Issues on Client: The client's operating system locale or installed fonts might not fully support the specific characters.
        3. Printer/Output Device Issues: Printers are not configured for Unicode, leading to garbled output.
        4. Incorrect Language Settings in SAP: The user's logon language or the system's installed languages (SMLT) are incomplete.
        5. Custom Font/Character Set Usage: If custom fonts or very specific character sets are used, they might not be correctly mapped in Unicode.
        6. Legacy Data with Conversion Errors (Rare post-successful UC): Though less likely if SM51 shows Unicode, some very rare character encoding issues might persist in specific older data if the initial conversion had edge cases.
      • Specific Troubleshooting Steps:
        1. SAP GUI Configuration (Client Side - Most Likely):
          • Action: On the user's workstation, open SAP GUI Options -> Interaction Design -> Globalization. Ensure "Use Code Page" is unchecked, and "Activate Unicode (UTF-16)" is checked.
          • Action: Ensure the latest version of SAP GUI is installed. Older versions might have limited Unicode support.
          • Rationale: This is the primary setting that tells SAP GUI to interpret incoming data as Unicode.
        2. SAP System Language Settings:
          • Action: In SU01 for the affected user, check their default logon language. Ensure it's correct.
          • Action: In SMLT on the SAP server, verify that all necessary languages (including the one with problematic characters) are fully installed (status green).
          • Rationale: Incorrect language installation or user settings can affect character display.
        3. Operating System Locale and Font on Client:
          • Action: On the user's workstation, verify the OS locale settings (e.g., Region & Language settings on Windows) are appropriate for the languages being used. Ensure necessary font packs are installed (e.g., Arial Unicode MS, Noto Sans).
          • Rationale: The OS and its fonts render characters; if they lack support, SAP GUI cannot display them.
        4. Printer Configuration (SPAD):
          • Action: If issues occur during printing, check SPAD for the affected printer. Ensure the device type supports Unicode (e.g., SWINCF, Z_SWINCF, or a specific device type for your printer with Unicode capabilities). Adjust font selection if necessary.
          • Rationale: Printers also need to understand Unicode character sets.
        5. Character Set Check in Report/Screen:
          • Action: For specific reports or screens, examine the ABAP code or screen painter (SE51) definitions. While less common after a successful UC, ensure no hardcoded non-Unicode character sets are being forced.
          • Rationale: A safety check if all else fails.
        6. Test with TEST_SPTH:
          • Action: Execute transaction TEST_SPTH in the SAP GUI. This program displays a wide range of Unicode characters. Test if the problematic characters appear correctly here. This helps isolate if it's a general GUI issue or specific to certain applications.
          • Rationale: A quick test to confirm basic Unicode display capability of the SAP GUI client.
  5. Scenario: Post-Unicode conversion, your SAP system's overall performance has significantly degraded, particularly for reports that involve large table scans and complex data manipulations. The database (now HANA) is consuming more memory than anticipated, and CPU utilization is consistently high. Your initial sizing was based on the non-Unicode database size plus 30%.

    • Q: What are the primary reasons for this performance degradation after Unicode conversion, and what specific optimization strategies would you implement at the database and application level to restore performance?
    • A:
      • Primary Reasons for Performance Degradation:
        1. Increased Data/Index Size: Unicode conversion (UTF-16) means characters now take 2 bytes instead of 1. This increases the physical size of tables and indexes, leading to:
          • More data to read from disk (even on HANA, data is persisted).
          • Larger memory footprint, potentially exceeding HANA's memory capacity (leading to paging/unloading of tables).
          • More CPU cycles for data processing and compression/decompression.
        2. Outdated Database Statistics: After conversion, data distribution fundamentally changes (e.g., character string lengths, new indexes). If database statistics are not updated, the query optimizer will use outdated information, leading to inefficient execution plans.
        3. Inefficient Custom Code: While UCCHECK handles syntax, it doesn't optimize logic. Custom code designed for non-Unicode/row-store databases (e.g., loop at internal table and SELECT SINGLE for every row, SELECT * without WHERE clauses, extensive MOVE-CORRESPONDING with character fields) may perform poorly on a column-store HANA database with larger Unicode data.
        4. Insufficient Hardware Sizing: The initial 30% sizing might have been an underestimate, especially for indexes, or the workload during the reports is higher than anticipated for the increased data volume.
        5. Increased Network Traffic: Larger data records mean more data transferred over the network between application server and database, potentially causing latency.
      • Specific Optimization Strategies:
        1. Database Level (HANA Specific):
          • Update Database Statistics (CRITICAL):
            • Action: Immediately run a full update of database statistics for all relevant tables (UPDATE STATISTICS). Schedule regular, frequent statistic updates.
            • Rationale: Ensures the HANA optimizer has accurate information to generate efficient execution plans for new Unicode data.
          • Review HANA Memory Usage (M_SERVICE_MEMORY, HANA_Memory_Overview script):
            • Action: Monitor HANA memory allocation and consumption. Identify if indexserver is frequently unloading tables or if TOTAL_MEMORY_LIMIT is being hit.
            • Rationale: Confirms if memory is the bottleneck.
            • Action: If needed, consider temporarily increasing total_memory_limit or, for a permanent solution, adding more RAM to the HANA server if analysis proves it's undersized for the new Unicode footprint and workload.
          • Reorganize/Optimize Tables (Less Common for HANA, but Check): While HANA column store is self-optimizing, occasionally OPTIMIZE COMPRESSION on specific very large tables might be beneficial if analysis suggests it.
          • Check for Expensive Statements (M_EXPENSIVE_STATEMENTS, SQL Plan Cache):
            • Action: Identify the specific SQL queries or CDS views causing high CPU/memory during report execution.
            • Rationale: Pinpoints direct targets for optimization.
        2. Application Level (ABAP Custom Code Optimization):
          • Focus on Top N Expensive Statements: Use the output from M_EXPENSIVE_STATEMENTS to identify the worst-performing custom reports/queries.
          • Code Review and Re-write for HANA:
            • Action: Re-write inefficient ABAP code to leverage HANA's capabilities:
              • Push-down Logic: Perform calculations and aggregations directly in the database (e.g., using CDS views or AMDP - ABAP Managed Database Procedures). Avoid pulling large datasets to the application server for processing; see the sketch after this list.
              • Avoid SELECT * without WHERE: Always specify necessary fields and WHERE clauses.
              • Replace FOR ALL ENTRIES with Joins: Convert FOR ALL ENTRIES to database-level joins in CDS or AMDP, if possible.
              • Optimize Internal Tables: Use appropriate internal table types (hashed, sorted).
              • Minimize Loop-AT-SELECTs: Avoid nested loops with SELECT SINGLE inside.
            • Rationale: Adapts code to the column-store paradigm for better performance.
          • Functional Re-evaluation:
            • Action: Work with functional teams to identify if any custom reports can now be replaced by standard S/4HANA Fiori apps or simplified standard reports that are already optimized.
            • Rationale: Eliminates custom code maintenance and leverages SAP's optimized solutions.
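
          As referenced in the push-down bullet above, a minimal sketch of moving aggregation into the database (table, field, and variable names are hypothetical):

            " Aggregate in HANA instead of looping over rows on the app server.
            DATA(lv_date_from) = CONV d( '20240101' ).

            SELECT matnr,
                   SUM( menge ) AS total_qty
              FROM zsd_sales_items
              WHERE budat >= @lv_date_from
              GROUP BY matnr
              INTO TABLE @DATA(lt_totals).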
        3. Profile Parameter Adjustments:
          • Action: Review relevant SAP profile parameters in RZ10 that might impact performance (e.g., rdisp/ROLL_MAXFS, rdisp/PG_MAXFS - less direct for HANA, but still relevant for ABAP memory; abap/heap_area_total).
          • Rationale: Ensure ABAP memory is sufficient for increased data sizes.
        4. Network Review:
          • Action: If it's a distributed system, check network latency between application servers and the HANA database.
          • Rationale: Larger data sets mean more network traffic.
