We have moved to www.dataGenX.net, Keep Learning with us.

Monday, July 29, 2013

Setting Up and Using PuTTY

  •  
  • Run the installer package and extract the PuTTY components. PuTTY may be downloaded from http://www.chiark.greenend.org.uk/~sgtatham/putty/
  • Start the PuTTYGen application. This application is used to create your authentication keys. Select either "SSH2 RSA" or "SSH2 DSA" for the "Type of key to generate", with 1024 bit key size or greater. Press the "Generate" button and follow the instructions on the screen.

Sunday, July 28, 2013

Working with EditPlus Text Editor-Regular Expression How To

Editplus is a lot better than the regular text editor, Notepad.
From all it’s features I like RegExp Support the most, and than comes the block select feature.
Here are the quick lines to carry out regular tasks using regular expression in Editplus. It’s kinda downloadable cheetsheet list.

Remove all empty lines:

Find: “^\n” (Ignore double-quotes in all find/replace)
Replace: “”
Where,
^ – Beginning of the line
\n – New Line

Remove Multiple Spaces convert into single space:

Find: ” +”
Replace: ” “
Where,
+ – find one or more occurance of space character.

Comment multiple line of code:

Find: “^”
Replace: “#” or “//”
You may optionally use: Edit Menu > Format > Line Comment.
Generate Comma Separated List from new line delimited list:

Find: “\n”
Replace: “, “
This helps in even joining some of lines of code instead of replacing by comma you may replace it with “”.

Manipulate columns display order / punctuation:

Find: “([0-9]+)\t([a-zA-Z]+)”
Replace: “\2\t\1″
Where,
[0-9]+ – Finds one or more digits
[a-zA-Z]+ – Finds one or more characters
() – mark the block or capture the group
\2 – 2nd mark expression
Eg:
123 abc
345 cde
567 efg
Becomes:
abc 123
cde 345
efg 567
The Other Way:
- Press Alt+C
- Drag you mouse to select respective column and click
- Copy / Cut as required
[ad#ad-2-300x250]

Append / Add semicolon (any character) at the end of the line:

Find: “\n”
Replace: “;\n”

Enclose lines by quotes:

Find: “\n”
Replace: “‘\n’”

Delete all lines containing a given STRING:

Find: “^.*STRING.*$”
Replace: “”

Remove lines not containing a given STRING:

I don’t know how to do this!! :)

Convert tab separated file into insert statements:

TSV: abcd de4 iirn 34399
SQL: INSERT INTO TABLENAME VALUES (“abcd”, “de4″, “iirn”,”34399″);
Find: “(.*)\t(.*)\t(.*)\t(.*)”
Replace: “INSERT INTO TABLENAME VALUES (“\1″, “\2″, “\3″,”\4″);”

Format the telephone number:

Find: “([0-9][0-9][0-9])([0-9][0-9][0-9])([0-9].*)”
Replace: “\1-\2-\3″
Eg.:

Original: 1231231231
Formatted-1: 123-123-1231

Remove Brackets:

Find: “\(|\)”
Replace: “”
Where,
\( – Match (. \ is required to escape marking the expression.
| – or

Replace 1st occurrence of character:

Find: ” (.*)”
Replace: “-\1″
Where,
(.*) – matches everything and marks the block
** Make sure you ignore double-quotes(“) while writing in find / replace boxes.

EditPlus supports following regular expressions in Find, Replace and Find in Files command.

Expression – Description
  • \t – Tab character.
  • \n – New line.
  • . – Matches any character.
  • | – Either expression on its left and right side matches the target string.
  • [] – Any of the enclosed characters may match the target character.
  • [^] – None of the enclosed characters may match the target character.
  • * – Character to the left of asterisk in the expression should match 0 or more times.
  • + – Character to the left of plus sign in the expression should match 1 or more times.
  • ? – Character to the left of question mark in the expression should match 0 or 1 time.
  • ^ – Expression to the right of ^ matches only when it is at the beginning of line.
  • $ – Expression to the left of $ matches only when it is at the end of line.
  • () – Affects evaluation order of expression and also used for tagged expression.
  • \ – Escape character. If you want to use character “\” itself, you should use “\\”.

Notable Features of Editplus are :

  • Spell checking
  • Regex-based find & replace
  • Encoding conversion
  • Newline conversion
  • Syntax highlighting
  • Multiple undo/redo
  • Rectangular block selection
  • Auto indentation
  • Code folding (Text folding)
- See more at: http://kedar.nitty-witty.com/blog/working-with-editplus-text-editor-regular-expression-how-to#sthash.3CzDA4Bl.dpuf

Editplus is a lot better than the regular text editor, Notepad.
From all it’s features I like RegExp Support the most, and than comes the block select feature.
Here are the quick lines to carry out regular tasks using regular expression in Editplus. It’s kinda downloadable cheet-sheet list.

Remove all empty lines:
Find: “^\n” (Ignore double-quotes in all find/replace)
Replace: ""
Where,
^ – Beginning of the line
\n – New Line

Monday, July 22, 2013

File Extensions Used in Linux

.bz2       A file compressed using bzip2 .                             
.c         A file written in the C programming language.                           
.conf      A configuration file. In some cases, conf is used in a file name, but not as an extension.                 
.deb       A Debian package for installing software.                             
.lock      A lock file that prevents the use of another file.       

Friday, July 19, 2013

DataStage Jobs hang or have slow performance when temp directories have many files


Datastage jobs write multiple files to temporary directories which are not automatically cleaned up. When the number of files grows large, DataStage jobs will have slower performance or can appear to hang.

DataStage jobs write multiple files to temporary directories which are not automatically cleaned up. When the number of files grows large, DataStage jobs will have slower performance or can appear to hang. For sites who have been running DataStage for a year or more and have not previously performed cleanup of these directories, they could contain 100,000 or more files.
Perform periodic cleanup of old DataStage files in these directories to prevent performance problems relating to the number of files in the directories.

Thursday, July 18, 2013

Generating an SSH key pair using PuTTY



    1. Start PuTTYgen by clicking Start > Programs > PuTTY > PuTTYgen. The PuTTY Key Generator panel is displayed.
    2. Click SSH-2 RSA as the type of key to generate.
    Note: Leave the number of bits in a generated key value at 1024.
    1. Click Generate and then move the cursor around the blank area of the Key section to generate the random characters that create a unique key. When the key has been completely generated, the information about the new key is displayed in the Key section.

    Wednesday, July 17, 2013

    DWH One Liners - Part-2


    Part 1 : You can find HERE


    dimension tables
    Tables used to establish the context of the facts.

    dimensional database
    A single large table of facts described using measures and dimensions.

    dimensions
    In a dimensional database, the context for a fact and is used in queries to group related facts together.

    Tuesday, July 16, 2013

    DWH One Liners - Part-1


    Part 2 : You can find HERE


    Ad hoc reports
    Predefined reports that cater to the requirements of novice users.

    Analytics
    Refers to the analysis and interpretation of data using suitable statistical tools and techniques.

    Business intelligence
    Refers to the technologies, methods, and processes employed to understand data stored in data warehouses that will in turn help make important business decisions.
    BI is getting the right information into the right people's hand in a format that allows them to understand the data quickly.
    And more formal definition from wiki:
    "BI refers to skills, processes, technologies, applications, and practices to support decision making"

    (Thanks for this share Avik Sadhu)

    Friday, July 12, 2013

    Basics of Teradata Database


    What exactly is a Teradata Database?

    • Teradata Database is a RDMS (Relational Database Management System) which helps in driving the company’s Data warehouse.
    • It provides foundation for the company to grow, compete, evolve business by getting answers to a new generation of questions (typically “what if” queries)
    • It supports business growth from gigabytes to whooping hundreds of Terabytes
    • Teradata Database is an Open system and compliant with major support from ANSI standards
    • It works on UNIX MP-RAS, WINDOWS and LINUX operating systems, connects with other client through Network Attached channel (TCP/IP) and Channel connection for Mainframe.

    Thursday, July 11, 2013

    Convert a single row into multiple rows ( horizontally pivoting ) with Pivot stage ?


    In this example, the Pivot Enterprise stage is set up to horizontally pivot some data.
    You can generate a pivot index that will assign an index number to each row within sets of horizontally pivoted data. The following tables provide examples of data before and after a horizontal pivot operation.


    Input Data
    REPID,last_name,Jan_sales,Feb_sales,Mar_sales
    100,Smith,1234.08,1456.80,1578.00
    101,Yamada,1245.20,1765.00,1934.22
    102,Xing,2190.89,1287.98,2054.55
    103,Anderson,1498.09,1287.23,3298.76


    Wednesday, July 10, 2013

    Unable to open jobs with error "Unable to open the Job Unable to load RID"


    Unable to open jobs with error "Unable to open the Job Unable to load RID - c2e76d84.43058877.7e01af4f.cd92bbcd-2ebc-4cb3.833e.37a962373ef7"

    Also in SystemOut.log file, it has the following:
    [8/25/10 21:21:46:475 UTC] 000000b1 SystemOut O 2010-08-25 21:21:46,475 ERROR ojb.OjbPersistentEObjectPersistence - java.lang.OutOfMemoryError


    Tuesday, July 09, 2013

    When datastage file system becomes full, RT_LOGxxx files can become corrupted. Here is a manual procedure to recreate them


    When many datastage jobs are running for long time, the RT_LOGxxx files can become very big. If the log files are not regularly purged, in some cases, a file system full on the datastage file system can occur. If the corruption of datastage system files is restricted to the RT_LOGxxx files, a procedure to delete and recreate the RT_LOGxxx files can be attempted in order to recover from the corruption.

    Sunday, July 07, 2013

    How To Hide Text In Notepad


    Here is a small trick to hide text inside your windows default text editor i.e Notepad usingcommand prompt,  but this trick works only on NTFS file system.

    Steps to hide text in a Notepad
    1. Open your command prompt Start-->Run and Type cmd
    2. Type cd.. to move to C:\> drive or Type cd desktop to move to your desktop.
    3. Type the below code in your 

    Friday, July 05, 2013

    DATABASE Command in TeraData


    When users negotiate a successful logon to Teradata, they are automatically positioned in a default database as defined by the database administrator. When an SQL request is executed, by default, it looks in the current database for all referenced objects.

    There may be times when the object is not in the current database. When this happens, the user has one of two choices to resolve this situation. One solution is to qualify the name of the object along with the name of the database in which it resides. To do this, the user simply associates the database name to the object name by connecting them with a period (.) or dot as shown below:

    Thursday, July 04, 2013

    Lock/Unlock Oracle Users


    In spite of deleting an oracle user, we can LOCK it. In that way the user won’t be accessible. In future if we want we can UNLOCK it.

    create a user named JACK which is LOCKED:
    SQL> create user jack identified by jack account lock;
    User created.

    Wednesday, July 03, 2013

    Teradata Users



     In Teradata, a user is the same as a database with one exception. A user is able to logon to the system and a database cannot. Therefore, to authenticate the user, a password must be established. The password is normally established at the same time that the CREATE USER statement is executed. 
    The password can also be changed using a MODIFY USER command.

    Tuesday, July 02, 2013

    List the tables in TeraData



    For listing all the tables, views and macro , execute any sql from below as per need


    select * from dbc.tables;


    Monday, July 01, 2013

    List the tables in DB2 LUW & z/OS



    List the tables in DB2 LUW


    list tables for all
    list tables for schema <schema_name>

    or

    select * from syscat.tables ;