Introduction of Alphabet, String, and Languages

Language Operations

An operation originally defined on strings is easily extended to an entire language: we simply apply the operation to every sentence of the language.

Reflection / image of a language

The reflection of a language L is the set of strings that are the reflection of a sentence:

L^R = {x | x = y^R ∧ y ∈ L}

(the condition x = y^R ∧ y ∈ L is known as the characteristic predicate);

note that ∧ is conjunction: p ∧ q means “p and q”.

Here, the strings x are specified by the property expressed in the characteristic predicate.

In other words, for any language L, L^R = {x^R | x ∈ L}.

For example, let:

L = {001, 10, 111}; then L^R = {100, 01, 111}.
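As a minimal sketch, reflection can be tried out in Python by modelling a finite language as a set of strings. The helper name reflect is illustrative and does not come from the text above.

```python
def reflect(language):
    """Return L^R: the set of reversed sentences of a finite language."""
    return {sentence[::-1] for sentence in language}


L = {"001", "10", "111"}

# Compare with the worked example: L^R = {100, 01, 111}.
print(reflect(L) == {"100", "01", "111"})

# Reflecting twice gives back the original language.
print(reflect(reflect(L)) == L)
```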

Concatenation of languages

The concatenation of languages L₁ and L₂ is defined as:

L₁•L₂ = {xy | x ∈ L₁ ∧ y ∈ L₂}.

Example 1: Consider the following table.

L₁	L₂	L₁L₂
{a, ab}	{bb, b}	{a, ab}{bb, b} = {abb, ab, abbb}
{a, ab}	{a, ab}	{a, ab}{a, ab} = {aa, aab, aba, abab}
{a, aa}	{a, aa}	{a, aa}{a, aa} = {aa, aaa, aaaa}
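The rows of the table can be reproduced with a short Python sketch, assuming finite languages are modelled as sets of strings. The function name concat is a hypothetical helper chosen for illustration.

```python
def concat(l1, l2):
    """Return L1•L2 = {xy | x in L1 and y in L2} for finite languages."""
    return {x + y for x in l1 for y in l2}


rows = [
    ({"a", "ab"}, {"bb", "b"}),
    ({"a", "ab"}, {"a", "ab"}),
    ({"a", "aa"}, {"a", "aa"}),
]

for l1, l2 in rows:
    print(sorted(l1), sorted(l2), sorted(concat(l1, l2)))
```

The first row yields three strings rather than four because abb arises twice (as a•bb and as ab•b), and a set keeps only one copy.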

From this, the m-th power of a language is defined simply:

L^m = L^(m−1)•L, for m > 0

L⁰ = {ε}

That is,

L⁰ = {ε},
L¹ = L,
L² = LL,
L³ = LLL, etc.

Special case 1: The empty language ∅ and the empty string ε behave as follows:

∅⁰ = {ε}

L•∅ = ∅•L = ∅

L•{ε} = {ε}•L = L
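The power operation and the special cases above can be checked with a small Python sketch. It assumes that the Python empty string "" stands for the empty string ε and that set() stands for the empty language; concat and power are illustrative names.

```python
def concat(l1, l2):
    """Return the concatenation of two finite languages."""
    return {x + y for x in l1 for y in l2}


def power(language, m):
    """Return L^m using L^0 = {ε} and L^m = L^(m-1)•L."""
    result = {""}  # "" plays the role of ε
    for _ in range(m):
        result = concat(result, language)
    return result


L = {"a", "ab"}
EMPTY = set()  # the empty language

print(power(EMPTY, 0) == {""})                                  # empty language to the power 0
print(concat(L, EMPTY) == EMPTY and concat(EMPTY, L) == EMPTY)  # empty language is absorbing
print(concat(L, {""}) == L and concat({""}, L) == L)            # {ε} is neutral
print(power(L, 2) == concat(L, L))                              # L² = LL
```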

The m-th power operation gives a concise definition of the language of strings whose length does not exceed some integer k. Consider the alphabet Σ = {a, b}.

For k = 3, the language is defined as

L = Σ⁰ ∪ Σ¹ ∪ Σ² ∪ Σ³

= {ε} ∪ {a, b} ∪ {aa, ab, ba, bb} ∪ {aaa, aab, aba, abb, baa, bab, bba, bbb}

= {ε, a, b, aa, ab, ba, bb, aaa, aab, aba, abb, baa, bab, bba, bbb}.

The language L may also be defined as:

L = {ε, a, b}³.
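The two definitions of the language of strings of length at most 3 can be compared in Python. This is a sketch under the assumption that "" models the empty string; the helper names are hypothetical.

```python
def concat(l1, l2):
    return {x + y for x in l1 for y in l2}


def power(language, m):
    """L^0 = {ε}; L^m = L^(m-1)•L."""
    result = {""}
    for _ in range(m):
        result = concat(result, language)
    return result


sigma = {"a", "b"}
k = 3

# Union of the powers of the alphabet from 0 to k.
by_union = set().union(*(power(sigma, i) for i in range(k + 1)))

# The alternative definition: ({ε} together with the alphabet) to the power k.
by_power = power(sigma | {""}, k)

print(len(by_union), by_union == by_power)
```

Adding the empty string to the alphabet works because each of the k factors may independently contribute either a symbol or nothing.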

Set Operations

Since a language is a set, we can apply the set operations union (∪), intersection (∩), and difference (\) to languages. The set relations inclusion (⊆), strict inclusion (⊂), and equality (=) apply as well.

We define the universal language as the set of all strings over the alphabet Σ, of any length, including zero. Thus, the universal language is infinite, and it is the union of all the powers of the alphabet:

L_universal = Σ⁰ ∪ Σ¹ ∪ Σ² ∪ …

The complement of a language L over the alphabet Σ, denoted by ¬L, is the set difference:

¬L = L_universal \ L

i.e., the set of the strings over Σ that are not in L.

When the alphabet is understood, the universal language can be expressed as the complement of the empty language:

L_universal = ¬∅

Note 1: The complement of a finite language is always infinite. For example, over the alphabet {a, b}, the set of strings of any length except two is:

¬({a, b}²) = {ε} ∪ {a, b}¹ ∪ {a, b}³ ∪ …

On the other hand, the complement of an infinite language may or may not be finite. For example, the complement of the universal language is the empty language, which is finite, whereas the complement of the set of even-length strings over the alphabet {a} is infinite:

L = {a²ⁿ | n ≥ 0}, ¬L = {a²ⁿ⁺¹ | n ≥ 0}

i.e., L = {a⁰, a², a⁴, …} is an infinite language and its complement ¬L = {a¹, a³, a⁵, …} is also an infinite language.
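Because the universal language is infinite, a program can only inspect a finite slice of a complement. The following Python sketch assumes a length bound max_len (an assumption introduced here, not part of the definition) and lists the strings of the complement up to that length; the function names are illustrative.

```python
from itertools import product


def strings_up_to(alphabet, max_len):
    """All strings over the alphabet with length 0..max_len."""
    return {"".join(p)
            for n in range(max_len + 1)
            for p in product(sorted(alphabet), repeat=n)}


def complement_up_to(language, alphabet, max_len):
    """The part of the complement of L whose strings have length <= max_len."""
    return strings_up_to(alphabet, max_len) - set(language)


# Complement of {a, b}², truncated at length 3.
squares = strings_up_to("ab", 2) - strings_up_to("ab", 1)
print(sorted(complement_up_to(squares, "ab", 3), key=lambda s: (len(s), s)))

# Complement of the even-length strings over {a}, truncated at length 5.
evens = {"a" * (2 * n) for n in range(3)}
print(sorted(complement_up_to(evens, "a", 5), key=len))
```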

Star and cross

The star operation (also known as Kleene star, or closure by concatenation) is defined as the union of all the powers of the base language:

L* = ⋃ Lⁱ, for i = 0 … ∞

⇒ L* = L⁰ ∪ L¹ ∪ L² ∪ …

⇒ L* = {ε} ∪ L ∪ L² ∪ …

Example 2: Let L = {ab, ba}; then:

L* = {ε, ab, ba, abab, abba, baab, baba, …}.

Every string of the star can be divided into substrings which are sentences (refer to definition of sentence) of the base language L.

Note that, even with a finite base language L, the “starred” language L* is infinite (the only exceptions are L = ∅ and L = {ε}, for which L* = {ε}).

The starred language L* may be identical to the base language L, as in:

L = {a²ⁿ | n ≥ 0} and L* = {a²ⁿ | n ≥ 0}.

Thus, L = L*.
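Since L* is usually infinite, the Python sketch below computes only the strings of L* up to a chosen length. The bound max_len and the name star_up_to are assumptions made for illustration; "" models the empty string.

```python
def star_up_to(language, max_len):
    """Strings of L* with length <= max_len, for a finite language L."""
    result = {""}    # L^0 = {ε}
    frontier = {""}  # strings found in the previous round
    while frontier:
        # Extend the newest strings by one more sentence of L.
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


# The language {ab, ba} from the example, truncated at length 4.
print(sorted(star_up_to({"ab", "ba"}, 4), key=lambda s: (len(s), s)))

# Even-length strings over {a}: within the bound, the star adds nothing new.
evens = {"a" * (2 * n) for n in range(4)}  # lengths 0, 2, 4, 6
print(star_up_to(evens, 6) == evens)
```

The loop stops because only finitely many strings fit under the length bound, so at some point no new string is produced.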

Special case 2: When the base language is an alphabet Σ, the star Σ* contains all the strings obtained by concatenating terminal symbols.

For example, let Σ = {a} be the base language. Then:

Σ* = {ε, a, aa, aaa, …}.

⇒ Σ* = Σ⁰ ∪ Σ¹ ∪ Σ² ∪ …

⇒ Σ* = L_universal.

⇒ Σ* is the universal language of the alphabet Σ.

We know that any formal language is a subset of the universal language of the same alphabet.

Therefore, we can also write the relation:

L ⊆ Σ*

to say that L is a language over the alphabet Σ.

Properties of star

L ⊆ L* (known as the monotonicity property).

if (x ∈ L* ∧ y ∈ L*) then xy ∈ L* (known as the closure by concatenation property).

(L*)* = L* (known as the idempotence property).

(L*)^R = (L^R)* (known as the commutativity of star and reflection property).

The monotonicity property says that any language is included in its star, i.e., L ⊆ L*. For the language L = {a²ⁿ | n ≥ 0}, however, the inclusion is an equality: L* = L. This equality follows from the idempotence property and the fact that L can be equivalently defined as the starred language {aa}*.

Special case 3: For the empty language and the empty string we have the identities:

∅* = {ε}.

{ε}* = {ε}.
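The four properties and the two identities can be spot-checked on a sample language, restricted to strings up to a fixed length. This Python sketch is only a finite check under that assumed bound, not a proof; all function names are hypothetical.

```python
def reflect(language):
    return {w[::-1] for w in language}


def star_up_to(language, max_len):
    """Strings of L* with length <= max_len ("" models ε)."""
    result, frontier = {""}, {""}
    while frontier:
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


def check_star_properties(language, max_len):
    """Check each property of star on strings of length <= max_len."""
    star = star_up_to(language, max_len)
    short = {w for w in language if len(w) <= max_len}
    return {
        "monotonicity": short <= star,
        "closure": all(x + y in star for x in star for y in star
                       if len(x + y) <= max_len),
        "idempotence": star_up_to(star, max_len) == star,
        "reflection": reflect(star) == star_up_to(reflect(language), max_len),
    }


print(check_star_properties({"ab", "b"}, 6))
print(star_up_to(set(), 6) == {""}, star_up_to({""}, 6) == {""})
```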

Cross

Cross (or non-reflexive closure by concatenation) is derived from star:

L⁺ = ⋃ Lⁱ, for i = 1 … ∞

⇒ L⁺ = L ∪ L² ∪ …

It differs from the star in that the union excludes the power zero. Thus, the following relations hold:

L⁺ ⊆ L*

ε ∈ L⁺ if and only if ε ∈ L

L⁺ = LL* = L*L

Example 3: {ε, aa}⁺ = {ε, aa, aaaa, …}. Therefore, {ε, aa}⁺ = {a²ⁿ | n ≥ 0}.
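A length-bounded version of the cross can be sketched in Python using the relation L⁺ = LL*. The bound max_len and the helper names are assumptions made for illustration; "" models the empty string.

```python
def star_up_to(language, max_len):
    """Strings of L* with length <= max_len."""
    result, frontier = {""}, {""}
    while frontier:
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


def plus_up_to(language, max_len):
    """Strings of L+ = L•L* with length <= max_len."""
    star = star_up_to(language, max_len)
    return {x + y for x in language for y in star if len(x + y) <= max_len}


# The example {ε, aa}, truncated at length 6.
print(sorted(plus_up_to({"", "aa"}, 6), key=len))

# Without ε in the base language, ε is not in the cross.
print("" in plus_up_to({"aa"}, 6))
```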

Theorem 1: Prove that for every language L, (L*)* = L*.

Proof: By definition, for any set of strings Σ, Σ* is the language in which each string is a concatenation of zero or more strings from Σ. Formally:

Σ* = {w | w = ε, or w = x₁…x_k for some k ≥ 1, with each x_i ∈ Σ}.

It follows immediately that Σ ⊆ Σ*. Taking Σ = L* gives L* ⊆ (L*)*, so it remains to prove the converse inclusion.

Let w ∈ (L*)*; we need to prove that w ∈ L*.

When w = ε, then w ∈ L*, because ε ∈ L* by the definition of star.

Now assume that w ≠ ε. Then we can write w = x₁…x_k for some k ≥ 1, where all x_i ∈ L* (for example, if w = puja, we may take x₁ = p, x₂ = u, x₃ = j, and x₄ = a, with L = {p, u, j, a} and L* = {ε, p, u, j, a, …}).

Therefore, we can restate the claim as: “for any k ≥ 1, if w = x₁…x_k, where all x_i ∈ L*, then w ∈ L*”.

Let us prove the restated claim by induction on k.

Base case: when k = 1, w = x₁, where x₁ ∈ L*, so w ∈ L*.

Induction hypothesis: Assume that the claim is true for k = n, i.e., if w = x₁…x_n, where all x_i ∈ L*, then w ∈ L*.

Induction step:

We show that the claim is true for k = n+1. Let w = x₁…x_n x_(n+1), where all x_i ∈ L*.

By the definition of concatenation, w = y x_(n+1), where y = x₁…x_n.

Recall the closure by concatenation property: if x ∈ L* and y ∈ L*, then xy ∈ L*.

By the induction hypothesis, y ∈ L*; moreover, x_(n+1) ∈ L*.

Therefore, we conclude that w = y x_(n+1) ∈ L*.

Hence proved.

Theorem 2: Prove that L*•L* = L*.

Proof: Clearly L* ⊆ L*•L*, since any string w ∈ L* can be written as w = w•ε, with ε ∈ L*. Conversely, let w ∈ L*•L*. We can express w as w₁•w₂, where w₁, w₂ ∈ L*. But if w₁ and w₂ are in L*, then so is their concatenation (closure by concatenation); this implies that w ∈ L*, and hence L*•L* ⊆ L*.

It follows that L*•L* = L*.

Hence proved.

Theorem 3: Prove that ∀ i ≥ 1, (L*)ⁱ = L*.

Proof: We prove it by induction on i.

Base case: We have (L*)¹ = L*.

Induction hypothesis: Assume that (L*)^k = L* for some k ≥ 1.

Induction step: We show that the claim is true for k+1, i.e., (L*)^(k+1) = L*.

We have,

(L*)^(k+1) = (L*)^k • L* - - - - (1)

Using the induction hypothesis, we can write equality (1) as:

(L*)^(k+1) = L*•L* - - - - (2)

According to Theorem 2, we know that L*•L* = L*.

Thus, we can write equality (2) as:

(L*)^(k+1) = L*

That is, the claim is true for k+1. Thus, we conclude that:

∀ i ≥ 1, (L*)ⁱ = L*.

Hence proved.
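Theorems 2 and 3 can be spot-checked on a sample language by restricting every set to strings of bounded length. The Python sketch below is a finite sanity check under that assumed bound, not a substitute for the proofs; the helper names are illustrative.

```python
def star_up_to(language, max_len):
    """Strings of L* with length <= max_len ("" models ε)."""
    result, frontier = {""}, {""}
    while frontier:
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


def concat_up_to(l1, l2, max_len):
    """Strings of L1•L2 with length <= max_len."""
    return {x + y for x in l1 for y in l2 if len(x + y) <= max_len}


def power_up_to(language, i, max_len):
    """Strings of L^i with length <= max_len."""
    result = {""}
    for _ in range(i):
        result = concat_up_to(result, language, max_len)
    return result


L = {"ab", "ba", "b"}
n = 6
star = star_up_to(L, n)

print(concat_up_to(star, star, n) == star)                        # Theorem 2
print(all(power_up_to(star, i, n) == star for i in range(1, 5)))  # Theorem 3
```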

Theorem 4: Using the definition of star and the above theorem, prove that (L*)* = L*.

Proof: By the definition of star, we have:

(L*)* = ⋃ (L*)ⁱ, for i = 0 … ∞

⇒ (L*)* = {ε} ∪ ⋃ L*, for i = 1 … ∞ (since (L*)⁰ = {ε}, and (L*)ⁱ = L* for i ≥ 1 by Theorem 3)

⇒ (L*)* = L* (since ε ∈ L* by the definition of star)

Hence proved.

Theorem 5: Let A be any set of strings. Prove that A* = A⁺ if and only if ε ∈ A.

Proof: For any set A, we know that A⁰ = {ε} and Aⁿ = {w | w = w₁w₂w₃…w_n, w_i ∈ A for each i = 1, 2, …, n}, for n ≥ 1. Also, A¹ = A.

· First, we prove that: if ε ∈ A, then A* = A⁺.

Let ε ∈ A.

Thus, A¹ = A = {ε} ∪ A = A⁰ ∪ A¹. That is, A¹ = A⁰ ∪ A¹.

Therefore,

A⁺ = A¹ ∪ A² ∪ A³ ∪ …

⇒ A⁺ = A⁰ ∪ A¹ ∪ A² ∪ A³ ∪ … (because A¹ = A⁰ ∪ A¹).

⇒ A⁺ = A* (because A* = A⁰ ∪ A¹ ∪ A² ∪ A³ ∪ …).

Therefore, we have shown that: if ε ∈ A, then A* = A⁺.

· Next, we prove that: if A* = A⁺, then ε ∈ A. This is equivalent to showing the contrapositive: if ε ∉ A, then A* ≠ A⁺.

Let ε ∉ A.

Thus, we get A¹ ≠ A⁰ ∪ A¹.

Also, if ε ∉ A, then ε ∉ Aⁿ for any n ≥ 1, because we can never concatenate non-empty strings together to obtain the empty string.

Therefore, A* = A⁰ ∪ A¹ ∪ A² ∪ … cannot be equal to A⁺ = A¹ ∪ A² ∪ A³ ∪ …, since ε ∈ A* but ε ∉ A⁺. Therefore, we have shown that: if A* = A⁺, then ε ∈ A.

Hence proved.
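The equivalence in Theorem 5 can be observed on a few sample sets with a length-bounded Python sketch. The bound, the sample sets, and the helper names are assumptions made for illustration; "" models the empty string.

```python
def star_up_to(language, max_len):
    """Strings of A* with length <= max_len."""
    result, frontier = {""}, {""}
    while frontier:
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


def plus_up_to(language, max_len):
    """Strings of A+ = A•A* with length <= max_len."""
    star = star_up_to(language, max_len)
    return {x + y for x in language for y in star if len(x + y) <= max_len}


def star_equals_plus(language, max_len):
    return star_up_to(language, max_len) == plus_up_to(language, max_len)


samples = [{"", "ab"}, {"ab"}, {"a", "bb", ""}, {"a", "bb"}, set()]
for A in samples:
    # Second column: is ε in A?  Third column: do A* and A+ agree?
    print(sorted(A), "" in A, star_equals_plus(A, 6))
```

In every row the last two columns agree, which is what the theorem predicts.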

Example 1: Consider the following two languages over the alphabet Σ = {a, b}:

A = {ab, a, ε, abb}

B = {ε, bb, b}

(a) List all the strings in the language A•B of length 3.

(b) List all the strings in the language A* of length 3.

(c) List all the strings in the language A − B.

Solution: (a) {abb}.

(b) {aaa, aba, aab, abb}.

(c) {ab, a, abb}.
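The three parts of this example can be recomputed with Python sets. This is a sketch in which "" stands for the empty string and the helper names are hypothetical.

```python
def concat(l1, l2):
    return {x + y for x in l1 for y in l2}


def star_up_to(language, max_len):
    """Strings of L* with length <= max_len."""
    result, frontier = {""}, {""}
    while frontier:
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


A = {"ab", "a", "", "abb"}
B = {"", "bb", "b"}

part_a = {w for w in concat(A, B) if len(w) == 3}      # strings of A•B of length 3
part_b = {w for w in star_up_to(A, 3) if len(w) == 3}  # strings of A* of length 3
part_c = A - B                                         # set difference

print(sorted(part_a), sorted(part_b), sorted(part_c))
```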

Example 2: For each of the following 5 languages, give the two shortest strings that are in (∈) the language and the two shortest strings that are not in (∉) the language.

(1) ab + aab (2) b*(ab)*a* (3) (a*+b*)(a*+b*)(a*+b*)

(4) a*(baa*)*b* (5) b*(a+ba)*b*

Solution:

	(1)	(2)	(3)	(4)	(5)
	ab + aab	b*(ab)*a*	(a*+b*)(a*+b*)(a*+b*)	a*(baa*)*b*	b*(a+ba)*b*
∈	ab, aab	ε, a	ε, a	ε, a	ε, a
∉	ε, a	aab, abb	abab, baba	bba, abba	abba, aabba
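The answers in the table can be cross-checked with Python's re module. In this sketch the union written + in the expressions above is spelled | in Python syntax, strings over {a, b} are enumerated shortest first, and the names patterns, accepts, and shortest are illustrative assumptions.

```python
import re
from itertools import product

# The five expressions, with "+" (union) written as "|" for Python's re module.
patterns = {
    1: "ab|aab",
    2: "b*(ab)*a*",
    3: "(a*|b*)(a*|b*)(a*|b*)",
    4: "a*(baa*)*b*",
    5: "b*(a|ba)*b*",
}


def accepts(pattern, string):
    """True if the whole string belongs to the language of the pattern."""
    return re.fullmatch(pattern, string) is not None


def shortest(pattern, member, count=2, max_len=6):
    """First `count` strings over {a, b}, shortest first, that are
    in the language (member=True) or not in it (member=False)."""
    found = []
    for n in range(max_len + 1):
        for letters in product("ab", repeat=n):
            s = "".join(letters)
            if accepts(pattern, s) == member:
                found.append(s)
                if len(found) == count:
                    return found
    return found


for number, pattern in patterns.items():
    print(number, shortest(pattern, True), shortest(pattern, False))
```

Where several strings of the same length qualify, the enumeration order decides which ones are reported, so other equally short answers may also be valid.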

Example 3: Consider the following two languages over the alphabet

Σ = {0, 1};

L₁ = {0ⁿ : n ≥ 1};

L₂ = {01ⁿ : n ≥ 0}.

Describe the languages below.

(a) L₃ = L₁* (b) L₄ = L₁⁺ (c) L₅ = ¬L₁ (d) L₆ = L₂*

(e) L₇ = L₁ ∩ L₂ (f) L₈ = L₁L₂

Solution: (a) L₃ = L₁* = {0ⁿ : n ≥ 0} = the set containing the empty string and all strings that have no 1’s.

(b) L₄ = L₁⁺ = {0ⁿ : n ≥ 1} = the set of all non-empty strings that have no 1’s.

(c) L₅ = ¬L₁ = {ε} ∪ {w : w includes at least one 1} = the set containing the empty string and all strings that have at least one 1.

(d) L₆ = L₂* = {ε} ∪ {0w : w is any string over Σ} = the set containing the empty string and all strings that begin with 0.

(e) L₇ = L₁ ∩ L₂ = {0}.

(f) L₈ = L₁L₂ = {0^m 1ⁿ : m ≥ 2 and n ≥ 0}.
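Each description in the solution can be compared against a direct computation on strings of bounded length. The Python sketch below assumes a bound n = 5 and reads part (c) as the complement of L₁; both are assumptions of this sketch, and the helper names are hypothetical.

```python
from itertools import product


def star_up_to(language, max_len):
    """Strings of L* with length <= max_len ("" models ε)."""
    result, frontier = {""}, {""}
    while frontier:
        frontier = {x + y for x in frontier for y in language
                    if len(x + y) <= max_len} - result
        result |= frontier
    return result


def concat_up_to(l1, l2, max_len):
    return {x + y for x in l1 for y in l2 if len(x + y) <= max_len}


n = 5
universe = {"".join(p) for k in range(n + 1) for p in product("01", repeat=k)}
L1 = {"0" * k for k in range(1, n + 1)}  # 0^k with k >= 1, up to length n
L2 = {"0" + "1" * k for k in range(n)}   # 0 followed by k ones, up to length n

checks = {
    "a": star_up_to(L1, n) == {w for w in universe if "1" not in w},
    "b": concat_up_to(L1, star_up_to(L1, n), n)
         == {w for w in universe if w and "1" not in w},
    "c": universe - L1 == {w for w in universe if w == "" or "1" in w},
    "d": star_up_to(L2, n)
         == {w for w in universe if w == "" or w.startswith("0")},
    "e": L1 & L2 == {"0"},
    "f": concat_up_to(L1, L2, n)
         == {"0" * m + "1" * k
             for m in range(2, n + 1) for k in range(n - m + 1)},
}
print(checks)
```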

---------------------------------------------------------------------------------------------------------------------

For further detailed study, refer to the book:

Automata Theory: A Step-by-Step Approach, by Manish Kumar Jha, published by S. Chand Publication, New Delhi.


Thursday, 24 December 2015

Language Operations, Reflection (Image) of Language, Concatenation of Language, Set Operations, Star, Properties of Star, Cross, Special Cases, Related Theorems with its Proofs, and Related Examples with its Solutions.
