Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

A

...

diacritic

...

also

...

called

...

a

...

diacritic

...

or

...

diacritical

...

mark

...

,

...

point

...

,

...

or

...

sign

...

,

...

is

...

a

...

small

...

sign

...

added

...

to

...

a

...

letter

...

to

...

alter

...

pronunciation

...

or

...

to

...

distinguish

...

between

...

similar

...

words.

...

In

...

codiing

...

we

...

often

...

come

...

across

...

them

...

while

...

reading

...

input

...

from

...

webpages.

...

There

...

might

...

be

...

some

...

API's

...

available

...

to

...

handle

...

the

...

task,

...

but

...

we

...

are

...

choosing

...

hard

...

way

...

to

...

do

...

that

...

to

...

save

...

time

...

to

...

explore

...

them

...

and

...

then

...

test

...

them.

...

The

...

easiest

...

part

...

is

...

to

...

find

...

them

...

and

...

replace

...

them

...

with

...

the

...

words

...

that

...

sounds

...

similar

...

to

...

them.

...

We

...

need

...

to

...

perform

...

the

...

following

...

steps

...

in

...

order

...

to

...

find

...

and

...

replace

...

them

...

with

...

similar

...

sounding

...

words.

...

 

  1. Download the page which contains diacritics eg: http://geoiptool.com/en/?IP=192.42.43.22

...

  1. contains

...

  1. Neuchatel

...

  1. with

...

  1. a

...

  1. as

...

  1. a

...

  1. diacritic.

...

  1.  
  2. From a UNIX machine get the dump of the page and grep the word so that you get hex dump of the alphabet required eg:
    Code Block
    
     xxd index.html?IP=192.42.43.22 

...

  1. | grep Neuch
    

...

    1. The

...

    1. output

...

    1. would

...

    1. be

...

    1. something

...

    1. like

...

    1. :
      Code Block
      
      

...

    1. 0001d00: 6c64 223e 4e65 7563 68e2 7465 6c3c 

...

    1. 2f74  ld">Neuch?tel</t
      

...

    1. Every

...

    1. two

...

    1. char

...

    1. at

...

    1. left

...

    1. represent

...

    1. one

...

    1. char

...

    1. at

...

    1. right

...

    1. and

...

    1. its

...

    1. starts

...

    1. after

...

    1. the

...

    1. colan":"

...

    1. eg

...

    1. 6c

...

    1. represents

...

    1. I

...

    1. and

...

    1. 64

...

    1. represents

...

    1. d

...

  1. Count

...

  1. the

...

  1. char

...

  1. to

...

  1. find

...

  1. the

...

  1. missing

...

  1. alphabet

...

  1. which

...

  1. "e2"

...

  1. in

...

  1. our

...

  1. case.

...

  1. Replace

...

  1. the

...

  1. alphabet

...

  1. using

...

  1. the

...

  1. pattern

...

  1. matching

...

  1. for

...

  1. hex

...

  1. by

...

  1. \x

...

  1. Code Block
    
    if($city =

...

  1. ~ m/\xe2/){
    

...

  1.       $city  =\~ s/\xe2/a/g;
    

...

  1.  }
    

...