Skip to main content

Text Encoding and Decoding using .Net



Namespace: System.Text
To read a file, an application must write information about the encoding in which the file was written. So, the application that creates a file must write encoding information, otherwise viewer will see block and question marks in file.

Unicode UTF-32 encoding:
Represents a character as sequence of four bytes (32 bits). UTF32Encoding class can be used in that perspective.

Unicode UTF-16 encoding:
Represents a character in 16 bits. UnicodeEncoding class can be used to convert to and from UTF-16 encoding.

Unicode UTF-8 encoding:
Unicode UTF-8 uses 8-bit, 16-bit, 24-bit, and up to 48-bit encoding. Values 0 through 127 use 8-bit encoding and exactly match ASCII values, 128 through 2047 use 16-bit encoding, and values from 2048 and 65535 uses 24-bit encoding. UTF8Encoding class can be used to convert to and from UTF-8 encoding.

ASCII encoding:
ASCII encoding encodes Latin alphabets as single 7-Bit ASCII character. It is not well suited for internationalized applications. ASCIIEncoding class can be used to convert to and from ASCII encoding.

ANSI/ISO Encoding:
System.Text.Encoding provides support for a wide range of ANSI/ISO encodings.

Using Encoding class:
GetEncoding will return the specified encoding then GetByte or GetChar methods can be used.

Encoding e = Encoding.GetEncoding(“Korean”);

Byte[] encoded;

encoded = e.GetBytes(“Hello word!”);

for(int i=0;i<encoded.length;i++)
{
                Console.WriteLine(“Byte {0}:{1}”, i,encoded[i]);
}

Supported Encoding Types:
To get a list of supported Encoding one can iterate the EncodingInfo struct returned by Encoding.GetEncodings().

EncodingInfo[] eis = Encoding.GetEncodings();

foreach( EncodingInfo ei in eis)
                Console.WriteLine(“{0}:{1} , {2}” , ei.CodePage, ei.Name, ei.DisplayName);

To Specify Encoding type when writing Files:

StreamWrite sw = new StreamWrite(“SampleUTF.txt”, false, Encoding.UTF32);
Sw.WriteLine(“Hello Word!”);
Sw.close();

To Specify Encoding When Reading Files:

StreamReader sr = new StreamReader(“SampleUTF.txt”, false, Encoding.UTF32);
Console.WriteLine(“Output is : {0}”, sr.ReadLine());
Sr.Close();

Comments

Popular posts from this blog

Culture Information and Localization in .NET

Namespace: System.Globalization CultureInfo Class:                 It provides information like the Format of numbers and dates, Culture’s Calendar, Culture’s language and sublanguage (if applicable), Country and region of the culture. The Basic use of CultureInfo class is shown here: • How string Comparisons are performed • How Number Comparison & Formats are performed • Date Comparison and Formats. • How resources are retrieved and used. Cultures are grouped into three categories: Invariant Culture : It’s Culture Insensitive. It can be used to build some trial application. It can be also used to build an application with hard-coded expiry date that ignores cultures. But using it for every comparison will be incorrect and inappropriate. Neutral Culture : English(en), Frensh(fr), and Spanish(sp). A neutral culture is related to language but it’s not related to specific regi...

Concept of App Domain in .Net

Creating Application Domains: Application domain is just like process, provides separate memory space, and isolates from other code. But it’s quite light weight. It also provides the following advantages: 1-       Reliability : If a domain crashes, it can be unloaded. Hence doesn’t affect the other assemblies. 2-       Efficiency : Loading all assemblies in one domain can be cumbersome and can make the process heavy but Appdomains are efficient in this manner. Important properties of AppDomain: ApplicationIdentity , ApplicationTrust , BaseDirectory , CurrentDomain , DomainManager , DomainDirectory , Evidence , FriendlyName , ID , RelativeSearchPath , SetupInformation , ShadowCopyFiles . Important methods of AppDomain: ApplyPolicy , CreateCOMInstanceFrom , CreateDomain , CreateInstance (Assembly). To create an AppDomain: AppDomain adomain = AppDomain.CreateDomain(“D”); To execute an assembly:...

ASP.NET Working With Data-Bound Web Server Controls

Suppose we have: List<Car> vCars = new List<Car>(); There are three types of databound controls: Simple databound controls(List, AdRotater), Composite data bound controls(GridView, DetailsView, FormView that inherit from CompositeDataBoundControl), and Hierarchal data bound controls (TreeView, Menu).   DataBoundControl has a DataBind method that can be used when data is ready. It calls DataBind for child controls as well. Page.DataBind() will call DataBind for all child controls. Using DataSource Objects:                                       BaseDataBound control exposes DataSource property that accepts objects that implement IEnumerable , IListSource , IDataSource , or IHierarchalDataSource . DataSourceID accepts ID of SqlDataSource . If both specified Data...