A small collection of Ruby and JavaScript classes implementing often needed functionality for dealing with SMS messages.
The gem can also be used in a Rails application as an engine. It integrates with the asset pipeline and gives you access to some client-side SMS manipulation functionality.
The following features are available on both the server side and the client side:
- Detection of the most optimal encoding for sending an SMS message (GSM 7-bit or Unicode).
- Correctly determining a message's length in the most optimal encoding.
- Concatenation detection and concatenated message parts counting.
The following can be accomplished only on the server with Ruby:
- Converting a UTF-8 string to a GSM 7-bit encoding and vice versa.
- Detecting if a UTF-8 string can be safely represented in a GSM 7-bit encoding.
- Detection of double-byte chars in the GSM 7-bit encoding.
And possibly more.
All references to the "GSM" encoding or the "GSM 7-bit alphabet" in this text actually refer to the GSM 03.38 spec and its latest version, as defined by the Unicode consortium.
This encoding is the most widely used one when sending SMS messages.
The GSM 03.38 encoding is used by default. This standard defines a set of symbols which can be encoded in 7-bits each, thus allowing up to 160 symbols per SMS message (each SMS message can contain up to 140 bytes of data).
This standard covers most of the ASCII table, but also includes some non-ASCII
symbols such as æ
, ø
and å
. If you use these in your messages, you can
still send them as GSM encoded, having a 160-symbol limit. This is technically
correct.
In reality, however, some SMS routes have problems delivering messages which contain such non-ASCII symbols in the GSM encoding. The special symbols might be omitted, or the message might not arrive at all.
Thus, it might be safer to just send messages in Unicode if the message's text contains any non-ASCII symbols. This is not the default as it reduces the max symbols count to 70 per message, instead of 160, and you might not have any issues with GSM-encoded messages. In case you do, however, you can turn off support for the GSM encoding and just treat messages as Unicode if they contain non-ASCII symbols.
In case you decide to do so, you have to specify it in both the Ruby and the JavaScript part of the library, like so:
SmsTools.use_gsm_encoding = false
//= require sms_tools
SmsTools.use_gsm_encoding = false;
Add this line to your application's Gemfile:
gem 'smstools'
And then execute:
$ bundle
Or install it yourself as:
$ gem install smstools
If you're using the gem in Rails, you may also want to add the following to your application.js
manifest file to gain access to the client-side features:
//= require sms_tools
The gem consists of both server-side (Ruby) and client-side classes. You can use either.
First make sure you have installed the gem and have required the appropriate files.
The SmsTools::EncodingDetection
class provides you with a few simple methods to detect the most
optimal encoding for sending an SMS message, to correctly caclulate its length in that encoding and
to see if the text would need to be concatenated or will fit in a single message.
Here is an example with a non-concatenated message which is best encoded in the GSM 7-bit alphabet:
sms_text = 'Text in GSM 03.38: ÄäøÆ with a double-byte char: ~ '
sms_encoding = SmsTools::EncodingDetection.new sms_text
sms_encoding.gsm? # => true
sms_encoding.unicode? # => false
sms_encoding.length # => 52 (because of the double-byte char)
sms_encoding.concatenated? # => false
sms_encoding.concatenated_parts # => 1
sms_encoding.encoding # => :gsm
Here's another example with a concatenated Unicode message:
sms_text = 'Я' * 90
sms_encoding = SmsTools::EncodingDetection.new sms_text
sms_encoding.gsm? # => false
sms_encoding.unicode? # => true
sms_encoding.length # => 90
sms_encoding.concatenated? # => true
sms_encoding.concatenated_parts # => 2
sms_encoding.encoding # => :unicode
You can check the specs for this class for more examples.
The SmsTools::GsmEncoding
class can be used to check if a given UTF-8 string can be fully
represented in the GSM 03.38 encoding as well as to convert from UTF-8 to GSM 03.38 and vice-versa.
The main API this class provides is the following:
SmsTools::GsmEncoding.valid? message_text_in_utf8 # => true or false
SmsTools::GsmEncoding.from_utf8 utf8_encoded_string # => a GSM 03.38 encoded string
SmsTools::GsmEncoding.to_utf8 gsm_encoded_string # => an UTF-8 encoded string
Check out the source code of the class to find out more.
If you're using the gem in Rails 3.1 or newer, you can gain access to the SmsTools.Message
class.
It's interface is similar to the one of SmsTools::EncodingDetection
. Here is an example in
CoffeeScript:
message = new SmsTools.Message 'The text of the message: ~'
message.encoding # => 'gsm'
message.length # => 27
message.concatenatedPartsCount # => 1
You can also check how long can this message be in the current most optimal encoding, if we want to limit the number of concatenated messages we will allow to be sent:
maxConcatenatedPartsCount = 2
message.maxLengthFor(maxConcatenatedPartsCount) # => 306
This allows you to have a dynamic instead of a fixed length limit, for when you use a non-GSM 03.38 symbol in your text, your message length limit decreases significantly.
Note that to use this client-side code, a Rails application with an active asset pipeline is assumed. It might be possible to use it in other setups as well, but you're on your own there.
- Fork the repo
- Create your feature branch (
git checkout -b my-new-feature
) - Make your changes and provide tests for them
- Make sure all tests pass (run them with
bundle exec rake test
) - Commit your changes (
git commit -am 'Add some feature'
) - Push to the branch (
git push origin my-new-feature
) - Send a pull request.