protein-translation
This commit is contained in:
parent
15f39c5c5b
commit
b5a8d35513
|
@ -0,0 +1,26 @@
|
||||||
|
{
|
||||||
|
"authors": [
|
||||||
|
"DoggettCK"
|
||||||
|
],
|
||||||
|
"contributors": [
|
||||||
|
"angelikatyborska",
|
||||||
|
"Cohen-Carlisle",
|
||||||
|
"devonestes",
|
||||||
|
"neenjaw",
|
||||||
|
"nwshane",
|
||||||
|
"sotojuan"
|
||||||
|
],
|
||||||
|
"files": {
|
||||||
|
"solution": [
|
||||||
|
"lib/protein_translation.ex"
|
||||||
|
],
|
||||||
|
"test": [
|
||||||
|
"test/protein_translation_test.exs"
|
||||||
|
],
|
||||||
|
"example": [
|
||||||
|
".meta/example.ex"
|
||||||
|
]
|
||||||
|
},
|
||||||
|
"blurb": "Translate RNA sequences into proteins.",
|
||||||
|
"source": "Tyler Long"
|
||||||
|
}
|
|
@ -0,0 +1 @@
|
||||||
|
{"track":"elixir","exercise":"protein-translation","id":"b19fcedc09cb4e6ab1ed560eaeb5cdec","url":"https://exercism.org/tracks/elixir/exercises/protein-translation","handle":"negrienko","is_requester":true,"auto_approve":false}
|
|
@ -0,0 +1,4 @@
|
||||||
|
# Used by "mix format"
|
||||||
|
[
|
||||||
|
inputs: ["{mix,.formatter}.exs", "{config,lib,test}/**/*.{ex,exs}"]
|
||||||
|
]
|
|
@ -0,0 +1,24 @@
|
||||||
|
# The directory Mix will write compiled artifacts to.
|
||||||
|
/_build/
|
||||||
|
|
||||||
|
# If you run "mix test --cover", coverage assets end up here.
|
||||||
|
/cover/
|
||||||
|
|
||||||
|
# The directory Mix downloads your dependencies sources to.
|
||||||
|
/deps/
|
||||||
|
|
||||||
|
# Where third-party dependencies like ExDoc output generated docs.
|
||||||
|
/doc/
|
||||||
|
|
||||||
|
# Ignore .fetch files in case you like to edit your project deps locally.
|
||||||
|
/.fetch
|
||||||
|
|
||||||
|
# If the VM crashes, it generates a dump, let's ignore it too.
|
||||||
|
erl_crash.dump
|
||||||
|
|
||||||
|
# Also ignore archive artifacts (built via "mix archive.build").
|
||||||
|
*.ez
|
||||||
|
|
||||||
|
# Ignore package tarball (built via "mix hex.build").
|
||||||
|
protein_translation-*.tar
|
||||||
|
|
|
@ -0,0 +1,75 @@
|
||||||
|
# Help
|
||||||
|
|
||||||
|
## Running the tests
|
||||||
|
|
||||||
|
From the terminal, change to the base directory of the exercise then execute the tests with:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
$ mix test
|
||||||
|
```
|
||||||
|
|
||||||
|
This will execute the test file found in the `test` subfolder -- a file ending in `_test.exs`
|
||||||
|
|
||||||
|
Documentation:
|
||||||
|
|
||||||
|
* [`mix test` - Elixir's test execution tool](https://hexdocs.pm/mix/Mix.Tasks.Test.html)
|
||||||
|
* [`ExUnit` - Elixir's unit test library](https://hexdocs.pm/ex_unit/ExUnit.html)
|
||||||
|
|
||||||
|
## Pending tests
|
||||||
|
|
||||||
|
In test suites of practice exercises, all but the first test have been tagged to be skipped.
|
||||||
|
|
||||||
|
Once you get a test passing, you can unskip the next one by commenting out the relevant `@tag :pending` with a `#` symbol.
|
||||||
|
|
||||||
|
For example:
|
||||||
|
|
||||||
|
```elixir
|
||||||
|
# @tag :pending
|
||||||
|
test "shouting" do
|
||||||
|
assert Bob.hey("WATCH OUT!") == "Whoa, chill out!"
|
||||||
|
end
|
||||||
|
```
|
||||||
|
|
||||||
|
If you wish to run all tests at once, you can include all skipped test by using the `--include` flag on the `mix test` command:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
$ mix test --include pending
|
||||||
|
```
|
||||||
|
|
||||||
|
Or, you can enable all the tests by commenting out the `ExUnit.configure` line in the file `test/test_helper.exs`.
|
||||||
|
|
||||||
|
```elixir
|
||||||
|
# ExUnit.configure(exclude: :pending, trace: true)
|
||||||
|
```
|
||||||
|
|
||||||
|
## Useful `mix test` options
|
||||||
|
|
||||||
|
* `test/<FILE>.exs:LINENUM` - runs only a single test, the test from `<FILE>.exs` whose definition is on line `LINENUM`
|
||||||
|
* `--failed` - runs only tests that failed the last time they ran
|
||||||
|
* `--max-failures` - the suite stops evaluating tests when this number of test failures
|
||||||
|
is reached
|
||||||
|
* `--seed 0` - disables randomization so the tests in a single file will always be ran
|
||||||
|
in the same order they were defined in
|
||||||
|
|
||||||
|
## Submitting your solution
|
||||||
|
|
||||||
|
You can submit your solution using the `exercism submit lib/protein_translation.ex` command.
|
||||||
|
This command will upload your solution to the Exercism website and print the solution page's URL.
|
||||||
|
|
||||||
|
It's possible to submit an incomplete solution which allows you to:
|
||||||
|
|
||||||
|
- See how others have completed the exercise
|
||||||
|
- Request help from a mentor
|
||||||
|
|
||||||
|
## Need to get help?
|
||||||
|
|
||||||
|
If you'd like help solving the exercise, check the following pages:
|
||||||
|
|
||||||
|
- The [Elixir track's documentation](https://exercism.org/docs/tracks/elixir)
|
||||||
|
- The [Elixir track's programming category on the forum](https://forum.exercism.org/c/programming/elixir)
|
||||||
|
- [Exercism's programming category on the forum](https://forum.exercism.org/c/programming/5)
|
||||||
|
- The [Frequently Asked Questions](https://exercism.org/docs/using/faqs)
|
||||||
|
|
||||||
|
Should those resources not suffice, you could submit your (incomplete) solution to request mentoring.
|
||||||
|
|
||||||
|
If you're stuck on something, it may help to look at some of the [available resources](https://exercism.org/docs/tracks/elixir/resources) out there where answers might be found.
|
|
@ -0,0 +1,69 @@
|
||||||
|
# Protein Translation
|
||||||
|
|
||||||
|
Welcome to Protein Translation on Exercism's Elixir Track.
|
||||||
|
If you need help running the tests or submitting your code, check out `HELP.md`.
|
||||||
|
|
||||||
|
## Instructions
|
||||||
|
|
||||||
|
Translate RNA sequences into proteins.
|
||||||
|
|
||||||
|
RNA can be broken into three nucleotide sequences called codons, and then translated to a polypeptide like so:
|
||||||
|
|
||||||
|
RNA: `"AUGUUUUCU"` => translates to
|
||||||
|
|
||||||
|
Codons: `"AUG", "UUU", "UCU"`
|
||||||
|
=> which become a polypeptide with the following sequence =>
|
||||||
|
|
||||||
|
Protein: `"Methionine", "Phenylalanine", "Serine"`
|
||||||
|
|
||||||
|
There are 64 codons which in turn correspond to 20 amino acids; however, all of the codon sequences and resulting amino acids are not important in this exercise.
|
||||||
|
If it works for one codon, the program should work for all of them.
|
||||||
|
However, feel free to expand the list in the test suite to include them all.
|
||||||
|
|
||||||
|
There are also three terminating codons (also known as 'STOP' codons); if any of these codons are encountered (by the ribosome), all translation ends and the protein is terminated.
|
||||||
|
|
||||||
|
All subsequent codons after are ignored, like this:
|
||||||
|
|
||||||
|
RNA: `"AUGUUUUCUUAAAUG"` =>
|
||||||
|
|
||||||
|
Codons: `"AUG", "UUU", "UCU", "UAA", "AUG"` =>
|
||||||
|
|
||||||
|
Protein: `"Methionine", "Phenylalanine", "Serine"`
|
||||||
|
|
||||||
|
Note the stop codon `"UAA"` terminates the translation and the final methionine is not translated into the protein sequence.
|
||||||
|
|
||||||
|
Below are the codons and resulting Amino Acids needed for the exercise.
|
||||||
|
|
||||||
|
| Codon | Protein |
|
||||||
|
| :----------------- | :------------ |
|
||||||
|
| AUG | Methionine |
|
||||||
|
| UUU, UUC | Phenylalanine |
|
||||||
|
| UUA, UUG | Leucine |
|
||||||
|
| UCU, UCC, UCA, UCG | Serine |
|
||||||
|
| UAU, UAC | Tyrosine |
|
||||||
|
| UGU, UGC | Cysteine |
|
||||||
|
| UGG | Tryptophan |
|
||||||
|
| UAA, UAG, UGA | STOP |
|
||||||
|
|
||||||
|
Learn more about [protein translation on Wikipedia][protein-translation].
|
||||||
|
|
||||||
|
[protein-translation]: https://en.wikipedia.org/wiki/Translation_(biology)
|
||||||
|
|
||||||
|
## Source
|
||||||
|
|
||||||
|
### Created by
|
||||||
|
|
||||||
|
- @DoggettCK
|
||||||
|
|
||||||
|
### Contributed to by
|
||||||
|
|
||||||
|
- @angelikatyborska
|
||||||
|
- @Cohen-Carlisle
|
||||||
|
- @devonestes
|
||||||
|
- @neenjaw
|
||||||
|
- @nwshane
|
||||||
|
- @sotojuan
|
||||||
|
|
||||||
|
### Based on
|
||||||
|
|
||||||
|
Tyler Long
|
|
@ -0,0 +1,68 @@
|
||||||
|
defmodule ProteinTranslation do
|
||||||
|
@doc """
|
||||||
|
Given an RNA string, return a list of proteins specified by codons, in order.
|
||||||
|
"""
|
||||||
|
@spec of_rna(String.t()) :: {:ok, list(String.t())} | {:error, String.t()}
|
||||||
|
def of_rna(""), do: {:ok, []}
|
||||||
|
|
||||||
|
def of_rna(rna) do
|
||||||
|
rna
|
||||||
|
|> String.graphemes()
|
||||||
|
|> Enum.chunk_every(3)
|
||||||
|
|> Enum.map(&Enum.join/1)
|
||||||
|
|> Enum.reduce_while([], &reducer/2)
|
||||||
|
|> case do
|
||||||
|
proteines when is_list(proteines) -> {:ok, Enum.reverse(proteines)}
|
||||||
|
{:error, error} -> {:error, error}
|
||||||
|
end
|
||||||
|
end
|
||||||
|
|
||||||
|
defp reducer(codon, proteines) do
|
||||||
|
case of_codon(codon) do
|
||||||
|
{:error, _error} -> {:halt, {:error, "invalid RNA"}}
|
||||||
|
{:ok, "STOP"} -> {:halt, proteines}
|
||||||
|
{:ok, proteine} -> {:cont, [proteine | proteines]}
|
||||||
|
end
|
||||||
|
end
|
||||||
|
|
||||||
|
@doc """
|
||||||
|
Given a codon, return the corresponding protein
|
||||||
|
|
||||||
|
UGU -> Cysteine
|
||||||
|
UGC -> Cysteine
|
||||||
|
UUA -> Leucine
|
||||||
|
UUG -> Leucine
|
||||||
|
AUG -> Methionine
|
||||||
|
UUU -> Phenylalanine
|
||||||
|
UUC -> Phenylalanine
|
||||||
|
UCU -> Serine
|
||||||
|
UCC -> Serine
|
||||||
|
UCA -> Serine
|
||||||
|
UCG -> Serine
|
||||||
|
UGG -> Tryptophan
|
||||||
|
UAU -> Tyrosine
|
||||||
|
UAC -> Tyrosine
|
||||||
|
UAA -> STOP
|
||||||
|
UAG -> STOP
|
||||||
|
UGA -> STOP
|
||||||
|
"""
|
||||||
|
@cysteines ~w(UGU UGC)
|
||||||
|
@leucines ~w(UUA UUG)
|
||||||
|
@tyrosines ~w(UAU UAC)
|
||||||
|
@phenylalanines ~w(UUU UUC)
|
||||||
|
@serines ~w(UCU UCC UCA UCG)
|
||||||
|
@methionines ~w(AUG)
|
||||||
|
@tryptophans ~w(UGG)
|
||||||
|
@stops ~w(UAA UAG UGA)
|
||||||
|
|
||||||
|
@spec of_codon(String.t()) :: {:ok, String.t()} | {:error, String.t()}
|
||||||
|
def of_codon(cysteine) when cysteine in @cysteines, do: {:ok, "Cysteine"}
|
||||||
|
def of_codon(leucine) when leucine in @leucines, do: {:ok, "Leucine"}
|
||||||
|
def of_codon(tyrosine) when tyrosine in @tyrosines, do: {:ok, "Tyrosine"}
|
||||||
|
def of_codon(phenylalanine) when phenylalanine in @phenylalanines, do: {:ok, "Phenylalanine"}
|
||||||
|
def of_codon(serine) when serine in @serines, do: {:ok, "Serine"}
|
||||||
|
def of_codon(methionine) when methionine in @methionines, do: {:ok, "Methionine"}
|
||||||
|
def of_codon(tryptophan) when tryptophan in @tryptophans, do: {:ok, "Tryptophan"}
|
||||||
|
def of_codon(stop) when stop in @stops, do: {:ok, "STOP"}
|
||||||
|
def of_codon(_invalid), do: {:error, "invalid codon"}
|
||||||
|
end
|
|
@ -0,0 +1,28 @@
|
||||||
|
defmodule ProteinTranslation.MixProject do
|
||||||
|
use Mix.Project
|
||||||
|
|
||||||
|
def project do
|
||||||
|
[
|
||||||
|
app: :protein_translation,
|
||||||
|
version: "0.1.0",
|
||||||
|
# elixir: "~> 1.8",
|
||||||
|
start_permanent: Mix.env() == :prod,
|
||||||
|
deps: deps()
|
||||||
|
]
|
||||||
|
end
|
||||||
|
|
||||||
|
# Run "mix help compile.app" to learn about applications.
|
||||||
|
def application do
|
||||||
|
[
|
||||||
|
extra_applications: [:logger]
|
||||||
|
]
|
||||||
|
end
|
||||||
|
|
||||||
|
# Run "mix help deps" to learn about dependencies.
|
||||||
|
defp deps do
|
||||||
|
[
|
||||||
|
# {:dep_from_hexpm, "~> 0.3.0"},
|
||||||
|
# {:dep_from_git, git: "https://github.com/elixir-lang/my_dep.git", tag: "0.1.0"}
|
||||||
|
]
|
||||||
|
end
|
||||||
|
end
|
|
@ -0,0 +1,139 @@
|
||||||
|
defmodule ProteinTranslationTest do
|
||||||
|
use ExUnit.Case
|
||||||
|
|
||||||
|
describe "of_codon" do
|
||||||
|
test "AUG translates to methionine" do
|
||||||
|
assert ProteinTranslation.of_codon("AUG") == {:ok, "Methionine"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies Phenylalanine codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UUU") == {:ok, "Phenylalanine"}
|
||||||
|
assert ProteinTranslation.of_codon("UUC") == {:ok, "Phenylalanine"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies Leucine codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UUA") == {:ok, "Leucine"}
|
||||||
|
assert ProteinTranslation.of_codon("UUG") == {:ok, "Leucine"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies Serine codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UCU") == {:ok, "Serine"}
|
||||||
|
assert ProteinTranslation.of_codon("UCC") == {:ok, "Serine"}
|
||||||
|
assert ProteinTranslation.of_codon("UCA") == {:ok, "Serine"}
|
||||||
|
assert ProteinTranslation.of_codon("UCG") == {:ok, "Serine"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies Tyrosine codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UAU") == {:ok, "Tyrosine"}
|
||||||
|
assert ProteinTranslation.of_codon("UAC") == {:ok, "Tyrosine"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies Cysteine codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UGU") == {:ok, "Cysteine"}
|
||||||
|
assert ProteinTranslation.of_codon("UGC") == {:ok, "Cysteine"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies Tryptophan codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UGG") == {:ok, "Tryptophan"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "identifies stop codons" do
|
||||||
|
assert ProteinTranslation.of_codon("UAA") == {:ok, "STOP"}
|
||||||
|
assert ProteinTranslation.of_codon("UAG") == {:ok, "STOP"}
|
||||||
|
assert ProteinTranslation.of_codon("UGA") == {:ok, "STOP"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "incomplete codon" do
|
||||||
|
assert ProteinTranslation.of_codon("UG") == {:error, "invalid codon"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "too long, invalid codon" do
|
||||||
|
assert ProteinTranslation.of_codon("UGGG") == {:error, "invalid codon"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "known amino acids, but invalid codon" do
|
||||||
|
assert ProteinTranslation.of_codon("AAA") == {:error, "invalid codon"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "unknown amino acids, not part of a codon" do
|
||||||
|
assert ProteinTranslation.of_codon("XYZ") == {:error, "invalid codon"}
|
||||||
|
end
|
||||||
|
end
|
||||||
|
|
||||||
|
describe "of_rna" do
|
||||||
|
test "empty RNA sequence results in no proteins" do
|
||||||
|
strand = ""
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, []}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "translates rna strand into correct protein" do
|
||||||
|
strand = "AUGUUUUGG"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Methionine Phenylalanine Tryptophan)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "sequence of two identical protein codons translates into two identical proteins" do
|
||||||
|
strand = "UUUUUU"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Phenylalanine Phenylalanine)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "sequence of two different protein codons translates into two identical proteins" do
|
||||||
|
strand = "UUAUUG"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Leucine Leucine)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "stops translation if stop codon preset at beginning of sequence" do
|
||||||
|
strand = "UAGUGG"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w()}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "stops translation if stop codon present at end of two-codon sequence" do
|
||||||
|
strand = "UGGUAG"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Tryptophan)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "stops translation if stop codon present at end of three-codon sequence" do
|
||||||
|
strand = "AUGUUUUAA"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Methionine Phenylalanine)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "stops translation if stop codon present in middle of three-codon sequence" do
|
||||||
|
strand = "UGGUAGUGG"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Tryptophan)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "stops translation if stop codon present in middle of six-codon sequence" do
|
||||||
|
strand = "UGGUGUUAUUAAUGGUUU"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Tryptophan Cysteine Tyrosine)}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "incomplete codon, invalid RNA" do
|
||||||
|
strand = "UG"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:error, "invalid RNA"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "known amino acids, but invalid codon, invalid RNA" do
|
||||||
|
strand = "AAA"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:error, "invalid RNA"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "unknown amino acids, not part of a codon, invalid RNA" do
|
||||||
|
strand = "XYZ"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:error, "invalid RNA"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "invalid codon at end of RNA" do
|
||||||
|
strand = "UUUROT"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:error, "invalid RNA"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "incomplete RNA" do
|
||||||
|
strand = "AUGU"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:error, "invalid RNA"}
|
||||||
|
end
|
||||||
|
|
||||||
|
test "incomplete RNA valid until a STOP codon" do
|
||||||
|
strand = "UUCUUCUAAUGGU"
|
||||||
|
assert ProteinTranslation.of_rna(strand) == {:ok, ~w(Phenylalanine Phenylalanine)}
|
||||||
|
end
|
||||||
|
end
|
||||||
|
end
|
|
@ -0,0 +1,2 @@
|
||||||
|
ExUnit.start()
|
||||||
|
ExUnit.configure(exclude: :pending, trace: true)
|
Loading…
Reference in New Issue